-
marker
Convert PDF to markdown + JSON quickly with high accuracy
Python ★ 36k 14d agoExplain → -
surya
OCR, layout analysis, reading order, table recognition in 90+ languages
Python ★ 21k 8d agoExplain → -
chandra
OCR model that handles complex tables, forms, handwriting with full layout.
Python ★ 11k 1mo agoExplain → -
pdftext
Extract structured text from pdfs quickly
Python ★ 700 10d agoExplain → -
lift
Extract structured data from documents quickly and accurately.
Python ★ 294 1d agoExplain → -
sdk
No description.
Python ★ 11 5d agoExplain → -
docext
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
★ 11 1y agoExplain → -
datalab-on-prem
Scripts to run Datalab's self-service on-prem container
Shell ★ 9 9d agoExplain → -
inference-mirror
No description.
Python ★ 4 10mo agoExplain → -
pykatex
No description.
Python ★ 3 4mo agoExplain → -
results
No description.
HTML ★ 2 2mo agoExplain → -
oss_container
No description.
Python ★ 1 8mo agoExplain →
No repos match these filters.