An on-premises, OCR-free unstructured data extraction and benchmarking toolkit. (https://idp-leaderboard.org/)
nlp machine-learning ocr extraction document onprem document-analysis table-extraction unstructured-data rag onpremise llms vlms document-information-extraction ocr-onpremise document-data-extraction onprem-vision onprem-ocr llm-ocr ocr-benchmark
-
Updated
May 17, 2025 - Python