Updated 6 months ago

pypdf • Rank 35.6 • Science 36%

A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

Updated 6 months ago

vision-parse • Rank 14.9 • Science 44%

Parse PDFs into Markdown using Vision LLMs

Updated 5 months ago

https://github.com/adithya-s-k/marker-api • Rank 8.2 • Science 23%

Easily deployable 🚀 API to convert PDF to Markdown quickly with high accuracy.

Updated 5 months ago

https://github.com/bytedance/dolphin • Science 36%

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.