Updated 6 months ago
pypdf
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Updated 5 months ago
https://github.com/adithya-s-k/marker-api
Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.
Updated 5 months ago
https://github.com/bytedance/dolphin
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.