Scientific Software
Updated 6 months ago

daiR — Peer-reviewed • Rank 10.9 • Science 93%

daiR: an R package for OCR with Google Document AI - Published in JOSS (2021)

Scientific Software · Peer-reviewed
Updated 6 months ago

tesseract-ocr • Rank 16.5 • Science 54%

Tesseract Open Source OCR Engine (main repository)

Updated 6 months ago

ocr-fileformat • Rank 7.8 • Science 62%

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

Updated 6 months ago

imgocr • Rank 9.0 • Science 44%

Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源SOTA,推理速度超快。

Updated 6 months ago

taulu • Rank 8.4 • Science 44%

Taulu is a Python package designed to segment tabular data in scanned or photographed documents.

Updated 6 months ago

gt-mufilevelrules • Rank 1.8 • Science 44%

OCR-D-Level-Rules can be created automatically with gt-MufiLevelRules from the encodings published by MUFI: The Medieval Unicode Font Initiative.

Updated 5 months ago

https://github.com/ai-forever/ocr-model • Rank 4.5 • Science 36%

An easy-to-run OCR model pipeline based on CRNN and CTC loss

Updated 6 months ago

apple-ocr • Rank 10.2 • Science 26%

Easy-to-Use Apple Vision wrapper for text extraction, scalar representation and clustering using K-means.

Updated 6 months ago

tarsier • Rank 20.1 • Science 13%

Vision utilities for web interaction agents 👀

Updated 6 months ago

docutron • Rank 3.2 • Science 26%

Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.

Updated 5 months ago

https://github.com/adithya-s-k/omniparse • Rank 10.4 • Science 13%

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Updated 5 months ago

https://github.com/amirabbasasadi/persianocr • Rank 3.0 • Science 10%

Simple word-level OCR program for the Persian language based on Recurrent Neural Networks using Pytorch and OpenCV

Updated 6 months ago

dtrocr • Science 57%

A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition

Updated 6 months ago

keyboardgt • Science 44%

Offer of different keyboards for transcription software (Aletheia, Transkribus, LAREX, QURATOR-neat, eScriptorium)

Updated 6 months ago

ocr17plus • Science 26%

Data for layout analysis and HTR.

Updated 6 months ago

cremma-mss-16 • Science 44%

CREMMA HTR GT for 16th century MSS

Updated 6 months ago

lines_merge_ocr • Science 67%

This script merges lines after extracting plain text from a pdf that produces OCR

Updated 6 months ago

reichsanzeiger-gt • Science 57%

Ground truth for German newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–1945)

Updated 6 months ago

german-brazilian-newspapers-dataset_1 • Science 26%

The GBN Dataset consists German-Brazilian historical newspapers, along with their digital and binarized images and ground truth files.

Updated 5 months ago

https://github.com/bytedance/dolphin • Science 36%

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Updated 6 months ago

bachelors-thesis • Science 31%

Bachelor's thesis on Automatic License Plate Recognition.

Updated 6 months ago

wordart • Science 54%

The official code of CornerTransformer (ECCV 2022, Oral) on top of MMOCR.

Updated 6 months ago

visarchpy • Science 44%

pipelines for the extraction and processing of visuals from PDFs

Updated 6 months ago

hemdig-framework • Science 49%

HEMDIG(pt) framework: Métodos, ferramentas e hemerotecas digitais em português. Pesquisa de pós-doutorado.

Updated 5 months ago

https://github.com/ai-forever/readingpipeline • Science 26%

Text reading pipeline that combines segmentation and OCR-models.

Updated 6 months ago

german-brazilian-newspapers-dataset_2 • Science 44%

The GBN Dataset consists German-Brazilian historical newspapers, along with their digital and binarized images and ground truth files.

Updated 5 months ago

https://github.com/awslabs/aws-ai-solution-kit • Science 26%

Machine Learning APIs for common use cases, include: General OCR (Simplified/Traditional Chinese), Custom OCR, Image Similarity, Object Recognition, Face Detection, Face Comparison, Human Image Segmentation, Human Attribute Recognition, Pornography Detection, Image Super Resolution, Text Similarity, Car License Plate, etc.

Updated 5 months ago

https://github.com/bytedance/sptsv2 • Science 36%

The official implementation of SPTS v2: Single-Point Text Spotting

Updated 5 months ago

https://github.com/cemenenkoff/runedark-public • Science 13%

Runedark is a color-based automation framework (for OSRS).

Updated 6 months ago

cremma-medieval-lat • Science 36%

CREMMA HTR GT for medieval latin manuscripts

Updated 6 months ago

cremma-medieval • Science 49%

Transcription corpora for training HTR models for medieval manuscripts from the 12th to the 15th century.

Updated 6 months ago

cataloguessegmentationocr • Science 26%

Dataset and models for catalogs' Layout analysis and HTR