Projects | Open Source Science

Scientific Software

Updated 10 months ago

daiR — Peer-reviewed • Rank 10.9 • Science 93%

daiR: an R package for OCR with Google Document AI - Published in JOSS (2021)

google-cloud ocr r

Scientific Software · Peer-reviewed

Updated 10 months ago

mmocr • Rank 22.0 • Science 54%

OpenMMLab Text Detection, Recognition and Understanding Toolbox

abcnet abinet crnn dbnet deep-learning fcenet key-information-extraction maskrcnn ocr pan panet psenet pytorch sar sdmg-r segmentation-based-text-recognition spts svtr text-detection text-recognition

Updated 10 months ago

tesseract-ocr • Rank 16.5 • Science 54%

Tesseract Open Source OCR Engine (main repository)

hacktoberfest lstm machine-learning ocr ocr-engine tesseract tesseract-ocr

Updated 10 months ago

ocr-fileformat • Rank 7.8 • Science 62%

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

alto finereader hocr ocr ocr-d page-xml transformation validation

Updated 10 months ago

quipucamayoc • Rank 7.6 • Science 54%

dev repo for article

ocr ocr-post-processing ocr-python poppler table-extraction table-ocr textract

Updated 10 months ago

https://github.com/aim-uofa/adelaidet • Rank 11.7 • Science 46%

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

abcnet adelaidet blendmask boxinst condinst densecl fcos instance-segmentation meinst object-detection ocr solo solov2 text-detection text-recognition

Updated 10 months ago

imgocr • Rank 9.0 • Science 44%

Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理，中英文OCR开源SOTA，推理速度超快。

chinese-ocr ocr ocr-python

Updated 10 months ago

taulu • Rank 8.4 • Science 44%

Taulu is a Python package designed to segment tabular data in scanned or photographed documents.

data-extraction historic-documents htr ocr segmentation tabular-data

Updated 10 months ago

handprint • Rank 11.8 • Science 36%

Apply different text recognition services to images of handwritten documents.

amazon-rekognition amazon-textract google-api google-cloud google-vision-api handwritten-text-recognition htr library-automation machine-learning microsoft-azure ocr optical-character-recognition python

Updated 10 months ago

gt-mufilevelrules • Rank 1.8 • Science 44%

OCR-D-Level-Rules can be created automatically with gt-MufiLevelRules from the encodings published by MUFI: The Medieval Unicode Font Initiative.

ground-truth guidelines ocr ocr-d transcription

Updated 10 months ago

autocorrect • Rank 9.1 • Science 36%

Spelling corrector in python

autocorrect autocorrection czech english languages levenshtein-distance multilanguage multilingual nlp ocr polish portuguese python russian spanish spellchecker spelling spelling-corrector turkish ukrainian

Updated 10 months ago

https://github.com/amazon-science/semimtr-text-recognition • Rank 5.5 • Science 36%

Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)

computer-vision consistency-regularization contrastive-learning deep-learning ocr pytorch scene-text-recognition self-supervised-learning semi-supervised-learning text-recognition

Updated 10 months ago

https://github.com/ai-forever/ocr-model • Rank 4.5 • Science 36%

An easy-to-run OCR model pipeline based on CRNN and CTC loss

crnn ocr pytorch text-recognition

Updated 10 months ago

apple-ocr • Rank 10.2 • Science 26%

Easy-to-Use Apple Vision wrapper for text extraction, scalar representation and clustering using K-means.

apple clustering kmeans nlp ocr ocr-recognition pyobjc python scatter-plot sklearn

Updated 10 months ago

tarsier • Rank 20.1 • Science 13%

Vision utilities for web interaction agents 👀

gpt4v llms ocr playwright pypi-package python selenium webscraping

Materials Science (40%)

Updated 10 months ago

docutron • Rank 3.2 • Science 26%

Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.

cv2 detecron2 detection document legal legaltech legaltools llm machine-learning nlp ocr ocr-recognition preprocessing

Updated 10 months ago

https://github.com/adithya-s-k/omniparse • Rank 10.4 • Science 13%

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

ingestion-api ocr omniparser parse-server parser-library vision-transformer web-crawler whisper-api

Updated 10 months ago

https://github.com/amirabbasasadi/persianocr • Rank 3.0 • Science 10%

Simple word-level OCR program for the Persian language based on Recurrent Neural Networks using Pytorch and OpenCV

image-processing machine-learning ocr ocr-recognition opencv python pytorch

Updated 10 months ago

https://github.com/bytedance/sptsv2 • Science 36%

The official implementation of SPTS v2: Single-Point Text Spotting

artificial-intelligence computer-vision deep-learning ocr research

Updated 10 months ago

https://github.com/amazon-science/textadain-robust-recognition • Science 10%

TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers

deep-learning handwriting-recognition ocr pytorch regularization scene-text-recognition shortcut-learning text-recognition

Updated 10 months ago

https://github.com/cemenenkoff/runedark-public • Science 13%

Runedark is a color-based automation framework (for OSRS).

automation computer-vision ocr old-school-runescape

Updated 10 months ago

dtrocr • Science 57%

A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition

decoder dtrocr ocr pytorch recognition text-recognition transformer

Updated 10 months ago

cremma-mss-16 • Science 44%

CREMMA HTR GT for 16th century MSS

das ocr

Updated 10 months ago

wordart • Science 54%

The official code of CornerTransformer (ECCV 2022, Oral) on top of MMOCR.

artistic-text-recognition contrastive-learning corner dataset ocr text-recognition transformer

Updated 10 months ago

visarchpy • Science 44%

pipelines for the extraction and processing of visuals from PDFs

ai computer-vision data-pipelines ocr pdf tu-delft

Updated 10 months ago

https://github.com/caltechlibrary/htr-test-cases • Science 26%

Images of documents for testing HTR.

handwritten-text-recognition htr machine-learning ocr text-recognition

Updated 10 months ago

cremma-medieval-lat • Science 36%

CREMMA HTR GT for medieval latin manuscripts

dataset ground-truth htr latin ocr

Updated 10 months ago

bachelors-thesis • Science 31%

Bachelor's thesis on Automatic License Plate Recognition.

alpr anpr license-plate-detection license-plate-recognition ocr text-detection text-recognition

Updated 10 months ago

keyboardgt • Science 44%

Offer of different keyboards for transcription software (Aletheia, Transkribus, LAREX, QURATOR-neat, eScriptorium)

aletheia escriptorium ground-truth keyboard-layout larex ocr qurator-neat transcription transkribus

Updated 10 months ago

cremma-16-17-print • Science 39%

dataset ground-truth htr ocr

Updated 10 months ago

kraken • Science 67%

OCR engine for all the languages

alto-xml handwritten-text-recognition hocr htr layout-analysis neural-networks ocr optical-character-recognition page-xml

Updated 10 months ago

https://github.com/ai-forever/readingpipeline • Science 26%

Text reading pipeline that combines segmentation and OCR-models.

object-detection ocr pytorch segmentation text-recognition

Updated 10 months ago

german-brazilian-newspapers-dataset_2 • Science 44%

The GBN Dataset consists German-Brazilian historical newspapers, along with their digital and binarized images and ground truth files.

ground-truth newspaper ocr page-xml training

Updated 10 months ago

https://github.com/awslabs/aws-ai-solution-kit • Science 26%

Machine Learning APIs for common use cases, include: General OCR (Simplified/Traditional Chinese), Custom OCR, Image Similarity, Object Recognition, Face Detection, Face Comparison, Human Image Segmentation, Human Attribute Recognition, Pornography Detection, Image Super Resolution, Text Similarity, Car License Plate, etc.

car-license-plate-recognition chinese-ocr deep-learning face-recognition human-segmentation image-similarity machine-learning ocr ocr-recognition optical-character-recognition simplified-chinese super-resolution text-similarity traditional-chinese

Updated 10 months ago

cataloguessegmentationocr • Science 26%

Dataset and models for catalogs' Layout analysis and HTR

alto-xml catalog htr ocr page-xml segmentation segmenter

Updated 10 months ago

lines_merge_ocr • Science 67%

This script merges lines after extracting plain text from a pdf that produces OCR

ocr pdf

Updated 10 months ago

reichsanzeiger-gt • Science 57%

Ground truth for German newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–1945)

fraktur ground-truth latin-language newspaper ocr ocr-d-level-2

Updated 10 months ago

german-brazilian-newspapers-dataset_1 • Science 26%

The GBN Dataset consists German-Brazilian historical newspapers, along with their digital and binarized images and ground truth files.

ground-truth newspaper ocr page-xml training

Updated 10 months ago

langchain-chatbot • Science 44%

AI Chatbot for analyzing/extracting information from data in conversational format.

ai artificial-intelligence bot chromadb discord discord-bot embeddings extractive-question-answering gpt-3 gpt-4 langchain ocr openai openai-api openai-api-chatbot pdf pdf-chat-bot pdf-ocr pinecone vector-database

Updated 10 months ago

uni-vision • Science 44%

Python library for handling most vision tasks .\

3d-reconstruction classification depth-estimation detection foundation-models gan instance-segmentation ocr pose-estimation segmentation superres tracking vision-rag vlm

Updated 10 months ago

https://github.com/bytedance/dolphin • Science 36%

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

document-analysis layout-analysis ocr parser pdf pdf-converter pdf-parser python vlm-ocr

Updated 10 months ago

hemdig-framework • Science 49%

HEMDIG(pt) framework: Métodos, ferramentas e hemerotecas digitais em português. Pesquisa de pós-doutorado.

digital-history digital-humanities methodology newspaper ocr

Updated 10 months ago

ocr17plus • Science 26%

Data for layout analysis and HTR.

alto-xml dataset htr ocr page-xml png segmentation segmenter xml

Updated 10 months ago

cremma-medieval • Science 49%

Transcription corpora for training HTR models for medieval manuscripts from the 12th to the 15th century.

dataset ground-truth htr ocr