Updated 6 months ago

ocr-fileformat • Rank 7.8 • Science 62%

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

Updated 5 months ago

https://github.com/bertsky/ocrd_detectron2 • Rank 9.0 • Science 46%

OCR-D wrapper for detectron2 based segmentation models

Updated 6 months ago

gt-mufilevelrules • Rank 1.8 • Science 44%

OCR-D-Level-Rules can be created automatically with gt-MufiLevelRules from the encodings published by MUFI: The Medieval Unicode Font Initiative.

Updated 5 months ago

https://github.com/bertsky/ocrd_wrap • Rank 7.0 • Science 36%

OCR-D wrapper for arbitrary coords-preserving image operations

Updated 5 months ago

https://github.com/bertsky/nmalign • Rank 6.5 • Science 36%

forced alignment of lists of string by fuzzy string matching

Updated 5 months ago

https://github.com/bertsky/ocrd_doxa • Rank 4.2 • Science 36%

OCR-D wrapper for DoxaPy image binarization via locally adaptive thresholding

Updated 5 months ago

https://github.com/bertsky/workflow-configuration • Rank 4.2 • Science 36%

a makefilization for OCR-D workflows, with configuration examples

Updated 5 months ago

https://github.com/bertsky/docstruct • Rank 2.5 • Science 36%

Document structure detection from PAGE-XML to METS-XML

Updated 5 months ago

https://github.com/bertsky/ocrd_publaynet • Rank 5.1 • Science 10%

convert PubLayNet data into METS/PAGE-XML

Updated 6 months ago

gt_structure_1_2 • Science 26%

The repo gt_structure_1_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

Updated 5 months ago

https://github.com/bertsky/ocrd_origami • Science 23%

OCR-D wrapper for poke1024/origami OLR+OCR

Updated 6 months ago

gt-repo-scripts • Science 44%

XSLT and shell scripts for analyzing and creating GitHub pages of a ground truth repository. These are centrally managed and can be used by all repositories created with gt-repo-template (https://github.com/OCR-D/gt-repo-template).

Updated 6 months ago

gt_structure_1_4 • Science 26%

About The repo gt_structure_1_4 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

Updated 5 months ago

https://github.com/bertsky/ocrd_jdeskew • Science 23%

OCR-D wrapper for Document Image Skew Estimation using Adaptive Radial Projection

Updated 6 months ago

gt_structure_1_1 • Science 26%

The repo gt_structure_1_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

Updated 6 months ago

gt_structure_1_3 • Science 26%

The repo gt_structure_1_3 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.