Updated 9 months ago

ocr-fileformat • Rank 7.8 • Science 62%

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

Updated 9 months ago

gt_structure_1_3 • Science 26%

The repo gt_structure_1_3 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

Updated 9 months ago

ocr17plus • Science 26%

Data for layout analysis and HTR.

Updated 9 months ago

cataloguessegmentationocr • Science 26%

Dataset and models for catalogs' Layout analysis and HTR

Updated 9 months ago

gt-repo-scripts • Science 44%

XSLT and shell scripts for analyzing and creating GitHub pages of a ground truth repository. These are centrally managed and can be used by all repositories created with gt-repo-template (https://github.com/OCR-D/gt-repo-template).

Updated 9 months ago

gt_structure_1_2 • Science 26%

The repo gt_structure_1_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

Updated 9 months ago

gt_structure_1_4 • Science 26%

About The repo gt_structure_1_4 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

Updated 9 months ago

german-brazilian-newspapers-dataset_2 • Science 44%

The GBN Dataset consists German-Brazilian historical newspapers, along with their digital and binarized images and ground truth files.

Updated 9 months ago

gt_structure_1_1 • Science 26%

The repo gt_structure_1_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

Updated 9 months ago

german-brazilian-newspapers-dataset_1 • Science 26%

The GBN Dataset consists German-Brazilian historical newspapers, along with their digital and binarized images and ground truth files.