https://github.com/bertsky/gt_structure_all

https://github.com/bertsky/gt_structure_all

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (6.2%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: bertsky
  • Default Branch: main
  • Size: 43.9 KB
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Fork of OCR-D/gt_structure_all
Created about 2 years ago · Last pushed over 1 year ago

https://github.com/bertsky/gt_structure_all/blob/main/

# gt_structure_all

The 'gt_structure_all' repository is a comprehensive collection that catalogues all the individual Ground Truth Structure repositories. Collectively, these repositories make up the OCR-D Ground Truth Structure corpus. This corpus exclusively contains data in page format, capturing the structural elements (segments/regions) of printed pages. It was established as part of the DFG project OCR-D.
 
 

## Data-Repositories
 - https://OCR-D.github.io/gt_structure_1_1/
 - https://OCR-D.github.io/gt_structure_1_2/
 - https://OCR-D.github.io/gt_structure_1_3/
 - https://OCR-D.github.io/gt_structure_1_4/
 - https://OCR-D.github.io/gt_structure_2_1/
 - https://OCR-D.github.io/gt_structure_2_2/
 - https://OCR-D.github.io/gt_structure_2_3/
 - https://OCR-D.github.io/gt_structure_2_4/
 - https://OCR-D.github.io/gt_structure_3_1/
 - https://OCR-D.github.io/gt_structure_3_2/
 - https://OCR-D.github.io/gt_structure_3_3/
 - https://OCR-D.github.io/gt_structure_4_1/
 - https://OCR-D.github.io/gt_structure_4_2/
 - https://OCR-D.github.io/gt_structure_4_3/
 - https://OCR-D.github.io/gt_structure_5_1/
 - https://OCR-D.github.io/gt_structure_5_2/
 - https://OCR-D.github.io/gt_structure_5_3/
---
## ![zenodo logo](https://about.zenodo.org/static/img/logos/zenodo-gradient-round.svg)

All data records are also listed in Zenodo.  And thus also have a DOI.
When changes are made and a new release is created, the data set is given a new DOI. 

Access to the OCR-D datasets in Zenodo: https://zenodo.org/communities/ocr-d/records?q=&f=subject%3Aground-truth&l=list&p=1&s=10&sort=newest

 
## Text Data

If you wish to incorporate text data into these structural datasets, please refer to the overview repository available at the following link:  https://github.com/deutschestextarchiv/gt_structure_dtaText












Owner

  • Name: Robert Sachunsky
  • Login: bertsky
  • Kind: user

GitHub Events

Total
  • Create event: 1
Last Year
  • Create event: 1