gt_structure_1_1

The repo gt_structure_1_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

https://github.com/ocr-d/gt_structure_1_1

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (4.9%) to scientific vocabulary

Keywords

ground-truth ocr-d page-xml repository segmentation
Last synced: 6 months ago · JSON representation

Repository

The repo gt_structure_1_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

Basic Info
Statistics
  • Stars: 0
  • Watchers: 2
  • Forks: 1
  • Open Issues: 1
  • Releases: 23
Topics
ground-truth ocr-d page-xml repository segmentation
Created almost 2 years ago · Last pushed over 1 year ago
Metadata Files
Readme License Citation

README.md

gt_structure_1_1

The repo gt_structure_1_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D. Corrections and extensions can be reported, please use the Issues.

Metadata

Language:
deu
Format:
Page-XML
Time:
1600-1920
GT Type:
data_structure
License:
CC0-1.0
Transcription Guidelines:
OCR-D-GT-Guideline, Part: Structure Ground Truth https://ocr-d.de/en/gt-guidelines/trans/structur_gt.html
Project:
OCR-D
Project-URL:
https://ocr-d.de/

Sources

The volume of transcriptions:

TxtRegion ImgRegion LineDrawRegion GraphRegion TabRegion ChartRegion SepRegion MathRegion ChemRegion MusicRegion AdRegion NoiseRegion UnknownRegion CustomRegion TextLine Page
8008 0 0 106 10 0 1327 0 0 6 0 0 0 0 0 1317

List of transcriptions

document TxtRegion ImgRegion LineDrawRegion GraphRegion TabRegion ChartRegion SepRegion MathRegion ChemRegion MusicRegion AdRegion NoiseRegion UnknownRegion CustomRegion TextLine Page
arnimb_guenderode01_1840 47 20
arnimb_goethe01_1835 77 3 20
abel_leibmedicus_1699 244 9 26 28
beer_antonius_1697 190 3 5 20
beer_lebensbeschreibung_1680 101 2 2 20
bengel_abriss01_1751 119 2 22 22
arndt_christentum03_1610 182 4 182 26
benner_herrnhuterey04_1748 151 2 7 26
alexis_ruhe04_1852 126 20 20
arnold_cyprian_1700 87 1 1 25
beck_eisen05_1903 109 2 18 20
becher_psychosophia_1683 140 4 29
arndt_christentum01_1610 159 4 156 25
altmann_elementarorganismen_1890 115 1 20
arnold_ketzerhistorie01_1699 163 2 3 27
albertinus_landtstoertzer01_1615 157 5 1 24
alexis_ruhe03_1852 131 20 20
anhaltkoethen_fruchtbringende_1628 287 2 1 23
beier_buchhandel_1690 100 2 21 23
beck_eisen02_1895 104 5 88 20
baumstark_encyclopaedie_1835 52 1 2 8
becher_discurs_1668 101 2 25
arnima_invalide_1818 46 20
andreas_fenitschka_1898 165 20
arnimb_goethe02_1835 110 2 20
barclay_argenis_1626 173 2 3 25
bauer_buehnenleben_1871 118 20
alexis_ruhe01_1852 127 20 20
benner_herrnhuterey02_1747 130 1 11 25
achenwall_staatswissenschaft_1749 143 4 2 23
arnimb_guenderode02_1840 56 20
benner_herrnhuterey01_1746 107 2 9 23
beck_eisen04_1899 101 14 20
benner_herrnhuterey03_1748 148 3 22 26
abschatz_gedichte_1704 236 2 42 30
bach_versuch01_1759 202 6 3 25
arent_dichtercharaktere_1885 206 4 31 25
bauller_lasterspiegel_1681 130 1 24 27
bastian_voelkergedanke_1881 99 3 20
beckmann_technologie_1777 128 1 5 30
arndt_christentum02_1610 172 4 175 28
becher_narrheit_1682 74 1 3 21
basedow_weisheit_1768 109 2 5 22
beck_eisen01_1884 186 3 12 20
alexis_ruhe02_1852 132 20 20
basedow_philanthropinum_1774 112 2 24 24
alexis_ruhe05_1852 109 20 20
behrens_hercynia_1703 143 6 22 27
arndt_christentum04_1610 190 3 158 24
arnimb_goethe03_1835 95 17 19
arnold_ketzerhistorie02_1700 249 6 36 29
anthus_esskunst_1838 117 21 20
bach_versuch02_1762 160 6 6 6 23
bebel_frau_1879 100 3 20
beer_nero_1685 247 2 2 21
beck_eisen03_1897 119 19 20
arnim_wunderhorn03_1808 84 2 14 20
becke_soldaten_1605 243 1 29

Extent

Owner

  • Name: OCR-D
  • Login: OCR-D
  • Kind: organization

DFG-Koordinierungsprojekt zur Weiterentwicklung von Verfahren der Optical Character Recognition

GitHub Events

Total
  • Pull request event: 1
  • Fork event: 1
Last Year
  • Pull request event: 1
  • Fork event: 1

Issues and Pull Requests

Last synced: 9 months ago

All Time
  • Total issues: 0
  • Total pull requests: 2
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 2
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • bertsky (2)
  • stweil (1)
Top Labels
Issue Labels
Pull Request Labels

Dependencies

.github/workflows/gtrepo.yml actions
  • JamesIves/github-pages-deploy-action v4 composite
  • actions/checkout v4 composite
  • mikefarah/yq master composite
  • ncipollo/release-action v1 composite
  • thedoctor0/zip-release master composite