gt_structure_1_2

The repo gt_structure_1_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

https://github.com/ocr-d/gt_structure_1_2

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (4.9%) to scientific vocabulary

Keywords

ground-truth ocr-d page-xml repository segmentation

Repository

Basic Info
Statistics
  • Stars: 0
  • Watchers: 2
  • Forks: 1
  • Open Issues: 0
  • Releases: 62
Created over 3 years ago · Last pushed over 1 year ago
Metadata Files
Readme License Citation

README.md

gt_structure_1_2

The repo gt_structure_1_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D. To report corrections or suggest extensions, please use the repository's Issues.

Metadata

  • Language: deu
  • Format: Page-XML
  • Time: 1600-1900
  • GT Type: data_structure
  • License: CC0-1.0
  • Transcription Guidelines: OCR-D-GT-Guideline, Part: Structure Ground Truth https://ocr-d.de/en/gt-guidelines/trans/structur_gt.html
  • Project: OCR-D
  • Project-URL: https://ocr-d.de/

Sources

The volume of transcriptions:

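| TxtRegion | ImgRegion | LineDrawRegion | GraphRegion | TabRegion | ChartRegion | SepRegion | MathRegion | ChemRegion | MusicRegion | AdRegion | NoiseRegion | UnknownRegion | CustomRegion | TextLine | Page |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6454 | 1 | 0 | 153 | 28 | 0 | 513 | 75 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1336 |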
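
Counts like the ones above could be reproduced from the Page-XML files themselves. The following is a minimal sketch, not the tooling used by the project: it assumes the column headers abbreviate PAGE element names (for example `TxtRegion` for `TextRegion`, `SepRegion` for `SeparatorRegion`), and the embedded sample document, its namespace version, and the function name `count_regions` are made up for illustration.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical, minimal PAGE-XML document used only for illustration.
# The namespace version is an assumption; the counting below ignores it.
SAMPLE_PAGE_XML = """
<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15">
  <Page imageFilename="example.tif" imageWidth="1000" imageHeight="1500">
    <TextRegion id="r1"><Coords points="0,0 10,0 10,10 0,10"/></TextRegion>
    <TextRegion id="r2"><Coords points="0,20 10,20 10,30 0,30"/></TextRegion>
    <SeparatorRegion id="r3"><Coords points="0,40 10,40 10,41 0,41"/></SeparatorRegion>
  </Page>
</PcGts>
""".strip()


def count_regions(page_xml: str) -> Counter:
    """Count Page, TextLine and *Region elements in one PAGE-XML document."""
    root = ET.fromstring(page_xml)
    counts = Counter()
    for element in root.iter():
        # Strip the "{namespace}" prefix so any PAGE schema version works.
        local_name = element.tag.rsplit("}", 1)[-1]
        if local_name.endswith("Region") or local_name in ("TextLine", "Page"):
            counts[local_name] += 1
    return counts


if __name__ == "__main__":
    for name, number in sorted(count_regions(SAMPLE_PAGE_XML).items()):
        print(f"{name}: {number}")
```

Summing the per-file counters over all files of a document would give one row of the list below; summing over all documents would give the totals.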

List of transcriptions

document TxtRegion ImgRegion LineDrawRegion GraphRegion TabRegion ChartRegion SepRegion MathRegion ChemRegion MusicRegion AdRegion NoiseRegion UnknownRegion CustomRegion TextLine Page
brockes_vergnuegen05_1736 142 2 28 28
boelsche_liebesleben03_1903 166 1 38
braeuner_pest_1714 167 3 19 29
beseler_volksrecht_1843 74 20
brentano_gockel_1838 52 20
bodmer_sammlung03_1742 115 1 3 22
bodmer_sammlung12_1744 120 1 2 24
boerne_paris01_1832 51 21 20
berg_ostasien04_1873 84 10 20
bodmer_sammlung04_1742 157 1 1 29
bohse_helicon_1696 127 7 22 25
berg_ostasienbotanik_1866 75 10 1 20
bodmer_sammlung08_1743 183 1 3 22
blumenbach_naturgeschichte01_1779 117 1 5 26
bodmer_sammlung07_1743 118 1 6 21
boerne_paris06_1834 52 20 20
bodmer_sammlung01_1741 127 3 2 26
blumenbach_anatomie_1805 141 2 2 20
brockes_vergnuegen06_1740 122 4 26 27
boltzmann_gastheorie02_1898 169 3 10 48 20
brandt_taubmanns_1675 67 2 1 21
brentano_kasperl_1838 67 20
boelsche_liebesleben01_1898 167 35
brockes_vergnuegen03_1730 115 9 24 23
birken_friedensvergleich_1652 58 1 1 21
boerne_paris02_1832 55 20 20
berg_ostasien02_1866 98 1 2 20
berg_ostasien03_1873 92 1 6 20
bismarck_erinnerungen02_1898 75 2 20
blumenbach_menschengeschlecht_1798 87 20
bluntschli_voelkerrecht_1868 105 16 20
blum_spatziergaenge02_1775 66 24 3 26
berg_ostasien01_1864 100 5 20
boelsche_liebesleben02_1900 99 20
berg_ostasienzoologie01_1876 72 9 1 20
blum_spatziergaenge01_1774 71 24 7 26
boerne_paris03_1833 58 20 20
bodmer_sammlung10_1743 198 7 2 28
bodmer_sammlung02_1741 157 1 7 27
berg_ostasienzoologie02_1867 97 6 4 20
bodmer_sammlung06_1742 135 1 2 24
bose_electricitaet_1744 112 6 8 24
bodmer_sammlung09_1743 277 1 3 29
bodmer_sammlung11_1743 114 2 3 22
boerne_paris04_1833 57 20 20
bierbaum_stilpe_1897 132 20 20
brockes_vergnuegen02_1727 121 5 29 29
birken_gespraechspiel_1665 119 2 21 22
boerne_paris05_1834 58 20 20
bismarck_erinnerungen01_1898 85 20
boehmemi_viehartzney_1712 188 2 1 21
bodmer_sammlung05_1742 168 1 26
berlepsch_alpen_1861 50 16 20
boltzmann_gastheorie01_1896 142 2 5 27 20
bohse_helicon01_1703 143 3 22 28
bernd_lebensbeschreibung_1738 105 2 24 25
birken_sonntagswandel_1681 116 3 23 25
brockes_vergnuegen04_1735 69 5 15 17
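
The document identifiers in the list above appear to end in a four-digit year. Assuming that this suffix is the publication year (the README does not state it), a small helper can split an identifier and check it against the period given in the metadata. The names `split_document_id` and `in_period` are hypothetical.

```python
import re

# Assumption: identifiers have the form "<work>_<yyyy>", e.g. "bohse_helicon_1696".
ID_PATTERN = re.compile(r"^(?P<work>.+)_(?P<year>\d{4})$")


def split_document_id(document_id: str) -> tuple[str, int]:
    """Split an identifier into its work part and its trailing year."""
    match = ID_PATTERN.match(document_id)
    if match is None:
        raise ValueError(f"unexpected identifier format: {document_id!r}")
    return match.group("work"), int(match.group("year"))


def in_period(document_id: str, start: int = 1600, end: int = 1900) -> bool:
    """Check whether the trailing year lies within the period, bounds included."""
    _, year = split_document_id(document_id)
    return start <= year <= end


if __name__ == "__main__":
    for doc in ("brockes_vergnuegen05_1736", "boelsche_liebesleben03_1903"):
        print(doc, split_document_id(doc), in_period(doc))
```

Under this reading, an identifier such as boelsche_liebesleben03_1903 falls just outside the stated 1600-1900 range, so the period in the metadata is best treated as approximate.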

Extent

In this section you can insert additional information, instructions or notes.

Owner

  • Name: OCR-D
  • Login: OCR-D
  • Kind: organization

DFG coordination project for the further development of Optical Character Recognition methods

GitHub Events

Total
  • Fork event: 1
Last Year
  • Fork event: 1

Issues and Pull Requests

Last synced: 9 months ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0