gt_structure_text_test
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (4.8%) to scientific vocabulary
Keywords from Contributors
interpretability
standardization
hack
Last synced: 10 months ago
·
JSON representation
Repository
Basic Info
- Host: GitHub
- Owner: tboenig
- License: cc-by-sa-4.0
- Default Branch: main
- Size: 36.5 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 18
Created about 2 years ago
· Last pushed about 2 years ago
Metadata Files
Readme
License
Citation
README.md
gt_structure_text_test
The OCR-D Ground Truth text and structure corpus was created between 2015 -2017. In the years since 2017, this corpus has been further curated and supplemented with metadata where appropriate. The corpus includes page XML files within annotations of the text and structure include. The data is based on transcription data stored in the German Text Archive (DTA) (https://www.deutschestextarchiv.de/).
Metadata
- Language:
- eng, fra, deu, heb, lat
- Format:
- Page-XML
- Time:
- 1500-1900
- GT Type:
- data_structure_and_text
- License:
- CC-BY-SA-4.0
- Transcription Guidelines:
- OCR-D Ground Truth Guidelines https://ocr-d.de/en/gt-guidelines/trans/
- Project:
- OCR-D
- Project-URL:
- https://ocr-d.de/
Sources
The volume of transcriptions:
| TextLine | Page | TxtRegion | GraphRegion |
|---|---|---|---|
| 101 | 4 | 20 | 3 |
List of transcriptions
| document | TxtRegion | ImgRegion | LineDrawRegion | GraphRegion | TabRegion | ChartRegion | SepRegion | MathRegion | ChemRegion | MusicRegion | AdRegion | NoiseRegion | UnknownRegion | CustomRegion | TextLine | Page |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| aepinus_bekentnis_1548 | 20 | 3 | 101 | 4 |
Extent
In this section they can insert additional information, instructions or notes.
Owner
- Name: Matthias Boenig
- Login: tboenig
- Kind: user
- Repositories: 13
- Profile: https://github.com/tboenig
GitHub Events
Total
Last Year
Committers
Last synced: about 1 year ago
Top Committers
| Name | Commits | |
|---|---|---|
| Matthias Boenig | m****g@g****t | 21 |
| github-actions[bot] | 4****] | 16 |
Committer Domains (Top 20 + Academic)
gmx.net: 1
Issues and Pull Requests
Last synced: about 1 year ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Dependencies
.github/workflows/gtrepo.yml
actions
- JamesIves/github-pages-deploy-action v4 composite
- actions/checkout v4 composite
- mikefarah/yq master composite
- ncipollo/release-action v1 composite
- thedoctor0/zip-release master composite