2024--carolingian-latin
Ground truth transcriptions of ÖNB Cod. 940, featuring an Irish commentary on the Gospel of Matthew, developed during the 2024 HTR Winter School using Transkribus.
https://github.com/htr-school-vienna/2024--carolingian-latin
Science Score: 26.0%
This score indicates how likely this project is to be science-related, based on the following indicators:
- ○ CITATION.cff file
- ✓ codemeta.json file: found
- ○ .zenodo.json file
- ✓ DOI references: found 1 DOI reference in README
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (8.3%) to scientific vocabulary
Repository
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.MD
HTR Winter School 2024 - Carolingian Latin ÖNB 940
Ground truth for part of ÖNB Cod. 940 (ff. 13r-142v): pages automatically read with Transkribus
ATTENTION: To clone this repo, you need to have Git LFS installed. Then clone the repository like this:
git lfs clone git@github.com:htr-school-vienna/2024--carolingian-latin.git
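As an alternative for users without an SSH key, the following is a minimal sketch of the same step over HTTPS. It assumes the HTTPS remote is the repository URL given above with `.git` appended. The variable `REPO_URL` and the function `clone_with_lfs` are illustrative names, not part of this repository. Recent Git LFS releases document `git lfs clone` as deprecated in favour of plain `git clone`, so the sketch uses the latter.

```shell
# Assumption: HTTPS remote derived from the repository URL listed above.
REPO_URL="https://github.com/htr-school-vienna/2024--carolingian-latin.git"

# Hypothetical helper: clone the dataset only if Git LFS is available.
clone_with_lfs() {
    # Stop early if the Git LFS extension is missing.
    if ! git lfs version >/dev/null 2>&1; then
        echo "Git LFS is not installed; see https://git-lfs.com" >&2
        return 1
    fi
    git lfs install          # register the LFS filters (once per user account)
    git clone "$REPO_URL"    # LFS-tracked files are downloaded during checkout
}
```

Calling `clone_with_lfs` in the target directory performs the clone. Running `git lfs ls-files` inside the new working copy afterwards lists the files that Git LFS manages.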
Description
- An anonymous Irish commentary on the Gospel of Matthew (late 8th/early 9th c.), preserved in a manuscript from Salzburg/Saint Amand, possibly produced under Bishop Arn (785-821).
- Files: 2
- Lines: 7835
- Latin, Carolingian Minuscule (9th century)
Origin of the data:
- source of images: Austrian National Library (ÖNB Cod. 940: https://manuscripta.at/hs_detail.php?ID=10262)
- Description or citation of transcription guidelines:
- We used the transcription guidelines prepared by Tim Geelhaar for his public Transkribus model "Latin - Carolingian Minuscule" (https://readcoop.eu/de/modelle/latin-carolingian-minuscule/). These will be published in a forthcoming book: Tim Geelhaar, Tobias Hodel, Jan Odstrcilik, and Michael Schonhardt, “Vademecum of ATR.”
Data organisation
- A single folder contains the images and transcriptions of Cod. 940.
How to cite
This dataset was created by Cinzia Grifoni, Leon Pürstinger, Fabio Mantegazza, Elena De Luca, Eddie Meehan, Daniela Schulz, Till Stüber, Elena Alexa Riepl, Camilla Bertoletti, Kendall M. Bitner, Leonardo Napoletano, Martina Carandino, Silvio Lorenzo Ruberto, Nathalie Pfeuffer, Luca Abelli, Anne Sieberichs, Barbora Kulhová, Gerda Heydemann, Ulrike Steurer, Jesper van der Most, Xinyue Xu and Lode Moens. The digitisation is not copyright free, but the transcription is. However, properly annotating a corpus takes time and is a task that should be recognised. If you use any item from this corpus as ground truth, please cite the dataset using the following information:
The BibTeX citation is available from Zenodo, DOI 10.5281/zenodo.10589561
Copyright and licence
This dataset was created as part of the Winter School of Handwritten Text Recognition of Medieval Manuscripts 2024, held in Vienna at the Österreichische Akademie der Wissenschaften, Institut für Mittelalterforschung. All transcriptions are licensed under a Creative Commons 4 licence. Images were provided by the Austrian National Library (ÖNB) and are licensed under a Creative Commons 4 licence.
Owner
- Name: HTR School Vienna
- Login: HTR-School-Vienna
- Kind: organization
- Repositories: 1
- Profile: https://github.com/HTR-School-Vienna
GitHub Events
Total
- Push event: 3
Last Year
- Push event: 3