ulb-groundtruth-eval-odem-other

OCR Groundtruth ULB VD18 - OCR-D Phase III

https://github.com/ulb-sachsen-anhalt/ulb-groundtruth-eval-odem-other

Science Score: 52.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
    Organization ulb-sachsen-anhalt has institutional domain (bibliothek.uni-halle.de)
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (2.6%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

OCR Groundtruth ULB VD18 - OCR-D Phase III

Basic Info
Statistics
  • Stars: 4
  • Watchers: 1
  • Forks: 2
  • Open Issues: 1
  • Releases: 4
Created over 2 years ago · Last pushed over 1 year ago
Metadata Files
Readme License Citation

README.md

ulb-groundtruth-eval-odem-other

OCR-D Phase III - OCR Groundtruth ULB VD18 mixed/other

Metadata

Language:
fra, lat, deu, ita, heb
Format:
Page-XML
Time:
1700-1799
GT Type:
data_structure_and_text
License:
CC-BY 4.0

Sources

The volume of transcriptions:

TextLine Page TxtRegion ImgRegion GraphRegion SepRegion MathRegion NoiseRegion
11311 247 2028 3 65 17 13 16

List of transcriptions

document TxtRegion ImgRegion LineDrawRegion GraphRegion TabRegion ChartRegion SepRegion MathRegion ChemRegion MusicRegion AdRegion NoiseRegion UnknownRegion CustomRegion TextLine Page
lat+grc+heb 1 28 1
heb+lat 2 38 1
ita+ger 77 184 4
ine+ger 9 89 1
lat+ara+per 4 54 2
eng 5 25 1
fre 489 13 2 2388 72
eng+ger 6 34 1
hun 8 35 1
lat+ger+grc 5 27 1
lat+heb+grc 30 299 2
fre+ita 1 44 1
ger+eng+fre 7 2 22 1
lat+spa 1 1 21 1
ger+eng 70 387 3
grc 20 1 1 7 110 3
ger+grc 6 33 1
fre+lat 6 43 1
ger+fre 49 1 280 8
fre+ger+lat 39 112 1
lat+heb 8 1 69 2
ger+lat+fre 2 28 1
lat+ger 388 11 2 2202 51
lat+fre 23 11 28 1
ita 26 1 1 1 230 6
grc+ger 3 1 5 30 1
lat+ger+fre 117 318 2
ger+lat 356 31 2 8 2399 42
ger+ita 9 62 2
ger+fre+lat 20 1 2 250 4
lav+ger 12 65 2
lat+gre 12 37 1
yid+heb+lat 7 1 24 1
fre+ger 60 1 3 271 6
grc+lat 29 1 275 5
lat+grc 111 1 1 684 12
ger+heb+grc 10 86 1

Extent

In this section they can insert additional information, instructions or notes.

Owner

  • Name: Universitäts- und Landesbibliothek Sachsen-Anhalt
  • Login: ulb-sachsen-anhalt
  • Kind: organization
  • Location: Germany, Halle (Saale)

Citation (CITATION.cff)

cff-version: 1.2.0
title: ulb-groundtruth-eval-odem-other
message: If you use this dataset, please cite it using the metadata from this file.
type: dataset
authors:
    - given-names: Uwe
      family-names: Hartwig
      orcid: 'https://orcid.org/0000-0001-7164-6376'
repository-code: 'https://github.com/ulb-sachsen-anhalt/ulb-groundtruth-eval-odem-other'
url: 'https://github.com/ulb-sachsen-anhalt/ulb-groundtruth-eval-odem-other'
abstract: OCR Grountruth ULB VD18 - OCR-D Phase III
keywords:
    - ocr-d
    - repository
    - segmentation
    - ground-truth
    - data_structure_and_text
license: CC-BY 4.0
commit: v1.2.1
version: 4_v1.2.1
date-released: '2024-05-23'

GitHub Events

Total
  • Push event: 1
Last Year
  • Push event: 1

Dependencies

.github/workflows/gtrepo.yml actions
  • JamesIves/github-pages-deploy-action v4.4.1 composite
  • actions/checkout v3 composite
  • mikefarah/yq master composite
  • ncipollo/release-action v1 composite
  • thedoctor0/zip-release master composite