nlp-on-multilingual-coin-datasets
https://github.com/frankfurt-bigdatalab/nlp-on-multilingual-coin-datasets
Science Score: 57.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 1 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.3%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: Frankfurt-BigDataLab
- License: other
- Language: Jupyter Notebook
- Default Branch: main
- Size: 48.6 MB
Statistics
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 1
- Releases: 1
Metadata Files
README.md
NLP-on-multilingual-coin-descriptions
Experimental: You can try it as a human: Open under Corpus Numorum
Use it in your code via FastAPI:
There is also a FastAPI were you can POST requests with parameters in a JSON block: https://data.corpus-nummorum.eu/api/nlp#/default/ Providing: - /ner/ Named Entity Recognition - /re/ Predict Relation Extraction - /rdf/ The generation of a RDF representation for NER and RE
Access relevant information from iconographies of the Corpus Numorum Online using natural language processing. Improvement of named entity recognition, followed by the extraction of relationships for the entity types "PERSON", "OBJECT", "ANIMAL" and "PLANT".
Requirements:
- python 3.8.12
- pandas 1.3.3
- spacy 2.3.7
- scikit-learn 0.24.2
- SQL database
Credits
This project is based on a number of theses: - Patricia Klinger: Natural Language Processing to enable semantic search on numismatic descriptions.
Sebastian Gampe: Kombination maschineller Lernmethoden der Bild- und Texterkennung auf antiken Münzdaten
Kerim Gencer und Chrisowalandis Deligio: Natural Language Processing auf mehrsprachigen Münzdatensätzen – Untersuchung der Qualität, Datenqualität und Übertragbarkeit auf andere Datensätze
Katrin Peikert: Optimierung beim Annotieren mehrsprachiger Münzdatensätze im Kontext des Natural Language Processing
Reference
- https://www.corpus-nummorum.eu/
- http://numismatics.org/ocre/
- Patricia Klinger, Sebastian Gampe, Karsten Tolle, Ulrike Peter (2018). Semantic Search based on Natural Language Processing – a Numismatic example. Journal of Ancient History and Archaeology (JAHA) Vol 5 No. 3 (2018) 68-79 (DOI: 10.14795/j.v5i3.334, http://jaha.org.ro/index.php/JAHA/article/view/334/244)
Owner
- Name: Frankfurt-BigDataLab
- Login: Frankfurt-BigDataLab
- Kind: organization
- Repositories: 2
- Profile: https://github.com/Frankfurt-BigDataLab
Citation (CITATION.cff)
# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!
cff-version: 1.2.0
title: NLP-on-multilingual-coin-descriptions
message: >-
If you use this software, please cite it using the
metadata from this file.
type: software
authors:
- affiliation: Goethe University Frankfurt
given-names: Chrisowalandis
family-names: Deligio
orcid: 'https://orcid.org/0000-0002-5708-4271'
- given-names: Sebastian
family-names: Gampe
email: gampe@em.uni-frankfurt.de
affiliation: Goethe University Frankfurt
orcid: 'https://orcid.org/0009-0002-8700-3505'
- given-names: Karsten
family-names: Tolle
email: K.Tolle@em.uni-frankfurt.de
affiliation: Goethe University Frankfurt
orcid: 'https://orcid.org/0000-0002-9953-7638'
- given-names: Patricia
family-names: Klinger
- given-names: Kerim
family-names: Gencer
abstract: >
Pipeline to extract named entities and their relation
between each other for ancient coin descriptions.
Providing:
/ner/ Named Entity Recognition
/re/ Predict Relation Extraction
/rdf/ The generation of a RDF representation for NER and RE
license: CC-BY-NC-SA-3.0
GitHub Events
Total
- Watch event: 2
- Push event: 6
Last Year
- Watch event: 2
- Push event: 6