word_normalisation_data
https://github.com/jean-baptiste-camps/word_normalisation_data
Science Score: 57.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ✓ DOI references: Found 1 DOI reference(s) in README
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (2.1%) to scientific vocabulary
Last synced: 6 months ago
Repository
Basic Info
- Host: GitHub
- Owner: Jean-Baptiste-Camps
- Language: XSLT
- Default Branch: main
- Size: 23.5 MB
Statistics
- Stars: 0
- Watchers: 2
- Forks: 1
- Open Issues: 1
- Releases: 1
Created almost 5 years ago · Last pushed almost 4 years ago
Metadata Files
Readme
Citation
README.md
Word Normalisation Datasets
Datasets for training word segmentation models, in particular with Pie.
They are drawn from various sources, including the datasets published by the Oriflamms project (Stutzmann et al., https://github.com/oriflamms/).
Old French
Geste
- Geste: un corpus de chansons de geste, ed. Jean-Baptiste Camps, with the collaboration of Elena Albarran, Alice Cochet & Lucence Ing, Paris, 2016-…, DOI: 10.5281/zenodo.1744918, https://github.com/Jean-Baptiste-Camps/Geste/.
Oriflamms
Oriflamms projects, dir. Dominique Stutzmann, available at:
- https://github.com/oriflamms/AlbumMssFrXIII
- https://github.com/oriflamms/ECMEN
- https://github.com/oriflamms/Pelerinage
- https://github.com/oriflamms/Graal
Wauchier
- Pinche, Ariane, Li Seint Confessor, data from the doctoral thesis: Edition nativement numérique du recueil hagiographique "Li Seint Confessor" de Wauchier de Denain d'après le manuscrit fr. 412 de la Bibliothèque nationale de France, 2021-09-01, https://github.com/ArianePinche/EditionLiSeintConfessor.
Latin
Oriflamms
Oriflamms projects, dir. Dominique Stutzmann, available at:
- https://github.com/oriflamms/Fontenay/
- https://github.com/oriflamms/Dated-and-Datable-Manuscripts_AI2A
- https://github.com/oriflamms/PsautierIMS
Owner
- Name: Jean-Baptiste Camps
- Login: Jean-Baptiste-Camps
- Kind: user
- Location: Paris
- Company: École nationale des chartes (@chartes) | PSL
- Website: www.chartes.psl.eu/jean-baptiste-camps
- Twitter: jbcamps
- Repositories: 11
- Profile: https://github.com/Jean-Baptiste-Camps
Assoc. Prof. in Computational Philology @chartes | Head of MA @Humanites-Numeriques-PSL
Citation (CITATION.CFF)
cff-version: 1.2.0
message: "If you use this dataset, please cite it as below (in addition to the referenced corpora)"
type: dataset
authors:
  - family-names: Camps
    given-names: Jean-Baptiste
    orcid: https://orcid.org/0000-0003-0385-7037
title: "word_normalisation_data: v0.1 DH Tokyo Abstract"
doi: 10.5281/zenodo.6500649
version: v0.1
date-released: 2022-04-28
Issues and Pull Requests
Last synced: 12 months ago
All Time
- Total issues: 1
- Total pull requests: 5
- Average time to close issues: N/A
- Average time to close pull requests: about 15 hours
- Total issue authors: 1
- Total pull request authors: 2
- Average comments per issue: 0.0
- Average comments per pull request: 0.4
- Merged pull requests: 5
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- Jean-Baptiste-Camps (1)
Pull Request Authors
- Jean-Baptiste-Camps (4)
- MargueriteVernet (1)