UralicNLP
UralicNLP: An NLP Library for Uralic Languages - Published in JOSS (2019)
edsnlp
Modular, fast NLP framework, compatible with Pytorch and spaCy, offering tailored support for French clinical notes.
datasets-minard-napoleons-march
Data for Charles Joseph Minard's cartographic depiction of Napoleon's Russian campaign of 1812.
tapuscorpus
Ground Truth for French 20th century typewritten documents collected on Gallica and Europeana
cremma-wikipedia
A collection of ground truth to train HTR models on contemporary French handwritings
dahncorpus
Ground Truth dataset for French 20th typewritten OCR produced by the DAHN project
timeuscorpus
Ground Truth datasets for French 18th and 19th HTR produced by the ANR project TIME US
roman18
Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)
lectaurep-bronod
Lectaurep-Bronod, ground truth for Maitre Bronod's documents (French 18th century)