Updated 9 months ago

gsoc2018-spacy • Rank 10.2 • Science 23%

[GSOC] Greek language support for spacy.io python NLP software

Updated 9 months ago

textstem • Rank 12.7 • Science 10%

Tools for fast text stemming & lemmatization

Updated 9 months ago

xsnts-analyzer • Science 44%

Application that scrapes Twitter/X data, performs Polish-language text normalization and lemmatization, builds topic models with MALLET, and runs sentiment analysis for downstream analytics. Created as a project for Master Thesis.

Updated 9 months ago

nlp-pipeline • Science 67%

A German NLP Pipeline for Lexicographic Use Cases

Updated 9 months ago

https://github.com/chartes/deucalion-model-af • Science 23%

Modèle Pie pour la lemmatisation de l’ancien français

Updated 9 months ago

https://github.com/acdh-oeaw/tokeneditor • Science 26%

TokenEditor is a web application for manual annotation (or manual review of automatic annotations) of text. Albeit primarily aimed at reviewing PoS tags and lemmas, it is fully customizable, to support any annotation levels.

Updated 9 months ago

pyrrha • Science 67%

A language-independent post-correction app for POS-tagging and lemmatization

Updated 9 months ago

pet-project-nlp • Science 26%

Natural language processing pet project. It includes data web scraping, lemmatizing, stemming, and working with related words (hyponyms, hypernyms, meronyms, holonyms). This specific code gathers all data from chosen pages of the Suspilne (Суспільне) webpage. Next, the data is manipulated and processed for future analysis