Scientific Software
Updated 10 months ago

TextDescriptives — Peer-reviewed • Rank 17.6 • Science 98%

TextDescriptives: A Python package for calculating a large variety of metrics from text - Published in JOSS (2023)

Scientific Software
Updated 10 months ago

Augmenty — Peer-reviewed • Rank 15.9 • Science 98%

Augmenty: A Python Library for Structured Text Augmentation - Published in JOSS (2024)

Scientific Software
Updated 10 months ago

TRUNAJOD — Peer-reviewed • Rank 12.2 • Science 95%

TRUNAJOD: A text complexity library to enhance natural language processing - Published in JOSS (2021)

Scientific Software
Updated 10 months ago

Mordecai — Peer-reviewed • Rank 12.4 • Science 93%

Mordecai: Full Text Geoparsing and Event Geocoding - Published in JOSS (2017)

Sociology (40%)
Scientific Software · Peer-reviewed
Updated 10 months ago

edsnlp • Rank 17.2 • Science 77%

Modular, fast NLP framework, compatible with Pytorch and spaCy, offering tailored support for French clinical notes.

Updated 10 months ago

pymusas • Rank 12.1 • Science 62%

Python Multilingual Ucrel Semantic Analysis System

Updated 10 months ago

span-marker • Rank 17.2 • Science 54%

SpanMarker for Named Entity Recognition

Updated 10 months ago

negativas • Rank 0.0 • Science 67%

negativas, uma ferramenta para auxiliar na busca e classificação de negações sentenciais no Português Brasileiro.

Updated 10 months ago

classy-classification • Rank 12.3 • Science 54%

This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-shot classification with Huggingface.

Updated 10 months ago

asent • Rank 14.9 • Science 44%

Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.

Updated 10 months ago

dacy • Rank 13.3 • Science 44%

DaCy: The State of the Art Danish NLP pipeline using SpaCy

Updated 10 months ago

spacy-iwnlp • Rank 8.5 • Science 44%

German lemmatization with IWNLP as extension for spaCy

Updated 10 months ago

concise-concepts • Rank 7.3 • Science 44%

This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.

Updated 10 months ago

crosslingual-coreference • Rank 6.4 • Science 44%

A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.

Updated 10 months ago

astred • Rank 3.2 • Science 41%

An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For instance useful for comparing a translation with the original text, to find differences and similarities between two different translations, or to see how a machine translation differs from a reference translation.

Updated 9 months ago

wasabi • Rank 29.5 • Science 13%

🍣 A lightweight console printing and formatting toolkit

Updated 10 months ago

spacy-wrap • Rank 12.4 • Science 26%

spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to include existing fine-tuned models within your SpaCy workflow.

Updated 9 months ago

https://github.com/bramvanroy/spacy_conll • Rank 12.6 • Science 23%

Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.

Updated 9 months ago

https://github.com/brucewlee/lftk • Rank 11.0 • Science 23%

[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.

Updated 10 months ago

gsoc2018-spacy • Rank 10.2 • Science 23%

[GSOC] Greek language support for spacy.io python NLP software

Updated 9 months ago

https://github.com/centrefordigitalhumanities/textminer • Rank 0.0 • Science 26%

A script to detect named entities and store them in an Elasticsearch annotated_text field

Updated 9 months ago

https://github.com/argilla-io/spacy-wordnet • Rank 7.5 • Science 13%

spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface

Updated 9 months ago

epitator • Rank 9.3 • Science 10%

EpiTator annotates epidemiological information in text documents. It is the natural language processing framework that powers GRITS and EIDR Connect.

Updated 8 months ago

https://github.com/dcavar/antisemitismdatathon2020 • Rank 1.8 • Science 13%

This is project material for the Antisemitism Datathon and Hackathon 2020 at Indiana University

Updated 9 months ago

https://github.com/aryashah2k/sasbitathon-winningsolution • Rank 2.4 • Science 10%

1st Place solution for the SAS | GIM Bitathon, an annual Data Science Hackathon organized by SAS and Goa Institute of Management. The dataset worked on is the subset of the consumer complaints database provided by www.consumerfinance.gov

Updated 9 months ago

nlp-pipeline • Science 67%

A German NLP Pipeline for Lexicographic Use Cases

Updated 10 months ago

charles-burney-digital • Science 57%

Digitale Aufbereitung, Anreicherung und Geovisualisierung eines Reiseberichts des Musikhistorikers Charles Burney, mithilfe von Transkribus, Spacy-NER und Nodegoat

Updated 10 months ago

odycy • Science 26%

A general-purpose NLP pipeline for Ancient Greek

Updated 9 months ago

bachelor_thesis_project • Science 44%

System for Training-based Expansion of Tools for Proper Name Mentions Recognition Based on Active Learning

Updated 9 months ago

https://github.com/atharvapathak/twitter_sentiment_analysis_project • Science 23%

Twitter sentiment analysis is the process of analyzing tweets posted on the Twitter platform to determine the overall sentiment expressed within them. It involves using natural language processing (NLP) and machine learning techniques to classify tweets.

Updated 10 months ago

spacy-models • Science 67%

German spaCy models trained on German UD-HDT and on a collection of gold and silver standard NER corpora

Updated 9 months ago

https://github.com/centre-for-humanities-computing/conspiracies • Science 13%

A python package for discovering and examining conspiracies using NLP.