Scientific Software
Updated 6 months ago

textnets — Peer-reviewed • Rank 13.9 • Science 100%

textnets: A Python package for text analysis with networks - Published in JOSS (2020)

Updated 6 months ago

jstor — Peer-reviewed • Rank 16.6 • Science 93%

jstor: Import and Analyse Data from Scientific Texts - Published in JOSS (2018)

Updated 6 months ago

corporaexplorer — Peer-reviewed • Rank 14.8 • Science 93%

corporaexplorer: An R package for dynamic exploration of text collections - Published in JOSS (2019)

Updated 6 months ago

TRUNAJOD — Peer-reviewed • Rank 12.2 • Science 95%

TRUNAJOD: A text complexity library to enhance natural language processing - Published in JOSS (2021)

Updated 6 months ago

cntext • Rank 12.3 • Science 67%

Text analysis package supporting multiple methods, including word count, readability, document similarity, sentiment analysis, Word2Vec/GloVe, and Large Language Models (LLMs).

Updated 6 months ago

LSX • Rank 11.5 • Science 59%

Semi-supervised algorithm for document scaling

Updated 6 months ago

rainette • Rank 11.2 • Science 59%

R implementation of the Reinert text clustering method

Updated 6 months ago

occupationcoder • Rank 9.9 • Science 44%

Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.

Updated 6 months ago

align • Rank 13.1 • Science 36%

Python library for extracting quantitative, reproducible metrics of multi-level alignment between speakers in naturalistic language corpora.

Updated 6 months ago

newsmap • Rank 13.4 • Science 33%

Semi-supervised algorithm for geographical document classification

Updated 6 months ago

qdap • Rank 18.4 • Science 23%

Quantitative Discourse Analysis Package: Bridging the gap between qualitative data and quantitative analysis

Updated 6 months ago

textclean • Rank 15.8 • Science 23%

Tools for cleaning and normalizing text data

Updated 5 months ago

https://github.com/brucewlee/lftk • Rank 11.0 • Science 23%

[BEA @ ACL 2023] General-purpose tool for linguistic feature extraction; tested on readability assessment, essay scoring, fake news detection, hate speech detection, etc.

Updated 5 months ago

stylest • Rank 10.0 • Science 23%

R package for estimating speaker style distinctiveness in texts. Install it from CRAN!

Updated 6 months ago

lingmatch • Rank 8.8 • Science 23%

An all-in-one R package for the assessment of linguistic similarity

Updated 3 days ago

anvay: A Web-based Tool for Interpretive Topic Modelling in Bengali • Science 87%

anvay: A Web-based Tool for Interpretive Topic Modelling in Bengali - Published in JOSS (2026)

Updated 6 months ago

taguette • Science 54%

Free and open source qualitative research tool -- MIRROR OF GITLAB REPOSITORY

Updated 5 months ago

https://github.com/chainsawriot/textplex • Science 13%

Calculate textual complexity using the algorithm by Tolochko & Boomgaarden (2019).

Updated 6 months ago

wtt • Science 57%

The Word-Text-Topic (WTT) extraction approach, implemented in Python and R.

Updated 5 months ago

https://github.com/chainsawriot/rectr • Science 26%

💒 Reproducible Extraction of Cross-lingual Topics using R

Updated 6 months ago

universitatespodcastdata • Science 67%

An R package for downloading, extracting, and analyzing interview transcripts from the Universitates podcast series. It provides tools for data processing, searching, and visualization.

Updated 6 months ago

iramuteqlike • Science 26%

💬⛏️ IRaMuTeQ Software Analyses in R

Updated 6 months ago

architxt • Science 44%

ArchiTXT is an open source Python library that transforms unstructured text into structured, searchable, and AI-ready data. It enables automated database generation and seamless data integration.