Scientific Software
Updated 10 months ago
Turftopic
Turftopic: Topic Modelling with Contextual Representations from Sentence Transformers - Published in JOSS (2025)
Updated 10 months ago
scattertext
Beautiful visualizations of how language differs among document types.
Updated 10 months ago
hypertools
A Python toolbox for gaining geometric insights into high-dimensional data
Updated 10 months ago
https://github.com/amazon-science/text_generation_diffusion_llm_topic
Topic Embedding, Text Generation and Modeling using diffusion
Updated 10 months ago
https://github.com/bernard-ng/drc-news-ml
DRC News Corpus, Towards a scalable and intelligent system for Congolese News curation
Updated 10 months ago
authorship_clustering_code_repo
LAC: Latent Authorial Clustering of Shorter Texts
Updated 10 months ago
https://github.com/caimeng2/topicmodelingworkshop
SSDA workshop: topic modeling for exploratory text analysis
Updated 10 months ago
topictcga
Notebooks for "A topic model analysis of TCGA transcriptomic data of breast and lung cancer"
Updated 10 months ago
latent_dirichlet_allocation
Topic modeling (LDA) extracts latent topics distributed across documents and represent them with most likely words.