Scientific Software
Updated 6 months ago
Turftopic
Turftopic: Topic Modelling with Contextual Representations from Sentence Transformers - Published in JOSS (2025)
Updated 6 months ago
scattertext
Beautiful visualizations of how language differs among document types.
Updated 5 months ago
hypertools
A Python toolbox for gaining geometric insights into high-dimensional data
Updated 6 months ago
latent_dirichlet_allocation
Topic modeling (LDA) extracts latent topics distributed across documents and represent them with most likely words.
Updated 5 months ago
https://github.com/bernard-ng/drc-news-ml
DRC News Corpus, Towards a scalable and intelligent system for Congolese News curation
Updated 6 months ago
topictcga
Notebooks for "A topic model analysis of TCGA transcriptomic data of breast and lung cancer"
Updated 5 months ago
https://github.com/caimeng2/topicmodelingworkshop
SSDA workshop: topic modeling for exploratory text analysis
Updated 6 months ago
authorship_clustering_code_repo
LAC: Latent Authorial Clustering of Shorter Texts
Updated 5 months ago
https://github.com/amazon-science/text_generation_diffusion_llm_topic
Topic Embedding, Text Generation and Modeling using diffusion