Projects

Updated 10 months ago

gensim • Rank 15.8 • Science 77%

Topic Modelling for Humans

data-mining data-science document-similarity fasttext gensim information-retrieval machine-learning natural-language-processing neural-network nlp python topic-modeling word-embeddings word-similarity word2vec

Mathematics (38%)

Updated 10 months ago

spec2vec • Rank 14.2 • Science 67%

Word2Vec based similarity measure of mass spectrometry data.

fuzzy-matching fuzzy-search mass-spectrometry word2vec

Updated 10 months ago

cntext • Rank 12.3 • Science 67%

text analysis, supporting multiple methods including word count, readability, document similarity, sentiment analysis, Word2Vec/GloVe, and Large Language Models (LLMs).文本分析包，支持字数统计、可读性、文档相似度、情感分析在内的多种文本分析方法。

chinese content-analysis discourse-analysis glove llm nlp semantic-analysis sentiment-analysis social-science text-analysis text-mining word2vec

Updated 10 months ago

text2vec • Rank 17.9 • Science 54%

text2vec, text to vector. 文本向量表征工具，把文本转化为向量矩阵，实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型，开箱即用。

embeddings nlp sentence-embeddings similarity text-similarity text2vec word2vec

Updated 9 months ago

scattertext • Rank 19.4 • Science 49%

Beautiful visualizations of how language differs among document types.

computational-social-science d3 eda exploratory-data-analysis japanese-language machine-learning natural-language-processing nlp scatter-plot semiotic-squares sentiment stylometric stylometry text-as-data text-mining text-visualization topic-modeling visualization word-embeddings word2vec

Updated 10 months ago

word2vecelastic • Rank 0.7 • Science 54%

Collect sentences from ElasticSearch, preprocess and train diachronic Word2Vec models

elasticsearch embeddings word2vec

Updated 9 months ago

sense2vec • Rank 18.5 • Science 36%

🦆 Contextually-keyed word vectors

gensim gensim-word2vec machine-learning natural-language-processing nlp python sense2vec spacy word2vec

Updated 9 months ago

align • Rank 13.1 • Science 36%

Python library for extracting quantitative, reproducible metrics of multi-level alignment between speakers in naturalistic language corpora.

conversation-analysis corpus-tools linguistic-alignment linguistic-analysis ngram-analysis nltk notebooks python text-analysis word2vec

Updated 10 months ago

itembed • Rank 4.7 • Science 44%

Python library to train shallow embeddings on unordered sequences

embedding-vectors python python-library word2vec

Updated 9 months ago

word2vec • Rank 15.1 • Science 23%

Distributed Representations of Words using word2vec

embeddings natural-language-processing r-package word2vec

Updated 9 months ago

word2vec • Rank 18.7 • Science 10%

Python interface to Google word2vec

doc2vec python word2vec

Updated 9 months ago

https://github.com/alibaba/alink • Rank 11.0 • Science 13%

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

apriori classification clustering data-mining feature-engineering flink flink-machine-learning flink-ml fm graph-algorithms graph-embedding kafka machine-learning recommender recommender-system regression statistics word2vec xgboost

Updated 9 months ago