Scientific Software
Updated 6 months ago

Augmenty — Peer-reviewed • Rank 15.9 • Science 98%

Augmenty: A Python Library for Structured Text Augmentation - Published in JOSS (2024)

Updated 6 months ago

hanlp • Rank 24.3 • Science 54%

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理

Updated 6 months ago

torchdistill • Rank 15.0 • Science 59%

A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆26 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring the reproducibiliy and benchmark.

Updated 6 months ago

knowprompt • Rank 6.4 • Science 67%

[WWW 2022] KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction

Updated 6 months ago

rainette • Rank 11.2 • Science 59%

R implementation of the Reinert text clustering method

Updated 4 months ago

https://github.com/google-research/retvec • Rank 12.3 • Science 54%

RETVec is an efficient, multilingual, and adversarially-robust text vectorizer.

Updated 6 months ago

classy-classification • Rank 12.3 • Science 54%

This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-shot classification with Huggingface.

Updated 6 months ago

pytextclassifier • Rank 12.2 • Science 54%

pytextclassifier is a toolkit for text classification. 文本分类,LR,Xgboost,TextCNN,FastText,TextRNN,BERT等分类模型实现,开箱即用。

Updated 6 months ago

cappr • Rank 9.8 • Science 44%

Completion After Prompt Probability. Make your LLM make a choice

Updated 6 months ago

spacy-wrap • Rank 12.4 • Science 26%

spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to include existing fine-tuned models within your SpaCy workflow.

Updated 6 months ago

tlaf • Rank 7.5 • Science 23%

A comprehensive tool for linguistic analysis of communities

Updated 6 months ago

band • Rank 8.7 • Science 13%

BAND:BERT Application aNd Deployment, A simple and efficient BERT model training and deployment framework.

Updated 6 months ago

text-sim • Rank 7.9 • Science 10%

文本相似度(匹配)计算,提供Baseline、训练、推理、指标分析...代码包含TensorFlow/Pytorch双版本

Updated 5 months ago

https://github.com/cair/textunderstandingtsetlinmachine • Science 36%

Using the Tsetlin Machine to learn human-interpretable rules for high-accuracy text categorization with medical applications

Updated 5 months ago

https://github.com/autodistill/autodistill-setfit • Science 13%

Train a SetFit model for use in text classification.

Updated 5 months ago

https://github.com/capjamesg/textannotate • Science 13%

An annotation and review tool for text classification model training.

Updated 6 months ago

hackathon-leaderboard • Science 44%

Automated Leaderboard System for Hackathon Evaluation Using Large Language Models

Updated 5 months ago

https://github.com/ben-aaron188/snlp • Science 13%

2-day course on Statistical Natural Language Processing in R (foundational level)

Updated 6 months ago

porkcnn • Science 44%

A Small Project for Pork Barrel Legislation Classification Using Convolutional Neural Networks (Lour's Pork Barrel Classifier (羅老師肉桶法案分類器)🍖🐖 🥩🐷

Updated 5 months ago

https://github.com/chaoscodes/untl • Science 10%

EMNLP'2022: Unsupervised Non-transferable Text Classification

Updated 6 months ago

azimuth • Science 67%

Helping AI practitioners better understand their datasets and models in text classification. From ServiceNow.

Updated 6 months ago

banglasenti-dataset-prep • Science 44%

BanglaSenti Dataset Preparation: Bangla Sentiment Analysis CSV Dataset for NLP & Machine Learning