Scientific Software
Updated 9 months ago

Jury — Peer-reviewed • Rank 14.8 • Science 93%

Jury: A Comprehensive Evaluation Toolkit - Published in JOSS (2024)

Updated 9 months ago

skorch • Rank 24.5 • Science 54%

A scikit-learn compatible neural network library that wraps PyTorch

Updated 9 months ago

mlm-bias • Rank 4.9 • Science 67%

Measuring Biases in Masked Language Models for PyTorch Transformers. Support for multiple social biases and evaluation measures.

Updated 9 months ago

span-marker • Rank 17.2 • Science 54%

SpanMarker for Named Entity Recognition

Updated 9 months ago

knn-transformers • Rank 5.6 • Science 62%

PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT

Updated 9 months ago

distilabel • Rank 11.6 • Science 54%

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Updated 9 months ago

detoxify • Rank 21.0 • Science 44%

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ PyTorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.

Updated 9 months ago

frame-semantic-transformer • Rank 10.9 • Science 54%

Frame Semantic Parser based on T5 and FrameNet

Updated 9 months ago

cappr • Rank 9.8 • Science 44%

Completion After Prompt Probability. Make your LLM make a choice

Updated 9 months ago

tamingllms • Rank 7.2 • Science 44%

Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software

Updated 9 months ago

pizzly • Rank 5.6 • Science 44%

Pizzly, financial market analysis combining technical indicators with LLMs, featuring real-time data processing and AI-powered market insights ⚡️

Updated 9 months ago

logfire-callback • Rank 2.6 • Science 44%

A callback for logging training events from Hugging Face's Transformers to Logfire 🤗

Updated 9 months ago

napolab • Rank 7.0 • Science 36%

The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their performance across carefully curated Portuguese language tasks.

Updated 9 months ago

spacy-wrap • Rank 12.4 • Science 26%

spaCy-wrap is a wrapper library for spaCy that lets you include existing fine-tuned transformers from Hugging Face in your spaCy pipeline.

Updated 9 months ago

https://github.com/cakecrusher/mimicbot • Rank 6.5 • Science 26%

Mimicbot enables the effortless yet modular creation of an AI chat bot model that imitates another person's manner of speech.

Updated 9 months ago

https://github.com/beomi/easy-lm-trainer • Rank 4.8 • Science 26%

🤗 Sample code for training an LM with minimal setup

Updated 9 months ago

https://github.com/beomi/gemma-easylm • Rank 6.3 • Science 10%

Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)

Updated 9 months ago

mergekit • Science 44%

Tools for merging pretrained Large Language Models and creating Mixture of Experts (MoE) from open-source models.

Updated 9 months ago

minicorpus • Science 54%

Investigating, reproducing, and improving MiniPile with PyTorch and HuggingFace

Updated 9 months ago

partial-embedding-matrix-adaptation • Science 41%

Vocabulary-level memory efficiency for language model fine-tuning.

Updated 9 months ago

mazegpt • Science 44%

AI model for making mazes that extends OpenAI's GPT-2 model

Updated 9 months ago

layoutreader • Science 44%

A faster LayoutReader model based on LayoutLMv3 that sorts OCR bboxes into reading order.

Updated 9 months ago

hfcommunity • Science 52%

HFCommunity offers an offline, up-to-date relational database built from the data available at the Hugging Face Hub, providing queryable data about the repositories hosted in the Hub

Updated 9 months ago

simplyretrieve • Science 54%

Lightweight chat AI platform featuring custom knowledge, open-source LLMs, prompt-engineering, retrieval analysis. Highly customizable. For Retrieval-Centric & Retrieval-Augmented Generation.

Updated 9 months ago

legalkit-pipeline • Science 44%

Publication pipeline for French legal codes on 🤗 Datasets from LegiFrance with concurrent upload and dynamic README.md.

Updated 8 months ago

https://github.com/huggingface/zapier • Science 26%

Hugging Face's Zapier Integration 🤗⚡️

Updated 9 months ago

text-to-ml • Science 44%

Programmable automated machine learning - proof of concept

Updated 9 months ago

lcc • Science 67%

Fine-tuning classifier NNs with Latent Cluster Correction

Updated 9 months ago

pix2seq • Science 67%

Simple Implementation of Pix2Seq model for object detection in PyTorch

Updated 9 months ago

moe-infinity • Science 54%

PyTorch library for cost-effective, fast and easy serving of MoE models.

Updated 9 months ago

2uml • Science 26%

A user-friendly tool that generates UML diagrams from natural language inputs.