Updated 6 months ago

lm-eval • Rank 28.4 • Science 59%

A framework for few-shot evaluation of language models.

Updated 6 months ago

rwkv-lm • Rank 11.2 • Science 67%

RWKV (pronounced RwaKuv) is an RNN with great LLM performance that can also be trained directly like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". It combines the best of RNNs and transformers: great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

Updated 6 months ago

gpt-neox • Rank 13.8 • Science 64%

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Updated 6 months ago

prompt-engineering-guide • Rank 16.4 • Science 54%

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

Updated 6 months ago

tokenizers • Rank 14.0 • Science 54%

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Updated 5 months ago

https://github.com/blinkdl/chatrwkv • Rank 21.3 • Science 46%

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Updated 6 months ago

opennmt-py • Rank 24.6 • Science 36%

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Updated 6 months ago

feste • Rank 6.7 • Science 44%

Feste is a free and open-source framework allowing scalable composition of NLP tasks using a graph execution model that is optimized and executed by specialized schedulers.

Updated 6 months ago

llama-classification • Rank 4.7 • Science 44%

Text classification with Foundation Language Model LLaMA

Updated 5 months ago

https://github.com/chiang-yuan/llamp • Rank 6.3 • Science 33%

A web app and Python API for a multi-modal RAG framework to ground LLMs on high-fidelity materials informatics. An agentic materials scientist powered by @materialsproject, @langchain-ai, and @openai

Updated 6 months ago

spacy-wrap • Rank 12.4 • Science 26%

spaCy-wrap is a wrapper library for spaCy that lets you include existing fine-tuned transformers from Hugging Face in your spaCy pipeline.

Updated 5 months ago

https://github.com/brucewlee/h-test • Rank 0.7 • Science 33%

[ACL 2024] Language Models Don't Learn the Physical Manifestation of Language

Updated 5 months ago

https://github.com/cvi-szu/linly • Rank 9.4 • Science 23%

Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets

Updated 5 months ago

https://github.com/beomi/easy-lm-trainer • Rank 4.8 • Science 26%

🤗 Sample code for training an LM with minimal setup

Updated 5 months ago

https://github.com/beomi/gemma-easylm • Rank 6.3 • Science 10%

Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)

Updated 6 months ago

cecilia • Science 57%

The Cuban Language Model

Updated 6 months ago

cramming • Science 54%

Cramming the training of a (BERT-type) language model into limited compute.

Updated 6 months ago

nusabert • Science 54%

NusaBERT: Teaching IndoBERT to be multilingual and multicultural!

Updated 6 months ago

staged-training • Science 54%

Staged Training for Transformer Language Models

Updated 5 months ago

https://github.com/awslabs/gap-text2sql • Science 10%

GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

Updated 5 months ago

https://github.com/awslabs/mlm-scoring • Science 10%

Python library & examples for Masked Language Model Scoring (ACL 2020)

Updated 5 months ago

https://github.com/alon-albalak/data-selection-survey • Science 49%

A Survey on Data Selection for Language Models

Updated 6 months ago

roberta-legal-portuguese • Science 41%

Resources related to the paper RoBERTaLexPT: A Legal RoBERTa Model pretrained with deduplication for Portuguese.

Updated 4 months ago

https://github.com/disi-unibo-nlp/easumm • Science 13%

[DATA22 and Springer LNCS] Graph-Enhanced Biomedical Abstractive Summarization via Factual Evidence Extraction

Updated 6 months ago

llms-from-scratch • Science 26%

Build your own Large Language Model from scratch with this code repository. Learn the ins and outs of LLMs like GPT. 🚀💻

Updated 5 months ago

https://github.com/amazon-science/bold • Science 13%

Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper

Updated 6 months ago

snap-umls-clusters • Science 36%

Master's thesis project at Arab American University Palestine with the Palestinian Neuro Initiative Educational Research Center. Clusters medical sentences based on the Unified Medical Language System (UMLS) terms and expanded UMLS terms present in them.

Updated 6 months ago

kokomind • Science 54%

KokoMind: Can LLMs Understand Social Interactions?

Updated 6 months ago

mdlm • Science 62%

[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model

Updated 6 months ago

movie-chatbot • Science 44%

A conversational agent that answers movie-related questions based on a knowledge graph.

Updated 6 months ago

gerpt2 • Science 57%

German small and large versions of GPT2.

Updated 5 months ago

https://github.com/ac-rad/xdl-generation • Science 13%

CLAIRify: Errors are Useful Prompts: Instruction Guided Task Programming with Verifier-Assisted Iterative Prompting

Updated 5 months ago

https://github.com/rentruewang/bocoel • Science 26%

Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few lines of modular code.

Updated 6 months ago

chronos • Science 36%

Debugging-first language model achieving 65.3% autonomous bug fixing (6-7x better than GPT-4). Research, benchmarks & evaluation framework. Model available Q1 2026 via Kodezi OS.

Updated 6 months ago

operagents • Science 44%

Dynamic, highly customizable language agents framework

Updated 5 months ago

https://github.com/alexeyev/letnyayashkola-dl4nlp-2018 • Science 13%

Deep Learning workshop @ letnyayashkola.org, supporting materials for lectures by @alexeyev [in Russian]

Updated 6 months ago

ai_story_scale • Science 67%

The AI story scale (AISS): A human rating scale for texts written with generative language models.

Updated 6 months ago

deeprank-gnn-esm • Science 26%

Graph network for protein-protein interfaces, including language model features

Updated 6 months ago

pix2seq • Science 67%

Simple Implementation of Pix2Seq model for object detection in PyTorch