Projects | Open Source Science

Scientific Software

Updated 10 months ago

LangFair — Peer-reviewed • Rank 14.9 • Science 95%

LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases - Published in JOSS (2025)

ai ai-safety artificial-intelligence bias bias-detection ethical-ai fairness fairness-ai fairness-ml fairness-testing large-language-models llm llm-evaluation llm-evaluation-framework llm-evaluation-metrics python responsible-ai

Engineering Mathematics (42%)

Scientific Software · Peer-reviewed

Updated 10 months ago

lazyllm-llamafactory • Rank 25.6 • Science 77%

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

agent ai deepseek fine-tuning gemma gpt instruction-tuning large-language-models llama llama3 llm lora moe nlp peft qlora quantization qwen rlhf transformers

Updated 10 months ago

ontogpt • Rank 16.7 • Science 77%

LLM-based ontological extraction tools, including SPIRES

ai chat-gpt data-modeling gpt-3 information-extraction language-models large-language-models linkml llm monarchinitiative named-entity-recognition ner nlp oaklib obofoundry relation-extraction

Updated 10 months ago

kitsune • Rank 4.2 • Science 85%

Kitsune is a next-generation data steward and harmonization tool.

data-harmonization data-stewardship embeddings large-language-models semantic-mapping

Updated 10 months ago

litgpt • Rank 23.6 • Science 64%

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

ai artificial-intelligence deep-learning large-language-models llm llm-inference llms

Updated 10 months ago

txtai • Rank 22.2 • Science 64%

💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows

ai artificial-intelligence embeddings information-retrieval language-model large-language-models llm machine-learning nlp python rag retrieval-augmented-generation search search-engine semantic-search sentence-embeddings transformers txtai vector-database vector-search

Updated 10 months ago

kani • Rank 8.3 • Science 77%

kani (カニ) is a highly hackable microframework for tool-calling language models. (NLP-OSS @ EMNLP 2023)

chatgpt framework function-calling gpt-4 large-language-models llama llms microframework openai tool-use

Updated 10 months ago

spezillm • Rank 8.0 • Science 77%

Large Language Model (LLM) module for the Spezi Ecosystem

chatbot gpt ios large-language-models llm openai spezi swift swiftui

Updated 10 months ago

datastew • Rank 8.9 • Science 75%

Python library for intelligent data stewardship using Large Language Model (LLM) embeddings

data-harmonization data-stewardship large-language-models

Updated 10 months ago

dolma • Rank 19.6 • Science 64%

Data and tools for generating and inspecting OLMo pre-training data.

data-processing large-language-models llm machile-learning nlp

Updated 10 months ago

farm-haystack • Rank 28.7 • Science 54%

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

agent agents ai gemini generative-ai gpt-4 information-retrieval large-language-models llm machine-learning nlp orchestration python pytorch question-answering rag retrieval-augmented-generation semantic-search summarization transformers

Updated 10 months ago

kvpress • Rank 14.6 • Science 67%

LLM KV cache compression made easy

inference kv-cache kv-cache-compression large-language-models llm long-context python pytorch transformers

Updated 10 months ago

ray • Rank 35.1 • Science 46%

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

data-science deep-learning deployment distributed hyperparameter-optimization hyperparameter-search large-language-models llm llm-inference llm-serving machine-learning optimization parallel python pytorch ray reinforcement-learning rllib serving tensorflow

Updated 10 months ago

inseq • Rank 13.6 • Science 67%

Interpretability for sequence generation models 🐛 🔍

attribution-methods captum deep-learning explainable-ai generative-ai huggingface interpretability language-generation language-model large-language-models natural-language-processing sequence-to-sequence transformers

Updated 10 months ago

bigcodebench • Rank 15.3 • Science 64%

[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI

agent agents benchmark chatgpt claude-3 code-generation deepseek function-calling gemini gpt-4 instruction-following large-language-models llm program-synthesis tool-use

Updated 10 months ago

tensorzero • Rank 23.4 • Science 54%

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

ai ai-engineering anthropic artificial-intelligence deep-learning genai generative-ai gpt large-language-models llama llm llmops llms machine-learning ml ml-engineering mlops openai python rust

Mathematics (40%)

Updated 10 months ago

learn_prompting • Rank 13.3 • Science 64%

Prompt Engineering, Generative AI, and LLM Guide by Learn Prompting | Join our discord for the largest Prompt Engineering learning community

chatgpt chatgpt-api deep-learning gpt-3 gpt-4 gpt-4-api gpt3 large-language-models llm machine-learning nlp openai-api prompt-engineering prompt-toolkit prompt-tuning prompting transformers

Updated 10 months ago

tree-of-thought-prompting • Rank 8.0 • Science 67%

Using Tree-of-Thought Prompting to boost ChatGPT's reasoning

large-language-models llm prompt-engineering

Updated 10 months ago

chronos-forecasting • Rank 20.5 • Science 54%

Chronos: Pretrained Models for Probabilistic Time Series Forecasting

artificial-intelligence forecasting foundation-models huggingface huggingface-transformers large-language-models llm machine-learning pretrained-models time-series time-series-forecasting timeseries transformers

Updated 10 months ago

symbolicai • Rank 17.3 • Science 54%

A neurosymbolic perspective on LLMs

large-language-models neurosymbolic-ai probabilistic-programming

Artificial Intelligence and Machine Learning (38%)

Updated 10 months ago

mllmcelltype • Rank 12.3 • Science 59%

🏆 #1 Multi-LLM consensus framework | 550+ stars | 95% accuracy | 10+ LLM providers | Leading cell annotation tool

artificial-intelligence bioinformatics cell-type-annotation claude computational-biology consensus-algorithm deepseek gemini grok large-language-models llm multi-llm-consensus openai openrouter qwen scanpy scrna scrnaseq-analysis seurat single-cell

Updated 10 months ago

llms-from-scratch • Rank 14.9 • Science 54%

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

ai artificial-intelligence chatgpt deep-learning from-scratch gpt language-model large-language-models llm machine-learning python pytorch transformer

Updated 10 months ago

eval-suite • Rank 9.2 • Science 59%

[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.

benchmark ceval chatgpt dataset evaluation gpt-3 gpt-4 hallucination hallucination-detection hallucination-evaluation hallucinations huggingface huggingface-transformers large-language-models llm openai openai-api qwen

Updated 10 months ago

optimate • Rank 12.8 • Science 54%

A collection of libraries to optimise AI model performances

ai analytics artificial-intelligence deeplearning large-language-models llm

Updated 10 months ago

agentlego • Rank 12.6 • Science 54%

Enhance LLM agents with rich tool APIs

large-language-models llm llm-agents

Updated 10 months ago

beyondllm • Rank 12.5 • Science 54%

Build, evaluate and observe LLM apps

ai artificial-intelligence embeddings evaluate-llm genai generative-ai hacktoberfest hacktoberfest-accepted hacktoberfest2024 large-language-models llm llms rag

Updated 10 months ago

https://github.com/modelscope/data-juicer • Rank 19.2 • Science 46%

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

data data-analysis data-pipeline data-processing data-science data-visualization foundation-models instruction-tuning large-language-models llm llms multi-modal pre-training synthetic-data

Updated 10 months ago

chinese-llama-alpaca-2 • Rank 11.0 • Science 54%

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

64k alpaca alpaca-2 alpaca2 flash-attention large-language-models llama llama-2 llama2 llm nlp rlhf yarn

Updated 10 months ago

multilspy • Rank 18.6 • Science 46%

multilspy is a lsp client library in Python intended to be used to build applications around language servers.

ai ai4code artificial-intelligence code-analysis code-completion code-generation codegen huggingface-transformers language-server-client language-server-protocol large-language-model large-language-models llm lsp lsp-client neurips neurips-2023 program-synthesis transformer

Updated 10 months ago

llmebench • Rank 10.5 • Science 54%

Benchmarking Large Language Models

benchmarking large-language-models llm multilingual

Updated 10 months ago

libre-chat • Rank 10.3 • Science 54%

🦙 Free and Open Source Large Language Model (LLM) chatbot web UI and API. Self-hosted, offline capable and easy to setup.

chatbot chatgpt langchain large-language-models llm llm-inference openapi self-hosted

Updated 10 months ago

surprisal • Rank 10.1 • Science 54%

A unified interface for computing surprisal (log probabilities) from language models! Supports neural, symbolic, and black-box API models.

gpt language-modeling large-language-models log-likelihood next-word-prediction surprisal

Updated 10 months ago

magicoder • Rank 9.0 • Science 54%

[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct

ai4code large-language-models llm llm4code

Updated 10 months ago

cambrian • Rank 9.0 • Science 54%

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

chatbot clip computer-vision dino instruction-tuning large-language-models llms mllm multimodal-large-language-models representation-learning

Updated 10 months ago

llm-sandbox • Rank 17.7 • Science 44%

Lightweight and portable LLM sandbox runtime (code interpreter) Python library.

code-generation code-interpreter large-language-models llm-sandbox

Updated 10 months ago

@llm-tools/embedjs • Rank 17.7 • Science 44%

A NodeJS RAG framework to easily work with LLMs and embeddings

ai chatgpt claude cohere embedding embeddings gpt gpt-4 gpt-4o huggingface large-language-models llm mistral ollama openai pinecone rag vector-database vertex-ai

Updated 10 months ago

deeplake • Rank 25.6 • Science 36%

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

ai computer-vision cv data-science datalake datasets deep-learning image-processing langchain large-language-models llm machine-learning ml mlops multi-modal python pytorch tensorflow vector-database vector-search

Updated 10 months ago

chinese-mixtral • Rank 7.1 • Science 54%

中文Mixtral混合专家大模型（Chinese Mixtral MoE LLMs）

32k 64k large-language-models llm mixtral mixture-of-experts moe nlp

Updated 10 months ago

monitors4codegen • Rank 7.0 • Science 54%

Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multispy` is a lsp client library in Python intended to be used to build applications around language servers.

ai ai4code artificial-intelligence code-analysis code-completion code-generation codegen dataset huggingface-transformers language-server-client language-server-protocol large-language-models llm lsp lsp-client neurips neurips-2023 program-synthesis transformer

Updated 10 months ago

https://github.com/alan-turing-institute/robots-in-disguise • Rank 6.7 • Science 54%

Information and materials for the Turing's "robots-in-disguise" reading group on fundamental AI research.

deep-learning diffusion-models foundation-model hut23 language-models large-language-models machine-learning nlp transformers

Updated 10 months ago

q-galore • Rank 5.3 • Science 54%

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.

large-language-models low-rank memory-efficient-learning quantization

Updated 10 months ago

py-alpaca-eval • Rank 12.6 • Science 46%

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

deep-learning evaluation foundation-models instruction-following large-language-models leaderboard nlp rlhf

Updated 10 months ago

chinese-llama-alpaca • Rank 12.2 • Science 46%

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

alpaca alpaca-2 large-language-models llama llama-2 llm lora nlp plm pre-trained-language-models quantization

Updated 10 months ago

openqa-eval • Rank 3.8 • Science 54%

ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models

evaluation large-language-models open-domain-qa openai-api question-answering

Updated 10 months ago

xfinder • Rank 6.6 • Science 51%

[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation

benchmark cc-by-nc-nd-4 chatglm dataset evaluation gpt judge-model key-answer-extraction large-language-models llm llm-as-a-judge llm-as-evaluator lm-evaluation open-compass phi qwen regex reliability reliable-evaluation xfinder

Updated 10 months ago

nutcracker • Rank 1.9 • Science 54%

Large Model Evaluation Experiments

large-language-models llm llm-evaluation llmops

Updated 10 months ago

atlas • Rank 3.0 • Science 52%

Atlas: Adaptive Competency-Based Learning

artificial-intelligence education gen-ai generative-ai large-language-models learning machine-learning

Updated 10 months ago

odin-slides • Rank 7.8 • Science 44%

This is an advanced Python tool that empowers you to effortlessly draft customizable PowerPoint slides using the Generative Pre-trained Transformer (GPT) of your choice. Leveraging the capabilities of Large Language Models (LLM), odin-slides enables you to turn the lengthiest Word documents into well organized presentations.

ai-assistant chatgpt-api generative-ai hacktoberfest large-language-models machine-learning natural-language-processing openai-api portfolio powerpoint powerpoint-automation pptx presentation-tools productivity-tool productivity-tools prompt-engineering rag slide-generator slides writing-tool

Updated 10 months ago

stormtrooper • Rank 7.6 • Science 44%

Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.

chatgpt few-shot-learning gpt-4 large-language-models llm scikit-learn transformer transformers zero-shot-learning

Updated 10 months ago

sgpt • Rank 7.9 • Science 41%

SGPT: GPT Sentence Embeddings for Semantic Search

gpt information-retrieval language-model large-language-models neural-search retrieval semantic-search sentence-embeddings sgpt text-embedding

Updated 10 months ago

climsight • Rank 9.3 • Science 39%

A next-generation climate information system that uses large language models (LLMs) alongside high-resolution climate model data, scientific literature, and diverse databases to deliver accurate, localized, and context-aware climate assessments.

ai-for-climate climate-assessments climate-data climate-services large-language-models llm

Updated 10 months ago

odin-tabs • Rank 3.0 • Science 44%

The Odin Tabs extension is a browser extension that allows you to navigate through your browser tabs using speech recognition and the Large Language Model (LLM) of your choice.

accessibility artificial-intelligence assistive-technology chatgpt-api chrome-extension interaction-design large-language-models machine-learning natural-language-processing openai-api portfolio productivity-tools rag speech-to-text tab-management tab-navigation ui user-interface web-accessibility web-automation

Updated 10 months ago

gemgpt • Rank 1.1 • Science 44%

Explore the power of Gemma model with GemGPT, a project leveraging AI for innovative solutions. Join us in shaping the future of AI!

fine-tuning gem-gpt gemgpt gemma gemma-2b gemma-2b-it gemma-7b gemma-7b-it gpt gpt-gem gptgem large-language-model large-language-models llm llms openai python pytorch

Updated 10 months ago

https://github.com/bytedance/salmonn • Rank 9.0 • Science 36%

SALMONN family: A suite of advanced multi-modal LLMs

audio audio-processing audio-visual-understanding bytedance iclr2024 icml-2024 large-language-models multi-modal music research speech speech-recognition tsinghua-university video video-understanding

Updated 10 months ago

upgini • Rank 17.5 • Science 26%

Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs

automated-feature-engineering automl automl-pipeline chatgpt data-enrichment data-science feature-engineering feature-extraction feature-selection features kaggle kaggle-solution large-language-models llm machine-learning open-data open-datasets public-data python-library scikit-learn

Updated 10 months ago

napolab • Rank 7.0 • Science 36%

The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their performance across carefully curated Portuguese language tasks.

benchmarks catalan datasets english galician hate-speech huggingface huggingface-transformers large-language-models nlp portuguese python question-answering semantic-similarity spanish text-simplification textual-entailment transformers

Updated 10 months ago

https://github.com/bowang-lab/bioreason • Rank 5.8 • Science 36%

BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model

bioinformatics computational-biology dna foundation-models grpo large-language-models reasoning

Updated 10 months ago

https://github.com/bigscience-workshop/petals • Rank 18.7 • Science 23%

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

bloom chatbot deep-learning distributed-systems falcon gpt guanaco language-models large-language-models llama machine-learning mixtral neural-networks nlp pipeline-parallelism pretrained-models pytorch tensor-parallelism transformer volunteer-computing

Updated 10 months ago

statlingua • Rank 8.0 • Science 26%

Explain Statistical Output with Large Language Models

data-science explainability large-language-models llm llms statistics teaching-tools

Updated 10 months ago

https://github.com/amazon-science/auto-cot • Rank 9.1 • Science 23%

Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)

chain-of-thought gpt-3 gpt3-prompts gpt3-resources large-language-models prompt-engineering reasoning

Updated 10 months ago

cellama • Rank 5.0 • Science 26%

Cell type annotation with local Large Language Models (LLMs) - Ensuring privacy and speed with extensive customized reports

celltype large-language-models llm rna-seq scanpy seurat single-cell

Updated 10 months ago

genaibook • Rank 4.6 • Science 26%

"Generative AI in Action" book's code repository

ai azure azure-openai book gemini-api genai langchain large-language-models llama3 llm openai responsible semantic-k slm small-language-models

Updated 10 months ago

https://github.com/aisuko/notebooks • Rank 4.6 • Science 26%

Implementation for the different ML tasks on Kaggle platform with GPUs.

accelerator computer-vision fine-tuning kaggle large-language-models multimodal natural-language-processing neural-network peft pytorch quantization renforcement-learning tensorboard transformers visulization wandb

Updated 10 months ago

https://github.com/bigscience-workshop/xmtf • Rank 7.4 • Science 23%

Crosslingual Generalization through Multitask Finetuning

bloom bloomz instruction-tuning language-models large-language-models mt0 multilingual-nlp multitask-learning t5 zero-shot-learning

Updated 10 months ago

https://github.com/amazon-science/mezo_svrg • Rank 3.7 • Science 26%

Code the ICML 2024 paper: "Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models"

deep-learning fine-tuning language-model large-language-models llm-training llms machine-learning machine-learning-algorithms optimization optimization-algorithms svrg variance-reduction zero-order-methods

Updated 10 months ago

https://github.com/altunenes/calcarine • Rank 1.4 • Science 26%

Desktop VLM: Real-time FastVLM analysis of video & textures with live compute shaders

compute-shader fastvlm large-language-models onnxruntime shaders vision-language-model vlm wgpu

Updated 10 months ago

https://github.com/astorfi/llm-alignment-project • Rank 3.5 • Science 23%

A comprehensive template for aligning large language models (LLMs) using Reinforcement Learning from Human Feedback (RLHF), transfer learning, and more. Build your own customizable LLM alignment solution with ease.

ai alignment deep-learning generative-ai large-language-models llms machine-learning rlhf template

Updated 10 months ago

https://github.com/bigscience-workshop/data-preparation • Rank 8.1 • Science 13%

Code used for sourcing and cleaning the BigScience ROOTS corpus

dataset large-language-models multilingual

Updated 10 months ago

https://github.com/astorfi/large-scale-ai-blueprint • Rank 4.0 • Science 13%

A comprehensive guide designed to empower readers with advanced strategies and practical insights for developing, optimizing, and deploying scalable AI models in real-world applications.

deep-learning large-language-models large-scale large-scale-ai large-scale-machine-learning llms machinel-learning production-ml

Updated 10 months ago

https://github.com/agnostiqhq/tutorials_covalent_pycon_2024 • Rank 1.4 • Science 13%

agents ai ai-foundry autonomous-agents chatgpt covalent gpu hpc huggingface large-language-models llama llamacpp llm ml vllm

Updated 10 months ago

llms-from-scratch • Science 26%

Build your own Large Language Model from scratch with this code repository. Learn the ins and outs of LLMs like GPT. 🚀💻

bert book chatgpt deberta flan-t5 from-scratch language-model large-language-models llm llms-book machine-learning mcp neural-networks nlp prompt-engineering python pytorch roberta

Updated 10 months ago

https://github.com/chen-yang-liu/promptcc • Science 36%

PyTorch implementation of 'A Decoupling Paradigm With Prompt Learning for Remote Sensing Image Change Captioning'

large-language-models prompt-learning remote-sensing-image-change-captioning

Updated 10 months ago

curator • Science 44%

Scalable data pre processing and curation toolkit for LLMs

data data-curation data-prep data-preparation data-processing data-processing-pipelines data-quality datacuration datarecipes deduplication fast-data-processing fine-tuning large-language-models large-scale-data-processing llm llm-data-quality llmapps python semantic-deduplication

Updated 10 months ago

llm_seminar_series • Science 67%

Material for the series of seminars on Large Language Models

course large-language-models llm machine-learning roadmap seminar

Updated 10 months ago

langtest • Science 54%

Deliver safe & effective language models

ai-safety ai-testing artificial-intelligence benchmark-framework benchmarks ethics-in-ai large-language-models llm llm-as-evaluator llm-evaluation-toolkit llm-test llm-testing ml-safety ml-testing mlops model-assessment nlp responsible-ai trustworthy-ai

Updated 10 months ago

FMAT • Science 49%

😷 The Fill-Mask Association Test (FMAT): Measuring Propositions in Natural Language.

ai artificial-intelligence bert bert-model bert-models contextualized-representation fill-in-the-blank fill-mask huggingface language-model language-models large-language-models masked-language-models natural-language-processing natural-language-understanding nlp pretrained-models transformer transformers

Updated 10 months ago

xrayglm • Science 54%

🩺 首个会看胸部X光片的中文多模态医学大模型 | The first Chinese Medical Multimodal Model that Chest Radiographs Summarization.

large-language-models llms medical multimodal visualglm-6b xray

Updated 10 months ago

loogle • Science 41%

ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models

acl2024 large-language-models llm long-context

Updated 10 months ago

https://github.com/yu-yang-li/starwhisper • Science 36%

LLM for Astronomy[星语4.0]

astronomy astrophysics large-language-models llm

Updated 10 months ago

phenomics-assistant • Science 44%

LLM retrieval augmented generation agent for the Monarch Knowledge graph.

knowledge-graph large-language-models monarchinitiative

Updated 10 months ago

mergekit • Science 44%

Tools for merging pretrained Large Language Models and create Mixture of Experts (MoE) from open-source models.

dare-ties huggingface large-language-models leaderboard llm merge-llm mergekit mixture-of-experts moe slerp ties transformer

Updated 10 months ago

autonomousssiishare • Science 44%

Implementation and evaluation of an autonomous SSI-enhanced iSHARE framework, enabling decentralized identity management, schema alignment for property matching, and automated generation of alternative verification requests, improving scalability, privacy, and flexibility in data spaces and SSI ecosystems.

context-embeddings data-spaces decentralized-identifiers graph-embeddings ishare large-language-models schema-alignment veramo verifiable-credentials verifiable-presentations

Updated 10 months ago

longmem • Science 54%

Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".

large-language-models long-context-modeling long-term-memory

Updated 10 months ago

efficient-llms-survey • Science 67%

[TMLR 2024] Efficient Large Language Models: A Survey

efficient-deep-learning generative-ai large-language-models machine-learning-systems survey

Updated 10 months ago

https://github.com/bytedance/shot2story • Science 10%

A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.

benchmark dataset large-language-models research video-captioning video-language video-language-pretraining video-question-answering video-story video-story-generation video-summarization vision-language

Updated 10 months ago

dpl • Science 44%

[NeurIPS 2023] Multi-fidelity hyperparameter optimization with deep power laws that achieves state-of-the-art results across diverse benchmarks.

benchmark computer-vision deep-learning hpo hyperparameter-optimization hyperparameter-tuning large-language-models llm machine-learning mlp natural-language-processing neurips neurips-2023 power-laws scaling-laws tabular-data transformer

Updated 10 months ago

awesome-japanese-llm • Science 75%

日本語LLMまとめ - Overview of Japanese LLMs

foundation-models generative-ai generative-model generative-models japanese japanese-language japanese-language-model japanese-llm language-model language-models large-language-model large-language-models llm llm-japanese llms multimodal vision-and-language vision-language vision-language-model

Updated 10 months ago

aphra • Science 54%

An open-source translation agent designed to enhance the quality of text translations by leveraging large language models

agentic-workflow large-language-models prompt-engineering translation

Updated 10 months ago

moe-infinity • Science 54%

PyTorch library for cost-effective, fast and easy serving of MoE models.

huggingface inference-engine large-language-models llm-inference mixture-of-experts pytorch

Updated 10 months ago

flashrag • Science 67%

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

benchmark datasets large-language-models retrieval-augmented-generation

Updated 10 months ago

fuzi.mingcha • Science 57%

夫子•明察司法大模型是由山东大学、浪潮云、中国政法大学联合研发，以 ChatGLM 为大模型底座，基于海量中文无监督司法语料与有监督司法微调数据训练的中文司法大模型。该模型支持法条检索、案例分析、三段论推理判决以及司法对话等功能，旨在为用户提供全方位、高精准的法律咨询与解答服务。

chatglm-6b judicial large-language-models legal legal-ai legalai llms nlp pretrained-models

Updated 10 months ago

https://github.com/bigcode-project/selfcodealign • Science 13%

[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation

large-language-models llm llm4code

Updated 10 months ago

https://github.com/agnostiqhq/multi-agent-llm • Science 23%

Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)

agent ai cot genai iot large-language-models llm multi-agent multiagent-systems

Updated 10 months ago

https://github.com/ai4co/reevo • Science 36%

[NeurIPS 2024] ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution

ant-colony-optimization automatic-algorithm-generation bin-packing-problem electronic-design-automation evolutionary-algorithms genetic-algorithm hyper-heuristics large-language-models llm-agent multiple-knapsack-problem neural-combinatorial-optimization orienteering-problem reinforcement-learning traveling-salesman-problem vehicle-routing-problem

Updated 10 months ago

kg-mm-survey • Science 67%

Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

awsome awsome-list cross-modal-retrieval entity-alignment entity-linking image-classification image-generation information-extraction knowledge-graph knowledge-graph-embeddings large-language-models multi-modal-fusion multi-modal-knowledge-graph multi-modal-learning paper-list survey surveys visual-question-answering

Updated 10 months ago

author-profiling-pan2023 • Science 44%

Symbol Team model for PAN@AP 2023 shared task on Profiling Cryptocurrency Influencers with Few-shot Learning

author-profiling contrastive-learning encoder-decoder-model few-shot-learning flan-t5 large-language-models

Updated 10 months ago

folktexts • Science 52%

Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!

fairness large-language-models machine-learning python tabular-data transformers uncertainty

Updated 10 months ago

vellmes-ai-deception-framework • Science 44%

Interactive, dynamic, and realistic LLM honeypots

deception honeypots large-language-models llm security

Updated 10 months ago

automated-brain-explanations • Science 36%

Generating and validating natural-language explanations for the brain.

ai-for-science artificial-intelligence automated-interpretability data-science explanation fmri fmri-data-analysis gpt gpt4 huggingface interpretability interpretable-embeddings language-model large-language-models machine-learning mechanistic-interpretability natural-language-processing neuroscience xai

Updated 10 months ago

https://github.com/amazon-science/dstc12-controllable-conversational-theme-detection • Science 36%

Data & code for DSTC12, Controllable Conversational Theme Detection track

challenge conversational-ai deep-learning dialog dialog-system dialog-system-technology-challenge dstc intent large-language-models llm machine-learning natural-language-processing neural-network nlp theme