Scientific Software
Updated 6 months ago

ollamar — Peer-reviewed • Rank 12.7 • Science 93%

ollamar: An R package for running large language models - Published in JOSS (2025)

Scientific Software · Peer-reviewed
Updated 6 months ago

litgpt • Rank 23.6 • Science 64%

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Updated 6 months ago

kani • Rank 8.3 • Science 77%

kani (カニ) is a highly hackable microframework for tool-calling language models. (NLP-OSS @ EMNLP 2023)

Updated 6 months ago

oumi • Rank 20.3 • Science 64%

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Updated 6 months ago

flexrag • Rank 12.8 • Science 67%

FlexRAG: A RAG Framework for Information Retrieval and Generation.

Updated 6 months ago

gpt-researcher • Rank 15.2 • Science 64%

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

Updated 6 months ago

tensorzero • Rank 23.4 • Science 54%

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

Updated 6 months ago

guardrails • Rank 1.9 • Science 67%

VSCode extension to help developers set up guardrails around their functions, by helping them disambiguate purpose statements.

Updated 6 months ago

distilabel • Rank 11.6 • Science 54%

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Updated 6 months ago

https://github.com/lancedb/lance • Rank 28.1 • Science 36%

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

Updated 6 months ago

linear-relational • Rank 7.8 • Science 54%

Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch

Updated 6 months ago

opennmt-py • Rank 24.6 • Science 36%

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Updated 6 months ago

ml-keyframer • Rank 3.8 • Science 54%

Keyframer: Empowering Animation Design Using Large Language Models

Updated 6 months ago

https://github.com/adaptivemotorcontrollab/amadeusgpt • Rank 11.4 • Science 46%

We turn natural language descriptions of behaviors into machine-executable code

Updated 6 months ago

https://github.com/assert-kth/repairllama • Rank 5.5 • Science 46%

RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program Repair http://arxiv.org/pdf/2312.15698

Updated 6 months ago

tamingllms • Rank 7.2 • Science 44%

Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software

Updated 6 months ago

symmetry-cli • Rank 4.4 • Science 44%

The client for the Symmetry peer-to-peer inference network. Enabling users to connect with each other, share computational resources, and collect valuable machine learning data.

Updated 6 months ago

boilerbot • Rank 1.9 • Science 44%

Official Open-Source Implementation of BoilerBot: A Reliable Task-Oriented Chatbot Enhanced with Large Language Models.

Updated 6 months ago

gemgpt • Rank 1.1 • Science 44%

Explore the power of Gemma model with GemGPT, a project leveraging AI for innovative solutions. Join us in shaping the future of AI!

Updated 6 months ago

token-wars-dataviz • Rank 0.0 • Science 44%

A data visualisation in `matplotlib` of the number of parameters in major LLMs as well as the number of tokens of text they were trained on.

Updated 6 months ago

https://github.com/amazon-science/refchecker • Rank 8.2 • Science 33%

RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.

Updated 6 months ago

https://github.com/bminixhofer/tokenkit • Rank 3.5 • Science 36%

A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.

Updated 4 months ago

https://github.com/google-deepmind/simulation_streams • Rank 3.0 • Science 36%

Simulation Streams is a programming paradigm designed to efficiently control and leverage Large Language Models (LLMs) for complex, dynamic simulations and agentic workflows.

Updated 6 months ago

keras-gpt-copilot • Rank 6.7 • Science 31%

Integrate an LLM copilot within your Keras model development workflow

Updated 6 months ago

statlingua • Rank 8.0 • Science 26%

Explain Statistical Output with Large Language Models

Updated 6 months ago

https://github.com/spencerpresley/academicmetrics • Rank 7.7 • Science 26%

AI-powered toolkit for analyzing and classifying academic research publications using LLMs and automated data collection. Output options: Mongodb database via providing your databse url. Json. Excel spreadsheet. See README for the quick setup, see documentation for implementation details.

Updated 4 months ago

https://github.com/deepset-ai/haystack-demos • Rank 7.4 • Science 26%

Fully working applications that demonstrate how to use Haystack to implement various use cases

Updated 6 months ago

tarsier • Rank 20.1 • Science 13%

Vision utilities for web interaction agents 👀

Updated 6 months ago

https://github.com/citiususc/blinkg • Rank 0.7 • Science 26%

BLINKG: Benchmark for LLM-Integrated Knowledge Graph Generation

Updated 6 months ago

https://github.com/astorfi/llm-alignment-project • Rank 3.5 • Science 23%

A comprehensive template for aligning large language models (LLMs) using Reinforcement Learning from Human Feedback (RLHF), transfer learning, and more. Build your own customizable LLM alignment solution with ease.

Updated 6 months ago

https://github.com/astorfi/large-scale-ai-blueprint • Rank 4.0 • Science 13%

A comprehensive guide designed to empower readers with advanced strategies and practical insights for developing, optimizing, and deploying scalable AI models in real-world applications.

Updated 6 months ago

https://github.com/pixeltable/pixelagent • Science 26%

Pixelagent — Build your own stateful agent framework

Updated 6 months ago

https://github.com/amazon-science/factual-confidence-of-llms • Science 13%

Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"

Updated 6 months ago

semexp-thesis • Science 77%

The Semantic Workspace: Augmenting Exploratory Programming with Integrated Generative AI Tools [Master's Thesis]

Updated 6 months ago

fuzi.mingcha • Science 57%

夫子•明察司法大模型是由山东大学、浪潮云、中国政法大学联合研发,以 ChatGLM 为大模型底座,基于海量中文无监督司法语料与有监督司法微调数据训练的中文司法大模型。该模型支持法条检索、案例分析、三段论推理判决以及司法对话等功能,旨在为用户提供全方位、高精准的法律咨询与解答服务。

Updated 6 months ago

search-agents • Science 54%

Code for the paper 🌳 Tree Search for Language Model Agents

Updated 6 months ago

hackathon-leaderboard • Science 44%

Automated Leaderboard System for Hackathon Evaluation Using Large Language Models

Updated 6 months ago

https://github.com/aiplanethub/ai-stacks • Science 26%

Explore a variety of use cases and example stacks powered by LLMs, seamlessly integrated into the GenAI Stack platform.

Updated 6 months ago

https://github.com/aielte-research/hacksynth • Science 23%

LLM Agent and Evaluation Framework for Autonomous Penetration Testing

Updated 6 months ago

llmstack • Science 26%

No-code multi-agent framework to build LLM Agents, workflows and applications with your data

Updated 6 months ago

intro-llms-python • Science 44%

Introduction to Large Language Models in Python

Updated 6 months ago

https://github.com/akaiko1/langchain_examples • Science 26%

Practical, minimal examples for building with LangChain

Updated 6 months ago

https://github.com/chonghin33/semantic-logic-system-1.0 • Science 39%

Semantic Logic System v1.0 — A modular prompt-based semantic framework for LLMs.

Updated 6 months ago

lcm-1.13-whitepaper • Science 57%

This project contains the original white paper for Language Construct Modeling (LCM) v1.13, authored by Vincent Shing Hin Chong. It introduces a novel framework for prompt-layered semantic control in large language models (LLMs), built upon the Meta Prompt Layering (MPL) structure. LCM formalizes a modular system of prompt orchestration, enabling

Updated 6 months ago

awesome-rlaif • Science 54%

A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)

Updated 6 months ago

pie-perf • Science 36%

Training language models to make programs faster

Updated 6 months ago

interactive-learning-duo • Science 44%

Unlock Your Coding Potential: Explore, Learn, and Code with Our Interactive Python and SQL ChatBot!

Updated 6 months ago

https://github.com/safellmhub/hguard-go • Science 26%

Guardrails for LLMs: detect and block hallucinated tool calls to improve safety and reliability.

Updated 6 months ago

awesome-llms • Science 44%

🤓 A collection of AWESOME structured summaries of Large Language Models (LLMs)

Updated 6 months ago

https://github.com/chirayu-tripathi/mongodb-querifier • Science 13%

Improving LLMs MongoDB query generation capability with the help of advanced retrieval augmented generation.

Updated 6 months ago

llmswitcher • Science 44%

Routes to the most performant and cost efficient LLM based on your prompt [ 🚧 WIP ]

Updated 6 months ago

https://github.com/cosmaadrian/strawberry-problem • Science 36%

Official repository for "The Strawberry Problem 🍓: Emergence of Character-level Understanding in Tokenized Language Models"

Updated 6 months ago

llms4om • Science 54%

LLMs4OM: Matching Ontologies with Large Language Models

Updated 6 months ago

scisum • Science 31%

Resources for the paper Can Large Language Model Summarizers Adapt to Diverse Scientific Communication Goals? (ACL 2024).

Updated 6 months ago

pkgmatch • Science 26%

Find R packages matching either descriptions or other R packages

Updated 6 months ago

xrayglm • Science 54%

🩺 首个会看胸部X光片的中文多模态医学大模型 | The first Chinese Medical Multimodal Model that Chest Radiographs Summarization.

Updated 6 months ago

mdi-llm • Science 54%

Implementation of Model-Distributed Inference for Large Language Models, built on top of LitGPT

Updated 6 months ago

ai-for-grant-writing • Science 67%

A curated list of resources for using LLMs to develop more competitive grant applications.

Updated 6 months ago

backdoorllm • Science 36%

BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models

Updated 6 months ago

gptlint • Science 44%

A linter with superpowers! 🔥 Use LLMs to enforce best practices across your codebase.

Updated 6 months ago

curlora • Science 67%

The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.

Updated 6 months ago

https://github.com/asanchezyali/talking-avatar-with-ai • Science 13%

This project is a digital human that can talk and listen to you. It uses OpenAI's GPT to generate responses, OpenAI's Whisper to transcript the audio, Eleven Labs to generate voice and Rhubarb Lip Sync to generate the lip sync.

Updated 6 months ago

fed-rag • Science 75%

A framework for fine-tuning retrieval-augmented generation (RAG) systems.

Updated 4 months ago

https://github.com/dimits-ts/synthetic_moderation_experiments • Science 26%

Experiments relating to synthetic LLM user-agents and LLM facilitators in online discussions

Updated 6 months ago

ColBERT • Science 41%

Efficient late-interaction retrieval systems in Julia!

Updated 6 months ago

awesome-llm-papers.github.io • Science 26%

A curated collection of the most impactful papers, tools, and resources on Large Language Models (LLMs). Continuously updated to help researchers, developers, and enthusiasts stay on top of LLM advancements.

Updated 6 months ago

self-refine • Science 54%

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.