Projects | Open Source Science

Updated 11 months ago

lazyllm-llamafactory • Rank 25.6 • Science 77%

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

agent ai deepseek fine-tuning gemma gpt instruction-tuning large-language-models llama llama3 llm lora moe nlp peft qlora quantization qwen rlhf transformers

Updated 4 months ago

chatAI4R: Interactive Artificial Intelligence toolkit for Data Science in R • Rank 8.4 • Science 87%

chatAI4R: Interactive Artificial Intelligence toolkit for Data Science in R - Published in JOSS (2026)

ai bioinformatics chatgpt gpt image image-generation r

Updated 10 months ago

@openhands/ui • Rank 27.3 • Science 64%

🙌 OpenHands: Code Less, Make More

agent artificial-intelligence chatgpt claude-ai cli developer-tools gpt llm openai

Updated 11 months ago

spezillm • Rank 8.0 • Science 77%

Large Language Model (LLM) module for the Spezi Ecosystem

chatbot gpt ios large-language-models llm openai spezi swift swiftui

Updated 11 months ago

curategpt • Rank 6.9 • Science 77%

LLM-driven curation assist tool

ai biocuration curation gpt llm monarchinitiative obofoundry ontogpt ontologies ontology-tools

Updated 11 months ago

rwkv-lm • Rank 11.2 • Science 67%

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

attention-mechanism chatgpt deep-learning gpt gpt-2 gpt-3 language-model linear-attention lstm pytorch rnn rwkv transformer transformers

Updated 11 months ago

tensorzero • Rank 23.4 • Science 54%

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

ai ai-engineering anthropic artificial-intelligence deep-learning genai generative-ai gpt large-language-models llama llm llmops llms machine-learning ml ml-engineering mlops openai python rust

Mathematics (40%)

Updated 10 months ago

https://github.com/bowang-lab/scgpt • Rank 17.0 • Science 59%

foundation-model gpt single-cell

Updated 11 months ago

llms-from-scratch • Rank 14.9 • Science 54%

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

ai artificial-intelligence chatgpt deep-learning from-scratch gpt language-model large-language-models llm machine-learning python pytorch transformer

Updated 11 months ago

sesgos_llm • Rank 1.1 • Science 67%

¿Cómo “se equivocan” los modelos LLM?

bard bard-api codigo cohere datos gpt llm sesgo

Updated 11 months ago

tokenizers • Rank 14.0 • Science 54%

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

bert gpt language-model natural-language-processing natural-language-understanding nlp transformers

Updated 11 months ago

medicalgpt • Rank 10.7 • Science 54%

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型，实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。

chatgpt dpo gpt llama llm medical medicalgpt

Updated 11 months ago

surprisal • Rank 10.1 • Science 54%

A unified interface for computing surprisal (log probabilities) from language models! Supports neural, symbolic, and black-box API models.

gpt language-modeling large-language-models log-likelihood next-word-prediction surprisal

Updated 11 months ago

@llm-tools/embedjs • Rank 17.7 • Science 44%

A NodeJS RAG framework to easily work with LLMs and embeddings

ai chatgpt claude cohere embedding embeddings gpt gpt-4 gpt-4o huggingface large-language-models llm mistral ollama openai pinecone rag vector-database vertex-ai

Updated 10 months ago

https://github.com/nixtla/nixtla • Rank 22.6 • Science 36%

TimeGPT-1: production ready pre-trained Time Series Foundation Model for forecasting and anomaly detection. Generative pretrained transformer for time series trained on over 100B data points. It's capable of accurately predicting various domains such as retail, electricity, finance, and IoT with just a few lines of code 🚀.

anomaly-detection artificial-intelligence deep-learning forecasting generative-ai-time-series gpt gpts llm machine-learning time-series time-series-forecasting timegpt

Updated 11 months ago

xfinder • Rank 6.6 • Science 51%

[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation

benchmark cc-by-nc-nd-4 chatglm dataset evaluation gpt judge-model key-answer-extraction large-language-models llm llm-as-a-judge llm-as-evaluator lm-evaluation open-compass phi qwen regex reliability reliable-evaluation xfinder

Updated 11 months ago

iamai • Rank 13.6 • Science 44%

A rule-driven comprehensive AI toolkit emphasizing simultaneous support for multimodal machine learning and the ability to construct cross-platform robots using logic.（规则驱动式的综合性人工智能工具库，强调同时支持多模态机器学习和利用逻辑构建跨平台机器人的能力）

ai artificial-intelligence bot flow flowchart gpt logic logic-programming machine-learning ml python rasa rete rules whl

Updated 11 months ago

llama_ros • Rank 7.2 • Science 44%

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

audio cpp embeddings ggml gguf gpt langchain llama llamacpp llava llavacpp llm multimodal rerank reranking ros2 vlm

Updated 11 months ago

chatgpt-api-scanner • Rank 7.0 • Science 44%

Scan GitHub for available OpenAI API Keys

chatgpt chatgpt-api freegpt gpt gpt-3 gpt4 selenium

Updated 11 months ago

prompt-engineering-guide-zh-cn • Rank 6.6 • Science 44%

🐙 关于提示词工程（prompt）的指南、论文、讲座、笔记本和资源大全（自动持续更新）

chatgpt gpt gpt4 openai prompt prompt-engineering

Updated 11 months ago

sgpt • Rank 7.9 • Science 41%

SGPT: GPT Sentence Embeddings for Semantic Search

gpt information-retrieval language-model large-language-models neural-search retrieval semantic-search sentence-embeddings sgpt text-embedding

Updated 11 months ago

llama-classification • Rank 4.7 • Science 44%

Text classification with Foundation Language Model LLaMA

classification foundation-model gpt language-model llama pytorch

Updated 11 months ago

lrv-instruction • Rank 5.7 • Science 41%

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

chatgpt evaluation evaluation-metrics foundation-models gpt gpt-4 hallucination iclr iclr2024 llama llava multimodal object-detection prompt-engineering vicuna vision vision-and-language vqa

Updated 11 months ago

gpt • Rank 1.8 • Science 44%

Generative Pre-trained Transformer in PyTorch from scratch

attention deep-learning from-scratch gpt machine-learning pytorch transformer

Updated 11 months ago

gemgpt • Rank 1.1 • Science 44%

Explore the power of Gemma model with GemGPT, a project leveraging AI for innovative solutions. Join us in shaping the future of AI!

fine-tuning gem-gpt gemgpt gemma gemma-2b gemma-2b-it gemma-7b gemma-7b-it gpt gpt-gem gptgem large-language-model large-language-models llm llms openai python pytorch

Updated 11 months ago

minigpt • Rank 0.7 • Science 44%

An open-source project to show how to build a mini language model using PyTorch

gpt neural-networks pytorch small-language-models

Updated 11 months ago

video-compression-and-future-prediction-using-gpt • Rank 0.0 • Science 44%

This repository presents a project focused on advanced video compression and future prediction using Generative Pre-trained Transformer (GPT) and other state-of-the-art techniques.

computer-vision deep-learning gpt nlp python pytorch vq-vae

Updated 10 months ago

https://github.com/bigscience-workshop/petals • Rank 18.7 • Science 23%

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

bloom chatbot deep-learning distributed-systems falcon gpt guanaco language-models large-language-models llama machine-learning mixtral neural-networks nlp pipeline-parallelism pretrained-models pytorch tensor-parallelism transformer volunteer-computing

Updated 11 months ago

autogpt • Rank 14.8 • Science 26%

🦀 A Pure Rust Framework For Building AGI (WIP).

adk agent agi ai artificial-intelligence autogpt evcxr gemini gemini-flash gemini-flash-2 getimg gpt jupyter-notebook nylas-api openai rust sdk stable-diffusion

Updated 11 months ago

keras-gpt-copilot • Rank 6.7 • Science 31%

Integrate an LLM copilot within your Keras model development workflow

deep-learning deep-neural-networks gpt keras llms openai skynet

Updated 11 months ago

quickai • Rank 10.5 • Science 26%

QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.

ai artificial-intelligence bert deep-learning dl easy-to-use fast gpt gpt-neo huggingface-transformers ml neural-network nlp object-detection python pytorch quickai research tensorflow2 yolo

Updated 10 months ago

https://github.com/gabotechs/musicgpt • Rank 19.1 • Science 13%

Generate music based on natural language prompts using LLMs running locally

ai gpt llm machine-learning music

Updated 10 months ago

https://github.com/cedrickchee/chatgpt-universe • Rank 6.6 • Science 23%

ChatGPT Universe is fleeting notes on ChatGPT, GPT, and large language models (LLMs)

chatgpt generative-model gpt language-models resource-list

Updated 10 months ago

https://github.com/bytedance/lightseq • Rank 18.7 • Science 10%

LightSeq: A High Performance Library for Sequence Processing and Generation

accelerate bart beam-search bert cuda diverse-decoding gpt inference multilingual-nmt sampling training transformer

Updated 10 months ago

https://github.com/dc-research/tempo • Science 36%

The official code for "TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting (ICLR 2024)". TEMPO is one of the very first open source Time Series Foundation Models for forecasting task v1.0 version.

forecasting forecasting-models forecasting-time-series foundation-models gpt pretrained-language-model pretrained-models time-series time-series-analysis transformer transformers transformers-models

Updated 11 months ago

llm-reliability • Science 49%

Code for the paper "Larger and more instructable language models become less reliable"

bloom evaluation gpt llama llm reliability rlhf scaling supervision

Updated 10 months ago

https://github.com/claromes/resumo-periodico • Science 13%

Telegram bot para resumos de artigos científicos

generative-ai gpt grobid telegram-bot

Updated 11 months ago

gptlint • Science 44%

A linter with superpowers! 🔥 Use LLMs to enforce best practices across your codebase.

best-practices gpt linter llms static-analysis

Updated 10 months ago

https://github.com/asanchezyali/ai-avatar • Science 13%

Zippy Talking Avatar uses Azure Cognitive Services and OpenAI API to generate text and speech. It is built with Next.js and Tailwind CSS. This avatar responds to user input by generating both text and speech, offering a dynamic and immersive user experience

ai ai-avatars azure-cognitive-services digital-human gpt langchain lip-sync llm openai talking-head talking-heads tts-api visemes

Updated 10 months ago

https://github.com/johnsnowlabs/johnsnowlabs • Science 26%

Gateway into the John Snow Labs Ecosystem

bert databricks gpt machine-learning natural-language-processing nlp python seq2seq spark t5

Updated 11 months ago

gpt4dfci • Science 57%

A private and secure generative AI tool, based on GPT-4 and deployed for non-clinical use at Dana-Farber Cancer Institute

chatgpt gpt gpt-4 llmops operations research

Updated 11 months ago

black • Science 26%

ad-blocker agents ai android-unpack autopep8 awesome blackbox blocker dashboard dhcp dhcp-server gofmt gpt loopback python raspberry-pi unpacker virtualapp

Updated 10 months ago

https://github.com/adithya-s-k/world-of-ai • Science 13%

WORLD OF AI : An open-source repository for AI-based projects 🚀, from beginner to expert level, helping contributors start their journey in Artificial Intelligence and Deep Learning. Our projects provide hands-on experience to real-world problems👨‍💻. Join our community and contribute to the development of AI-based solutions 👥.

ai deep-learning generative-ai gpt machine-learning neural-network open-source

Updated 11 months ago

operagents • Science 44%

Dynamic, highly customizable language agents framework

agent crewai gpt langgraph language-model sop

Updated 11 months ago

llm_from_scratch • Science 44%

I am using here just to share what I have been struggling to understand

fine-tuning-llm gpt gpt-from-scratch instruction-tuning nlp pytorch tutorial

Updated 11 months ago

ai-essayist • Science 44%

SKKU AI X Bookathon 4회 [쿠봇] 팀의 레포지토리입니다.

essay-generation gpt natural-language-generation pytorch-lightning transformers

Updated 11 months ago

awesome-attention-heads • Science 67%

An awesome repository & A comprehensive survey on interpretability of LLM attention heads.

attention-head-mining attention-mechanism awesome chain-of-thought circuit-analysis cognitive-neuroscience gpt interpretability large-language-models llm llm-reasoning machine-psychology research-paper survey transformer visualization-tools

Updated 11 months ago

firebase-gpt-chat-completion-stream • Science 44%

Example of ChatGPT Stream implementation in Firebase with functions. For this method you have to use `fetch`.

chatgpt chatgpt-api firebase firebase-function firebase-functions gpt openai openai-api

Updated 10 months ago

https://github.com/copyleftdev/shellbot • Science 26%

ShellBot is a vibrant CLI tool that leverages OpenAI's cutting-edge AI to provide expert guidance on computer science, cloud computing, and Linux systems. Designed for tech enthusiasts and IT professionals alike, ShellBot answers your queries, troubleshoots issues, and shares best practices—all with a splash of color for enhanced readability

ai bash golang gpt zsh

Updated 11 months ago

llm-confidentiality • Science 54%

Whispers in the Machine: Confidentiality in Agentic Systems

chatgpt confidentiality deep-learning framework gpt llm llm-security machine-learning openai prompt-engineering prompt-injection prompt-toolkit security systems-security transformers

Updated 11 months ago

llm-change-agent • Science 67%

LLM Change Agent is a command-line interface (CLI) tool designed to interact with various language models for generating and modifying ontology definitions. It offers a range of functionalities to streamline KGCL command generation tasks.

ai artificial-intelligence change-language gpt kgcl llm ontologies ontology

Updated 11 months ago

memit • Science 44%

Mass-editing thousands of facts into a transformer memory (ICLR 2023)

editing gpt pytorch transformer

Updated 11 months ago

system-outliers • Science 44%

Deviance classification

anomaly-detection framework gpt outliers

Updated 11 months ago

automated-brain-explanations • Science 36%

Generating and validating natural-language explanations for the brain.

ai-for-science artificial-intelligence automated-interpretability data-science explanation fmri fmri-data-analysis gpt gpt4 huggingface interpretability interpretable-embeddings language-model large-language-models machine-learning mechanistic-interpretability natural-language-processing neuroscience xai

Updated 11 months ago

rome • Science 44%

Locating and editing factual associations in GPT (NeurIPS 2022)

gpt interpretability pytorch transformers

Updated 10 months ago

https://github.com/algo7/day-planner-gpt-data-portal • Science 13%

Data integration solution designed to seamlessly connect data sources such as emails, news feeds, and calendar information to plan your day using OpenAI's GPTs

chatgpt go golang gpt

Updated 11 months ago

ai_story_scale • Science 67%

The AI story scale (AISS): A human rating scale for texts written with generative language models.

artificial-intelligence gpt language-model novelai psychometrics

Updated 10 months ago

https://github.com/amazon-science/transformers-data-augmentation • Science 26%

Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper

bart bert bert-model data-augmentation gpt