Projects | Open Source Science

Scientific Software

Updated 10 months ago

Turftopic — Peer-reviewed • Rank 13.8 • Science 98%

Turftopic: Topic Modelling with Contextual Representations from Sentence Transformers - Published in JOSS (2025)

contextual llm topic-modeling transformers

Mathematics

Scientific Software · Peer-reviewed

Scientific Software

Updated 10 months ago

UralicNLP — Peer-reviewed • Rank 12.8 • Science 98%

UralicNLP: An NLP Library for Uralic Languages - Published in JOSS (2019)

clustering conll-u constraint-grammar dutch finnish french fst german large-language-model lemmatizer llm moksha morphological-analysis morphological-generation nlp-library russian sami spanish swedish uralic-languages

Scientific Software · Peer-reviewed

Scientific Software

Updated 10 months ago

LangFair — Peer-reviewed • Rank 14.9 • Science 95%

LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases - Published in JOSS (2025)

ai ai-safety artificial-intelligence bias bias-detection ethical-ai fairness fairness-ai fairness-ml fairness-testing large-language-models llm llm-evaluation llm-evaluation-framework llm-evaluation-metrics python responsible-ai

Engineering Mathematics (42%)

Scientific Software · Peer-reviewed

Scientific Software

Updated 10 months ago

EcoLogits — Peer-reviewed • Rank 8.1 • Science 98%

EcoLogits: Evaluating the Environmental Impacts of Generative AI - Published in JOSS (2025)

genai generative-ai green-ai green-software llm llm-inference python sustainability sustainable-ai

Scientific Software · Peer-reviewed

Scientific Software

Updated 10 months ago

ollamar — Peer-reviewed • Rank 12.7 • Science 93%

ollamar: An R package for running large language models - Published in JOSS (2025)

ai api llm llms ollama ollama-api r

Scientific Software · Peer-reviewed

Updated 10 months ago

trafilatura • Rank 26.3 • Science 77%

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

article-extractor corpus-builder corpus-tools crawler html-to-markdown html2text llm news-aggregator news-crawler nlp rag readability rss-feed scraping tei text-cleaning text-extraction text-mining text-preprocessing web-scraping

Updated 10 months ago

transformers • Rank 38.7 • Science 64%

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

audio deep-learning deepseek gemma glm hacktoberfest llm machine-learning model-hub natural-language-processing nlp pretrained-models python pytorch pytorch-transformers qwen speech-recognition transformer vlm

Updated 10 months ago

lazyllm-llamafactory • Rank 25.6 • Science 77%

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

agent ai deepseek fine-tuning gemma gpt instruction-tuning large-language-models llama llama3 llm lora moe nlp peft qlora quantization qwen rlhf transformers

Scientific Software

Updated 10 months ago

MM-PoE — Peer-reviewed • Rank 5.8 • Science 93%

MM-PoE: Multiple Choice Reasoning via. Process of Elimination using Multi-Modal Models - Published in JOSS (2025)

inference llm multi-modal question-answering

Artificial Intelligence and Machine Learning

Scientific Software · Peer-reviewed

Updated 10 months ago

datasets • Rank 34.4 • Science 64%

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

ai artificial-intelligence computer-vision dataset-hub datasets deep-learning llm machine-learning natural-language-processing nlp numpy pandas pytorch speech tensorflow

Updated 10 months ago

ontogpt • Rank 16.7 • Science 77%

LLM-based ontological extraction tools, including SPIRES

ai chat-gpt data-modeling gpt-3 information-extraction language-models large-language-models linkml llm monarchinitiative named-entity-recognition ner nlp oaklib obofoundry relation-extraction

Updated 10 months ago

maidr-legacy • Rank 7.4 • Science 85%

[DEPRECATED prototype] Multimodal Access and Interactive Data Representation

ai blind braille chart data description image impairments llm low-vision multimodality plot representation science sonification tactile visual visualization

Updated 10 months ago

biochatter • Rank 15.0 • Science 77%

Backend library for conversational AI in biomedicine

biocypher chatbot knowledge-graph llm retrieval-augmented-generation vector-database

Updated 10 months ago

gpullama3.java • Rank 6.6 • Science 85%

GPU-accelerated Llama3.java inference in pure Java using TornadoVM.

accelerators compilers deepseek-r1 gguf gpu java java21 llama3 llm mistral mistral-7b nvidia phi-3 phi-3-mini qwen2-5 qwen3 tornadovm

Updated 10 months ago

@openhands/ui • Rank 27.3 • Science 64%

🙌 OpenHands: Code Less, Make More

agent artificial-intelligence chatgpt claude-ai cli developer-tools gpt llm openai

Updated 10 months ago

embed • Rank 20.7 • Science 67%

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

bert-embeddings llm text-embeddings

Updated 10 months ago

litgpt • Rank 23.6 • Science 64%

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

ai artificial-intelligence deep-learning large-language-models llm llm-inference llms

Updated 10 months ago

lida • Rank 19.4 • Science 67%

Automatic Generation of Visualizations and Infographics using Large Language Models

cohere datavisualization hacktoberfest llm openai openai-api palm2 visualization

Updated 10 months ago

txtai • Rank 22.2 • Science 64%

💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows

ai artificial-intelligence embeddings information-retrieval language-model large-language-models llm machine-learning nlp python rag retrieval-augmented-generation search search-engine semantic-search sentence-embeddings transformers txtai vector-database vector-search

Updated 10 months ago

spezillm • Rank 8.0 • Science 77%

Large Language Model (LLM) module for the Spezi Ecosystem

chatbot gpt ios large-language-models llm openai spezi swift swiftui

Updated 10 months ago

kubeflow-katib • Rank 20.7 • Science 64%

Automated Machine Learning on Kubernetes

ai automl huggingface hyperparameter-tuning jax kubeflow kubernetes llm machine-learning mlops neural-architecture-search pytorch scikit-learn tensorflow

Updated 10 months ago

curategpt • Rank 6.9 • Science 77%

LLM-driven curation assist tool

ai biocuration curation gpt llm monarchinitiative obofoundry ontogpt ontologies ontology-tools

Updated 10 months ago

dolma • Rank 19.6 • Science 64%

Data and tools for generating and inspecting OLMo pre-training data.

data-processing large-language-models llm machile-learning nlp

Updated 10 months ago

llama_index • Rank 33.9 • Science 49%

LlamaIndex is the leading framework for building LLM-powered agents over your data.

agents application data fine-tuning framework llamaindex llm multi-agents rag vector-database

Engineering (40%)

Updated 10 months ago

farm-haystack • Rank 28.7 • Science 54%

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

agent agents ai gemini generative-ai gpt-4 information-retrieval large-language-models llm machine-learning nlp orchestration python pytorch question-answering rag retrieval-augmented-generation semantic-search summarization transformers

Updated 10 months ago

letta_api • Rank 18.4 • Science 64%

Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.

ai ai-agents llm llm-agent

Updated 10 months ago

kvpress • Rank 14.6 • Science 67%

LLM KV cache compression made easy

inference kv-cache kv-cache-compression large-language-models llm long-context python pytorch transformers

Updated 10 months ago

ray • Rank 35.1 • Science 46%

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

data-science deep-learning deployment distributed hyperparameter-optimization hyperparameter-search large-language-models llm llm-inference llm-serving machine-learning optimization parallel python pytorch ray reinforcement-learning rllib serving tensorflow

Updated 10 months ago

evalica • Rank 13.9 • Science 67%

Evalica, your favourite evaluation toolkit

arena bradley-terry elo evalica evals evaluation hacktoberfest leaderboard library llm pagerank pairwise-comparison pyo3 python ranking rating rust serbia statistics winrate

Updated 10 months ago

bentoml • Rank 26.1 • Science 54%

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

ai-inference deep-learning generative-ai inference-platform llm llm-inference llm-serving llmops machine-learning ml-engineering mlops model-inference-service model-serving multimodal python

Updated 10 months ago

cntext • Rank 12.3 • Science 67%

Text analysis package supporting multiple methods, including word count, readability, document similarity, sentiment analysis, Word2Vec/GloVe, and Large Language Models (LLMs).

chinese content-analysis discourse-analysis glove llm nlp semantic-analysis sentiment-analysis social-science text-analysis text-mining word2vec

Updated 10 months ago

bigcodebench • Rank 15.3 • Science 64%

[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI

agent agents benchmark chatgpt claude-3 code-generation deepseek function-calling gemini gpt-4 instruction-following large-language-models llm program-synthesis tool-use

Updated 10 months ago

tiny_qa_benchmark_pp • Rank 2.1 • Science 77%

Tiny QA Benchmark++: a micro-benchmark suite (52-item gold + on-demand multilingual synthetic packs), generator CLI, and CI-ready eval harness for ultra-fast LLM smoke-testing & regression-catching.

benchmark dataset evaluation huggingface-datasets litellm llm llm-testing llmops qa-dataset smoke-test synthetic-data tinybenchmarks

Updated 10 months ago

pandas-ai • Rank 24.5 • Science 54%

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

ai csv data data-analysis data-science data-visualization database datalake gpt-4 llm pandas sql text-to-sql

Updated 10 months ago

medusa-llm • Rank 13.6 • Science 64%

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

llm llm-inference

Updated 10 months ago

tensorzero • Rank 23.4 • Science 54%

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

ai ai-engineering anthropic artificial-intelligence deep-learning genai generative-ai gpt large-language-models llama llm llmops llms machine-learning ml ml-engineering mlops openai python rust

Mathematics (40%)

Updated 10 months ago

learn_prompting • Rank 13.3 • Science 64%

Prompt Engineering, Generative AI, and LLM Guide by Learn Prompting | Join our discord for the largest Prompt Engineering learning community

chatgpt chatgpt-api deep-learning gpt-3 gpt-4 gpt-4-api gpt3 large-language-models llm machine-learning nlp openai-api prompt-engineering prompt-toolkit prompt-tuning prompting transformers

Updated 10 months ago

asreview • Rank 19.2 • Science 57%

Active learning for systematic reviews

active-learning asreview deep-learning language-model learning-algorithms literature llm natural-language-processing neural-network research systematic-literature-reviews systematic-reviews utrecht-university

Updated 10 months ago

mlora-cli • Rank 11.3 • Science 64%

An Efficient "Factory" to Build Multiple LoRA Adapters

baichuan chatglm dpo finetune gpu llama llama2 llm lora mlora peft rlhf

Updated 10 months ago

rdocdump • Rank 8.1 • Science 67%

rdocdump: Dump ‘R’ Package Source, Documentation, and Vignettes into One File

llm r-package rag text

Updated 10 months ago

langchain • Rank 39.0 • Science 36%

🦜🔗 Build context-aware reasoning applications 🦜🔗

ai anthropic gemini langchain llm openai python

Updated 10 months ago

tree-of-thought-prompting • Rank 8.0 • Science 67%

Using Tree-of-Thought Prompting to boost ChatGPT's reasoning

large-language-models llm prompt-engineering

Updated 10 months ago

hallucination-leaderboard • Rank 10.8 • Science 64%

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

generative-ai hallucinations llm

Updated 10 months ago

vulntrain • Rank 7.6 • Science 67%

A tool to generate datasets and models based on vulnerability descriptions from @Vulnerability-Lookup.

dataset llm nlp text-generation vulnerability vulnerability-lookup

Updated 10 months ago

chronos-forecasting • Rank 20.5 • Science 54%

Chronos: Pretrained Models for Probabilistic Time Series Forecasting

artificial-intelligence forecasting foundation-models huggingface huggingface-transformers large-language-models llm machine-learning pretrained-models time-series time-series-forecasting timeseries transformers

Updated 10 months ago

mosec • Rank 20.4 • Science 54%

A high-performance ML model serving framework that offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine

cv deep-learning gpu hacktoberfest jax llm llm-serving machine-learning machine-learning-platform mlops model-serving mxnet nerual-network python pytorch rust tensorflow tts

Updated 10 months ago

mllmcelltype • Rank 12.3 • Science 59%

🏆 #1 Multi-LLM consensus framework | 550+ stars | 95% accuracy | 10+ LLM providers | Leading cell annotation tool

artificial-intelligence bioinformatics cell-type-annotation claude computational-biology consensus-algorithm deepseek gemini grok large-language-models llm multi-llm-consensus openai openrouter qwen scanpy scrna scrnaseq-analysis seurat single-cell

Updated 10 months ago

chemlift • Rank 4.0 • Science 67%

Language-interfaced fine-tuning for chemistry

chemistry few-shot-learning fine-tuning hacktoberfest llm materials

Updated 10 months ago

alignment-handbook • Rank 16.9 • Science 54%

Robust recipes to align language models with human and AI preferences

llm rlhf transformers

Updated 10 months ago

json_repair • Rank 26.6 • Science 44%

A Python module to repair invalid JSON from LLMs

deep-learning gpt-4 json llama3 llm machine-learning mistral openai-api parser repair

Updated 10 months ago

llms-from-scratch • Rank 14.9 • Science 54%

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

ai artificial-intelligence chatgpt deep-learning from-scratch gpt language-model large-language-models llm machine-learning python pytorch transformer

Updated 10 months ago

fastrag • Rank 14.7 • Science 54%

Efficient Retrieval Augmentation and Generation Framework

benchmark colbert diffusion generative-ai information-retrieval knowledge-graph llm multi-modal nlp question-answering semantic-search sentence-transformers summarization transformers

Updated 10 months ago

eval-suite • Rank 9.2 • Science 59%

[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.

benchmark ceval chatgpt dataset evaluation gpt-3 gpt-4 hallucination hallucination-detection hallucination-evaluation hallucinations huggingface huggingface-transformers large-language-models llm openai openai-api qwen

Updated 10 months ago

sesgos_llm • Rank 1.1 • Science 67%

How do LLMs “get it wrong”?

bard bard-api codigo cohere datos gpt llm sesgo

Updated 10 months ago

neptune-client • Rank 24.0 • Science 44%

📘 The experiment tracker for foundation model training

comparison dl foundation keras learning lightgbm llm logger logging machine ml mlops monitoring optuna pytorch rl tensorflow versioning visualization xgboost

Updated 10 months ago

optimate • Rank 12.8 • Science 54%

A collection of libraries to optimise AI model performance

ai analytics artificial-intelligence deeplearning large-language-models llm

Updated 10 months ago

agentlego • Rank 12.6 • Science 54%

Enhance LLM agents with rich tool APIs

large-language-models llm llm-agents

Updated 10 months ago

https://github.com/netflix/metaflow • Rank 30.5 • Science 36%

Build, Manage and Deploy AI/ML Systems

agents ai aws azure data-science datascience gcp generative-ai high-performance-computing kubernetes llm llmops machine-learning ml ml-infrastructure ml-platform mlops model-management python

Updated 10 months ago

beyondllm • Rank 12.5 • Science 54%

Build, evaluate and observe LLM apps

ai artificial-intelligence embeddings evaluate-llm genai generative-ai hacktoberfest hacktoberfest-accepted hacktoberfest2024 large-language-models llm llms rag

Updated 10 months ago

openllm • Rank 22.1 • Science 44%

Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.

bentoml fine-tuning llama llama2 llama3-1 llama3-2 llama3-2-vision llm llm-inference llm-ops llm-serving llmops mistral mlops model-inference open-source-llm openllm vicuna

Updated 10 months ago

bugbug • Rank 11.6 • Science 54%

Platform for Machine Learning projects on Software Engineering

ai developer-tools llm machine-learning ml python software-engineering

Updated 10 months ago

banks • Rank 21.3 • Science 44%

LLM prompt language based on Jinja. Banks provides tools and functions to build prompt text and chat messages from generic blueprints. It allows attaching metadata to prompts to ease their management, and versioning is a first-class citizen. Banks provides ways to store prompts on disk along with their metadata.

chatgpt llm nlp openai prompt-engineering prompt-toolkit

Updated 10 months ago

beeai-framework • Rank 21.3 • Science 44%

Build production-ready AI agents in both Python and TypeScript.

agents ai ai-agent beeai framework llm multiagent python typescript

Updated 10 months ago

https://github.com/modelscope/data-juicer • Rank 19.2 • Science 46%

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

data data-analysis data-pipeline data-processing data-science data-visualization foundation-models instruction-tuning large-language-models llm llms multi-modal pre-training synthetic-data

Updated 10 months ago

chinese-llama-alpaca-2 • Rank 11.0 • Science 54%

Chinese LLaMA-2 & Alpaca-2 LLMs, phase two of the project, with 64K long-context models

64k alpaca alpaca-2 alpaca2 flash-attention large-language-models llama llama-2 llama2 llm nlp rlhf yarn

Updated 10 months ago

medicalgpt • Rank 10.7 • Science 54%

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, with implementations of incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.

chatgpt dpo gpt llama llm medical medicalgpt

Updated 10 months ago

oss-fuzz-gen • Rank 10.6 • Science 54%

LLM powered fuzzing via OSS-Fuzz.

ai fuzzing llm security

Updated 10 months ago

multilspy • Rank 18.6 • Science 46%

multilspy is an LSP client library in Python intended for building applications around language servers.

ai ai4code artificial-intelligence code-analysis code-completion code-generation codegen huggingface-transformers language-server-client language-server-protocol large-language-model large-language-models llm lsp lsp-client neurips neurips-2023 program-synthesis transformer

Updated 10 months ago

llmebench • Rank 10.5 • Science 54%

Benchmarking Large Language Models

benchmarking large-language-models llm multilingual

Updated 10 months ago

libre-chat • Rank 10.3 • Science 54%

🦙 Free and Open Source Large Language Model (LLM) chatbot web UI and API. Self-hosted, offline capable and easy to set up.

chatbot chatgpt langchain large-language-models llm llm-inference openapi self-hosted

Updated 10 months ago

sparql-llm • Rank 10.3 • Science 54%

🦜✨ Chat system and reusable components to improve LLM capabilities when generating SPARQL queries

expasy llm sparql sparql-query-builder

Updated 10 months ago

llm-lct-sequencing • Rank 2.2 • Science 62%

AI Semantic Insights: LLM Toolkit for Analysing Educational Practices and Knowledge Building.

classification knowledge-representation llm

Updated 10 months ago

https://github.com/flyteorg/flyte • Rank 27.3 • Science 36%

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

data data-analysis data-science dataops declarative fine-tuning flyte golang grpc hacktoberfest kubernetes kubernetes-operator llm machine-learning mlops orchestration-engine production python scale workflow

Updated 10 months ago

magicoder • Rank 9.0 • Science 54%

[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct

ai4code large-language-models llm llm4code

Updated 10 months ago

basic-memory • Rank 18.8 • Science 44%

AI conversations that actually remember. Never re-explain your project to Claude again. Local-first, integrates with Obsidian. Join our Discord: https://discord.gg/tyvKNccgqN

ai claude knowledge-management knowlege-graph llm local-first markdown mcp obsidian obsidian-md open-source privacy-first privacy-first-ai productivity python

Updated 10 months ago

@llm-tools/embedjs • Rank 17.7 • Science 44%

A NodeJS RAG framework to easily work with LLMs and embeddings

ai chatgpt claude cohere embedding embeddings gpt gpt-4 gpt-4o huggingface large-language-models llm mistral ollama openai pinecone rag vector-database vertex-ai

Updated 10 months ago

deeplake • Rank 25.6 • Science 36%

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

ai computer-vision cv data-science datalake datasets deep-learning image-processing langchain large-language-models llm machine-learning ml mlops multi-modal python pytorch tensorflow vector-database vector-search

Updated 10 months ago

chinese-mixtral • Rank 7.1 • Science 54%

Chinese Mixtral mixture-of-experts (MoE) LLMs

32k 64k large-language-models llm mixtral mixture-of-experts moe nlp

Updated 10 months ago

monitors4codegen • Rank 7.0 • Science 54%

Code and data artifact for the NeurIPS 2023 paper "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multilspy` is an LSP client library in Python intended for building applications around language servers.

ai ai4code artificial-intelligence code-analysis code-completion code-generation codegen dataset huggingface-transformers language-server-client language-server-protocol large-language-models llm lsp lsp-client neurips neurips-2023 program-synthesis transformer

Updated 10 months ago

x-anylabeling • Rank 16.5 • Science 44%

Effortless data labeling with AI support from Segment Anything and other awesome models.

annotation-tool classification clip deep-learning deeplearning depth-estimation grounding-dino image-segmentation labeling-tool llm matting object-detection onnx paddle pose-estimation pytorch resnet sam vlm yolo

Updated 10 months ago

https://github.com/khoj-ai/khoj • Rank 24.1 • Science 36%

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

agent ai assistant chat chatgpt emacs image-generation llama3 llamacpp llm obsidian obsidian-md offline-llm productivity rag research self-hosted semantic-search stt whatsapp-ai

Updated 10 months ago

https://github.com/zenml-io/zenml • Rank 23.9 • Science 36%

ZenML 🙏: MLOps for Reliable AI: from Classical AI to Agents. https://zenml.io.

ai automl data-science deep-learning devops-tools hacktoberfest llm llmops machine-learning metadata-tracking ml mlops pipelines production-ready pytorch tensorflow workflow zenml

Updated 10 months ago

contextgem • Rank 15.2 • Science 44%

ContextGem: Effortless LLM extraction from documents

ai contract-analysis data-extraction document-intelligence docx docx2md docx2txt generative-ai legaltech llm llm-extraction llm-framework llm-pipeline llms nlp prompt-engineering text-analysis unstructured-data

Updated 8 months ago

https://github.com/gradio-app/fastrtc • Rank 23.2 • Science 36%

The Python library for real-time communication

artificial-intelligence llm python real-time speech-to-text text-to-speech

Updated 9 months ago

https://github.com/deepset-ai/haystack-core-integrations • Rank 22.9 • Science 36%

Additional packages (components, document stores and the like) to extend the capabilities of Haystack

ai haystack llm mlops nlp

Updated 10 months ago

https://github.com/floneum/floneum • Rank 22.8 • Science 36%

Instant, controllable, local pre-trained AI models in Rust

ai candle constrained-generation dioxus floneum-v3 kalosm llama llamacpp llm mistral rust transcription whisper

Updated 10 months ago

https://github.com/nixtla/nixtla • Rank 22.6 • Science 36%

TimeGPT-1: production-ready pre-trained Time Series Foundation Model for forecasting and anomaly detection. Generative pretrained transformer for time series trained on over 100B data points. It can produce accurate forecasts across domains such as retail, electricity, finance, and IoT with just a few lines of code 🚀.

anomaly-detection artificial-intelligence deep-learning forecasting generative-ai-time-series gpt gpts llm machine-learning time-series time-series-forecasting timegpt

Updated 10 months ago

rl-llm-calibration-test • Rank 4.5 • Science 54%

An attempt to replicate parts of the paper "Language models (mostly) know what they know" on open datasets and models.

llama2 llm

Updated 10 months ago

tap_llm_course • Rank 1.4 • Science 57%

Materials for the TAP (Tendencias en Aprendizaje Profundo, "Trends in Deep Learning") subject of the Master's in Robotics and Artificial Intelligence at the Universidad de León

deep-learning llm robotics ros2 vlm

Updated 10 months ago

chinese-llama-alpaca • Rank 12.2 • Science 46%

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment

alpaca alpaca-2 large-language-models llama llama-2 llm lora nlp plm pre-trained-language-models quantization

Updated 10 months ago

xfinder • Rank 6.6 • Science 51%

[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation

benchmark cc-by-nc-nd-4 chatglm dataset evaluation gpt judge-model key-answer-extraction large-language-models llm llm-as-a-judge llm-as-evaluator lm-evaluation open-compass phi qwen regex reliability reliable-evaluation xfinder

Updated 10 months ago

grammarflow • Rank 5.5 • Science 52%

Powering Agent Chains by Constraining LLM Outputs

agent ai ai-eng llama llama-cpp llm llm-agent

Updated 10 months ago

lotr • Rank 3.4 • Science 54%

Low Tensor Rank adaptation of large language models

fine-tuning llm lora lotr parameter-efficient-tuning peft

Updated 10 months ago

canada-labour-research-assistant • Rank 3.3 • Science 54%

The Canada Labour Research Assistant (CLaRA) is a privacy-first LLM-powered RAG AI assistant proposing Easily Verifiable Direct Quotations (EVDQ) to mitigate hallucinations in answering questions about Canadian labour laws, standards, and regulations. It works entirely offline and locally, guaranteeing the confidentiality of your conversations.

chatbot-application chatbot-framework labour labour-relations lcs-algorithm llm llm-inference llm-serving metadata ollama question-answering quotations rag-chatbot retrieval-augmented-generation sentence-transformers source-referencing streamlit string-matching-algorithms vector-database vllm

Updated 10 months ago