Updated 6 months ago

trafilatura • Rank 26.3 • Science 77%

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Updated 6 months ago

paper-qa • Rank 22.0 • Science 77%

High accuracy RAG for answering questions from scientific documents with citations

Updated 6 months ago

vidore-benchmark • Rank 14.9 • Science 77%

Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.

Updated 6 months ago

flashrank • Rank 20.1 • Science 67%

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

Updated 6 months ago

llama_index • Rank 33.9 • Science 49%

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Updated 6 months ago

farm-haystack • Rank 28.7 • Science 54%

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Updated 6 months ago

flexrag • Rank 12.8 • Science 67%

FlexRAG: A RAG Framework for Information Retrieval and Generation.

Updated 6 months ago

rdocdump • Rank 8.1 • Science 67%

rdocdump: Dump ‘R’ Package Source, Documentation, and Vignettes into One File

Updated 6 months ago

deepsearch-toolkit • Rank 16.5 • Science 54%

Interact with the Deep Search platform for new knowledge explorations and discoveries

Updated 6 months ago

https://github.com/khoj-ai/khoj • Rank 24.1 • Science 36%

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

Updated 6 months ago

ragoon • Rank 9.3 • Science 44%

High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡

Updated 6 months ago

odin-slides • Rank 7.8 • Science 44%

This is an advanced Python tool that empowers you to effortlessly draft customizable PowerPoint slides using the Generative Pre-trained Transformer (GPT) of your choice. Leveraging the capabilities of Large Language Models (LLM), odin-slides enables you to turn the lengthiest Word documents into well organized presentations.

Updated 6 months ago

porag • Rank 4.9 • Science 44%

Fully Configurable RAG Pipeline for Bengali Language RAG Applications. Supports both Local and Huggingface Models, Built with Langchain.

Updated 6 months ago

odinrunes • Rank 4.5 • Science 44%

Odin Runes, a java-based GPT client, facilitates interaction with your preferred GPT model right through your favorite text editor. There is more: It also facilitates prompt-engineering by extracting context from diverse sources using technologies such as OCR, enhancing overall productivity and saving costs.

Updated 6 months ago

https://github.com/apache/hamilton • Rank 12.1 • Science 36%

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

Updated 6 months ago

ragxplorer • Rank 13.3 • Science 26%

Open-source tool to visualise your RAG 🔮

Updated 5 months ago

https://github.com/casperdcl/brace • Rank 1.1 • Science 36%

:book: BRACE — Bible retrieval-augmented (Catholic edition)

Updated 6 months ago

https://github.com/adithya-s-k/varag • Rank 7.6 • Science 23%

Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine

Updated 6 months ago

fed-rag • Science 75%

A framework for fine-tuning retrieval-augmented generation (RAG) systems.

Updated 4 months ago

https://github.com/deepset-ai/haystack-cookbook • Science 26%

👩🏻‍🍳 A collection of example notebooks using Haystack

Updated 5 months ago

https://github.com/cuc-zihang-liu/text-based-rag-framework • Science 26%

基于文本的RAG框架(多种编码器组合)

Updated 6 months ago

quick-start-creating-a-vector-database-for-rag • Science 44%

A playground for vector database exploration using Chroma

Updated 6 months ago

rag-evaluation-harnesses • Science 54%

An evaluation suite for Retrieval-Augmented Generation (RAG).

Updated 6 months ago

https://github.com/amazon-science/memerag • Science 36%

MEMERAG: A Multilingual End-to-End Meta-Evaluation Benchmark for Retrieval Augmented Generation

Updated 6 months ago

ragthoven • Science 26%

RAGthoven, a Retrieval Augmented Generation Toolkit that helps you easily set up and execute your RAG experiments

Updated 5 months ago

https://github.com/buaadreamer/easyrag • Science 10%

Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案

Updated 6 months ago

ColBERT • Science 41%

Efficient late-interaction retrieval systems in Julia!

Updated 5 months ago

https://github.com/chirayu-tripathi/mongodb-querifier • Science 13%

Improving LLMs MongoDB query generation capability with the help of advanced retrieval augmented generation.

Updated 6 months ago

geospatial-rag • Science 26%

AI Framework for Remote Sensing Image Analysis using RAG - 88%+ accuracy, multi-modal queries, ChatGPT-like interface

Updated 6 months ago

https://github.com/akaiko1/langchain_examples • Science 26%

Practical, minimal examples for building with LangChain

Updated 6 months ago

promptfoo • Science 26%

Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

Updated 6 months ago

rankify • Science 54%

🔥 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation 🔥. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techniques, 24+ state-of-the-art Reranking models, and multiple RAG methods.

Updated 6 months ago

rage • Science 26%

RagE (RAG Engine) - A tool supporting the construction and training of components of the Retrieval-Augmented-Generation (RAG) model. It also facilitates the rapid development of Q&A systems and chatbots following the RAG model.

Updated 6 months ago

janus-llm • Science 44%

Leveraging LLMs for modernization through intelligent chunking, iterative prompting and reflection, and retrieval augmented generation (RAG).

Updated 6 months ago

rag_fundamentals • Science 36%

Retrieval-Augmented Generation (RAG) Fundamentals and Semantic Chunking

Updated 6 months ago

https://github.com/amberlee2427/nancy-brain • Science 36%

Nancy's RAG backend and HTTP API/MCP server connectors.

Updated 6 months ago

rag-constitucion-chile • Science 31%

Platform to compare Chile's current constitution with its new proposed constitution using LLMs.

Updated 6 months ago

smyth-docs • Science 26%

Everything you need to build, deploy, and collaborate with agents. Ride the llama, avoid the drama.

Updated 6 months ago

https://github.com/svilupp/aihelpme.jl • Science 13%

Harnessing Julia's Rich Documentation for Tailored AI-Assisted Coding Guidance

Updated 6 months ago

wovensnips • Science 44%

WovenSnips: A Lightweight, Free, and Open-source Implementation of Retrieval-Augmented Generation (RAG) using Straico API