vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
retomaton
PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)
act
Atmospheric data Community Toolkit - A python based toolkit for exploring and analyzing time series atmospheric datasets
tax-retrieval-benchmark
An implementation of the TaxRetrievalBenchmark task for the 🤗 Massive Text Embedding Benchmark (MTEB) framework.
rag-chunking-evaluation
Assess the effectiveness of chunking strategies in RAG systems via a custom evaluation framework.
https://github.com/epsilla-cloud/vectordb
Epsilla is a high performance Vector Database Management System
https://github.com/datascienceuibk/llm-reranking-generalization-study
How Good are LLM-based Rerankers? Accepted at EMNLP Findings 2025
ladis
A database for cleaning laser tools created in a project at the Potsdam University of Applied Sciences.
rankify
🔥 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation 🔥. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techniques, 24+ state-of-the-art Reranking models, and multiple RAG methods.
ccrk
[KDD 2024] Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning
https://github.com/chirayu-tripathi/mongodb-querifier
Improving LLMs MongoDB query generation capability with the help of advanced retrieval augmented generation.