Projects

Scientific Software

Updated 10 months ago

libfmp — Peer-reviewed • Rank 14.1 • Science 93%

libfmp: A Python Package for Fundamentals of Music Processing - Published in JOSS (2021)

audio beat chord dtw f0 fourier hmm mir music nmf onset processing python retrieval synchronization tempo

Engineering (40%)

Scientific Software · Peer-reviewed

Updated 10 months ago

vidore-benchmark • Rank 14.9 • Science 77%

Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.

colpali rag retrieval search vision-language-model

Updated 10 months ago

retomaton • Rank 9.6 • Science 72%

PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)

icml-2022 knn knn-lm language modeling nearest nearest-neighbor neuro-symbolic neurosymbolic pytorch retomaton retrieval

Updated 10 months ago

act • Rank 17.1 • Science 59%

Atmospheric data Community Toolkit - A Python-based toolkit for exploring and analyzing time series atmospheric datasets

atmospheric-science corrections meteorological-data meteorology retrieval time-series visualization

Updated 10 months ago

sgpt • Rank 7.9 • Science 41%

SGPT: GPT Sentence Embeddings for Semantic Search

gpt information-retrieval language-model large-language-models neural-search retrieval semantic-search sentence-embeddings sgpt text-embedding

Updated 10 months ago

cherche • Rank 7.6 • Science 41%

Neural Search

bm25 flashtext information-retrieval machine-learning natural-language-processing neural-networks neural-search nlp question-answering reader retrieval search searching semantic-search vector-search

Updated 10 months ago

tax-retrieval-benchmark • Rank 0.7 • Science 44%

An implementation of the TaxRetrievalBenchmark task for the 🤗 Massive Text Embedding Benchmark (MTEB) framework.

benchmark droit embeddings fiscal fiscalite information-retrieval mteb rag retrieval retrieval-augmented-generation sbert semantic-search sentence-embeddings sentence-transformers stp tax taxation

Updated 10 months ago

rag-chunking-evaluation • Rank 0.0 • Science 44%

Assess the effectiveness of chunking strategies in RAG systems via a custom evaluation framework.

chunking evaluation-framework retrieval retrieval-augmented-generation

Updated 10 months ago

https://github.com/epsilla-cloud/vectordb • Rank 16.8 • Science 26%

Epsilla is a high-performance Vector Database Management System

ai chatgpt data data-science database embeddings embeddings-similarity infrastructure llms machine-learning neural-network neural-search rag retrieval search-engine vector-database vector-search

Updated 10 months ago

https://github.com/buaadreamer/easyrag • Science 10%

Easy-to-Use RAG Framework; third-place (Top 3) solution in the CCF AIOps International Challenge 2024

aiops bge bm25 ccf glm4 gte hyde llm minicpm network-operations qa rag retrieval retrieval-augmented-generation rrf

Updated 10 months ago

https://github.com/chirayu-tripathi/mongodb-querifier • Science 13%

Improving LLMs' MongoDB query generation capability with the help of advanced retrieval-augmented generation.

database llms mongodb rag retrieval retrieval-augmented-generation

Updated 10 months ago

ccrk • Science 54%

[KDD 2024] Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning

cross-lingual cross-lingual-retrieval cross-modal cross-modal-retrieval iglue image-text-retrieval image-text-search kdd2024 mscoco multi30k retrieval swin-transformer vision-language-pretraining wit xflickrco xlm-roberta

Updated 10 months ago

ladis • Science 44%

A database for cleaning laser tools created in a project at the Potsdam University of Applied Sciences.

database information laser retrieval

Updated 10 months ago

rankify • Science 54%

🔥 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation 🔥. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techniques, 24+ state-of-the-art Reranking models, and multiple RAG methods.

agent ai chatgpt information-retrieval llm nlp question-answering rag ranked-retrieval reranking retrieval retrival-augmented-generation

Updated 10 months ago

https://github.com/datascienceuibk/llm-reranking-generalization-study • Science 36%

How Good are LLM-based Rerankers? Accepted at EMNLP Findings 2025

dataset leaderboard llm reranker reranking retrieval

