`SimilaritySearch.jl`
`SimilaritySearch.jl`: Autotuned nearest neighbor indexes for Julia - Published in JOSS (2022)
Reiz
Reiz: Structural Source Code Search - Published in JOSS (2021)
usearch
Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
graphannis
This is a new backend implementation of the ANNIS linguistic search and visualization system.
tantivy
Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
txtai
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
ghs
GitHub Search: Platform used to crawl, store and present projects from GitHub, as well as any statistics related to them
weaviate
Weaviate is an open-source vector database that stores both objects and vectors, combining vector search and structured filtering with the fault tolerance and scalability of a cloud-native database.
com.arcadedb:arcadedb-console
ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.
similarities
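Similarities: a toolkit for similarity calculation and semantic search. It supports text-to-text, text-to-image, and image-to-image search over datasets at the hundred-million scale, is written in Python 3, and works out of the box.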
https://github.com/epsilla-cloud/vectordb
Epsilla is a high-performance Vector Database Management System
https://github.com/deeprec-ai/deeprec
DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted as an incubation project at the LF AI & Data Foundation.
https://github.com/aryanvbw/auto-blogger
Professional WordPress automation tool with AI content generation, Getty Images integration, SEO optimization, and auto-updates. Supports OpenAI DALL-E, Google Gemini, and multi-domain management
msamanda
MS Amanda is a scoring system to identify peptides out of tandem mass spectrometry data using a database of known proteins.
https://github.com/clarin-eric/fcs-sru-aggregator
CLARIN Federated Content Search v3 Aggregator – Augmenting your Search Engine
https://github.com/bigbio/msgfplus
MS-GF+ (aka MSGF+ or MSGFPlus) performs peptide identification by scoring MS/MS spectra against peptides derived from a protein sequence database.
https://github.com/cdpierse/bleve_example
Building a search engine in Go using Bleve!
candidatevectorsearch
Searching for peptide candidates using sparse matrix + matrix/vector multiplication.
https://github.com/baioc/busk
Plain text search using an inverted trigram index
msannika
MS Annika is a crosslink search engine based on MS Amanda, aimed at identifying crosslinks of cleavable and non-cleavable crosslinkers from MS2 and MS3 spectra.
lucene-cranfield-search-engine
🔍 A Lucene demo for searching the Cranfield collection.
https://github.com/capjamesg/nanosearch
Build a search engine from a website sitemap.
https://github.com/aksw/sante
The Ontology, Dataset and Knowledge Search Engine