PyExperimenter
PyExperimenter: Easily distribute experiments and track results - Published in JOSS (2023)
dwctaxon, an R package for editing and validating taxonomic data in Darwin Core format
dwctaxon, an R package for editing and validating taxonomic data in Darwin Core format - Published in JOSS (2024)
usearch
Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
Caesar
Robust robotic localization and mapping, together with NavAbility(TM). Reach out to info@wherewhen.ai for help.
cazy-webscraper
Web scraper to retrieve protein data catalogued by the CAZy, UniProt, NCBI, GTDB and PDB websites/databases.
chip-atlas
ChIP-Atlas: Browse and analyze all public ChIP/DNase-seq data on your browser
protease_activity_analysis
Python toolkit and package for analyzing enzyme activity data
efp-seq_browser
An RNA-Seq data exploration tool that shows read map coverage of a gene of interest along with a coloured "electronic fluorescent pictographic" (eFP) based on its RPKM expression level.
pandas-ai
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
enhancing_reaxff_dft_database
Database used for retraining the ReaxFF force field for the inorganic compound LiF.
ustore
Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang 🗄️
com.arcadedb:arcadedb-console
ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.
loris
LORIS is a web-accessible database solution for longitudinal multi-site studies.
epiphyte
Python toolkit for working with high-dimensional neural data recorded during naturalistic, continuous stimuli @a-darcher @rachrapp
bety
Web-interface to the Biofuel Ecophysiological Traits and Yields Database (used by PEcAn and TERRA REF)
persian-news-crawler
Simple Script To Crawl Data From Persian News Agencies Including Fars, Mehr.
alchemy
Archived - High performance, realtime BaaS with RBAC, graphing, full text search, S3 / B2 Storage and GIS with a GraphQL, gRPC and REST API
sqlcmdcli
sqlcmdcli is written in Delphi RAD Studio and lets you connect to a SQL Server instance and run specific commands!
relational-databases
Teaching Materials for the "Relational Database Course" taught at the HFTM
neotoma_lakes
A repository for managing the matching of lake data between national hydrographic databases and Neotoma records.
virtual-cat-data-infrastructure
This repository contains the data infrastructure for the Virtual Cross Array Task (CAT) platform designed to assess algorithmic skills among K-12 students.
bcdatabaser
A pipeline to create reference databases for arbitrary markers and taxonomic groups from NCBI data
createtaxdb
Parallelised and automated construction of metagenomic classifier databases of different tools
women-of-coal-revisited
This is a text analysis project which utilizes localized oral histories in order to highlight topics, labor trends, and women's history in Appalachian coal mining towns. Original archival sources from the University of Kentucky Nunn Center for Oral History "Appalachia: Women of Coal" Collection & the 1996 Women of Coal primary oral history reader.
bromanscoopey
Repository for database of epigraphic monuments from Roman Dalmatia commemorating servicemen of Legio VII
twinger_chronicle_mss
data base for the known manuscripts that transmit the chronicle of Jakob Twinger von Königshofen
non-profit-link
Non-Profit Link (NP Link). Used for communication between non-profits.
chooseadb.github.io
A game to provide guidance on choosing a database for your project
germinate-vue
Germinate is an open source plant database infrastructure and application programming platform on which complex data from genetic resource collections can be stored, queried and visualized.
mathmoddb
The repo for MathModDB, the model ontology and knowledge graph developed by MaRDI TA4
quacc
quacc is a flexible platform for computational materials science and quantum chemistry that is built for the big data era.
sqlab
SQL Adventure Builder: transform a dataset and a collection of SQL exercises into a self-contained database
pia-data-model
Data model for the Participatory Knowledge Practices in Analogue and Digital Image Archives (PIA) project
tsdb
a Python toolbox loads 172 public time series datasets for machine/deep learning with a single line of code. Datasets from multiple domains including healthcare, financial, power, traffic, weather, and etc.
fetch_ngs
Workflow to Fetch Public Sequencing Data and Metadata Using iSeq and MrBiomics Module.
zincbinddb
The database and API backend for ZincBind - the database of zinc binding sites
sharetraitdatabase
ShareTrait Database — a relational SQL database providing a structured implementation of the ShareTrait data. It includes the full schema definition, data tables and supporting documentation.
oeplatform
Repository for the code of the Open Energy Platform (OEP) website. The OEP provides an interface to the Open Energy Family
eplant_plant_efp
A data visualization tool to display tissue expression data for Arabidopsis thaliana
ladis
A database for cleaning laser tools created in a project at the Potsdam University of Applied Sciences.
globalid-database
Here you can find the most recent version of the GlobaLID database. Stable versions are regularly published at https://doi.org/10.5880/fidgeo.2023.043
loris
Loris: Database and Analysis application for a Drosophila Lab (or any lab)
hfcommunity
HFCommunity offers an offline up-to-date relational database built from the data available at the Hugging Face Hub, providing queriable data about the repositories hosted in the Hub
nl2query
A framework for converting natural language text inputs to corresponding Pandas, MongoDB, Kusto and Neo4j (Cypher) queries.
architxt
ArchiTXT is an open source Python library that transforms unstructured text into structured, searchable, and AI-ready data. It enables automated database generation and seamless data integration.