Scientific Software
Updated 9 months ago

PyExperimenter — Peer-reviewed • Rank 12.5 • Science 100%

PyExperimenter: Easily distribute experiments and track results - Published in JOSS (2023)

Scientific Software
Updated 9 months ago

dwctaxon, an R package for editing and validating taxonomic data in Darwin Core format — Peer-reviewed • Rank 8.9 • Science 98%

dwctaxon, an R package for editing and validating taxonomic data in Darwin Core format - Published in JOSS (2024)

Updated 9 months ago

usearch • Rank 25.4 • Science 77%

Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

Updated 9 months ago

Caesar • Rank 10.5 • Science 77%

Robust robotic localization and mapping, together with NavAbility(TM). Reach out to info@wherewhen.ai for help.

Updated 9 months ago

cazy-webscraper • Rank 9.9 • Science 77%

Web scraper to retrieve protein data catalogued by the CAZy, UniProt, NCBI, GTDB and PDB websites/databases.

Updated 9 months ago

chip-atlas • Rank 6.2 • Science 77%

ChIP-Atlas: Browse and analyze all public ChIP/DNase-seq data on your browser

Updated 9 months ago

geodatasets • Rank 4.6 • Science 77%

Synthetic datasets for geoscience (geo)statistical modeling

Updated 9 months ago

efp-seq_browser • Rank 4.2 • Science 75%

An RNA-Seq data exploration tool that shows read map coverage of a gene of interest along with a coloured "electronic fluorescent pictographic" (eFP) image based on its RPKM expression level.

Updated 9 months ago

pandas-ai • Rank 24.5 • Science 54%

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

Updated 9 months ago

morph-kgc • Rank 8.0 • Science 67%

Powerful RDF Knowledge Graph Generation with RML Mappings

Updated 9 months ago

signac • Rank 8.7 • Science 62%

Manage large and heterogeneous data spaces on the file system.

Updated 9 months ago

enhancing_reaxff_dft_database • Rank 2.9 • Science 67%

Database used for retraining the ReaxFF force field for the inorganic compound LiF.

Updated 9 months ago

ustore • Rank 15.3 • Science 54%

Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang 🗄️

Updated 9 months ago

neutronics_material_maker • Rank 14.2 • Science 54%

A tool for making reproducible materials and standardizing use across several neutronics codes

Updated 9 months ago

com.arcadedb:arcadedb-console • Rank 21.8 • Science 44%

ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.

Updated 9 months ago

loris • Rank 11.5 • Science 54%

LORIS is a web-accessible database solution for longitudinal multi-site studies.

Updated 9 months ago

hash • Rank 11.2 • Science 54%

🚀 The open-source, multi-tenant, self-building knowledge graph

Updated 9 months ago

epiphyte • Rank 7.0 • Science 54%

Python toolkit for working with high-dimensional neural data recorded during naturalistic, continuous stimuli @a-darcher @rachrapp

Updated 9 months ago

com.wgzhao.addax:addax-all • Rank 15.4 • Science 44%

A fast and versatile ETL tool that can transfer data between RDBMS and NoSQL seamlessly

Updated 9 months ago

bety • Rank 8.3 • Science 51%

Web-interface to the Biofuel Ecophysiological Traits and Yields Database (used by PEcAn and TERRA REF)

Updated 9 months ago

alchemy • Rank 6.9 • Science 44%

Archived - High performance, realtime BaaS with RBAC, graphing, full text search, S3 / B2 Storage and GIS with a GraphQL, gRPC and REST API

Updated 9 months ago

soil • Rank 6.5 • Science 44%

An object oriented database that is easy to use and fun to play with

Updated 9 months ago

sqlcmdcli • Rank 3.3 • Science 44%

sqlcmdcli is written in Delphi RAD Studio and lets you connect to a SQL Server instance and run specific commands!

Updated 9 months ago

neotoma_lakes • Rank 3.2 • Science 41%

A repository for managing the matching of lake data between national hydrographic databases and Neotoma records.

Updated 9 months ago

virtual-cat-data-infrastructure • Science 67%

This repository contains the data infrastructure for the Virtual Cross Array Task (CAT) platform designed to assess algorithmic skills among K-12 students.

Updated 9 months ago

bcdatabaser • Science 41%

A pipeline to create reference databases for arbitrary markers and taxonomic groups from NCBI data

Updated 9 months ago

connectivity-search-backend • Science 57%

Django backend for hetnet connectivity search

Updated 9 months ago

createtaxdb • Science 57%

Parallelised and automated construction of metagenomic classifier databases of different tools

Updated 9 months ago

pymapify • Science 44%

PyMapify: Transforms a CSV file containing Google Maps links into an interactive map and stores the data in a persistent database

Updated 9 months ago

1000-tools-paper • Science 67%

Code and analysis for the 1000 tools paper

Updated 9 months ago

women-of-coal-revisited • Science 44%

This is a text analysis project that uses localized oral histories to highlight topics, labor trends, and women's history in Appalachian coal mining towns. The original archival sources are the University of Kentucky Nunn Center for Oral History "Appalachia: Women of Coal" Collection and the 1996 Women of Coal primary oral history reader.

Updated 9 months ago

bromanscoopey • Science 67%

Repository for database of epigraphic monuments from Roman Dalmatia commemorating servicemen of Legio VII

Updated 9 months ago

cpq-native-index • Science 54%

A graph database index with native support for CPQs.

Updated 9 months ago

batchcalculator • Science 44%

Batch Calculator for Zeolite synthesis

Updated 9 months ago

twinger_chronicle_mss • Science 67%

Database of the known manuscripts that transmit the chronicle of Jakob Twinger von Königshofen

Updated 9 months ago

non-profit-link • Science 31%

Non-Profit Link (NP Link). Used for communication between non-profits.

Updated 9 months ago

chooseadb.github.io • Science 44%

A game to provide guidance on choosing a database for your project

Updated 9 months ago

recipy • Science 44%

Effortless method to record provenance in Python

Updated 9 months ago

storagex • Science 44%

Kotlin Multiplatform storage utilities.

Updated 9 months ago

germinate-vue • Science 75%

Germinate is an open source plant database infrastructure and application programming platform on which complex data from genetic resource collections can be stored, queried and visualized.

Updated 9 months ago

mathmoddb • Science 67%

The repo for MathModDB, the model ontology and knowledge graph developed by MaRDI TA4

Updated 9 months ago

meertrapdb • Science 31%

Database backend and survey analysis code for MeerTRAP.

Updated 9 months ago

quacc • Science 67%

quacc is a flexible platform for computational materials science and quantum chemistry that is built for the big data era.

Updated 9 months ago

materials-design-ontology • Science 67%

An Ontology for the Materials Design Domain

Updated 9 months ago

sqlab • Science 44%

SQL Adventure Builder: transform a dataset and a collection of SQL exercises into a self-contained database

Updated 9 months ago

pia-data-model • Science 54%

Data model for the Participatory Knowledge Practices in Analogue and Digital Image Archives (PIA) project

Updated 9 months ago

pdbq • Science 44%

Portable Database Queries

Updated 9 months ago

genome_updater • Science 67%

Bash script to download/update snapshots of files from NCBI genomes repository (refseq/genbank) with track of changes and without redundancy

Updated 9 months ago

tsdb • Science 67%

A Python toolbox that loads 172 public time series datasets for machine/deep learning with a single line of code. The datasets come from multiple domains, including healthcare, finance, power, traffic, and weather.

Updated 9 months ago

fetch_ngs • Science 67%

Workflow to Fetch Public Sequencing Data and Metadata Using iSeq and MrBiomics Module.

Updated 9 months ago

zincbinddb • Science 44%

The database and API backend for ZincBind - the database of zinc binding sites

Updated 9 months ago

sharetraitdatabase • Science 44%

ShareTrait Database — a relational SQL database providing a structured implementation of the ShareTrait data. It includes the full schema definition, data tables and supporting documentation.

Updated 9 months ago

oeplatform • Science 54%

Repository for the code of the Open Energy Platform (OEP) website. The OEP provides an interface to the Open Energy Family

Updated 9 months ago

ladis • Science 44%

A database for cleaning laser tools created in a project at the Potsdam University of Applied Sciences.

Updated 9 months ago

globalid-database • Science 57%

Here you can find the most recent version of the GlobaLID database. Stable versions are regularly published at https://doi.org/10.5880/fidgeo.2023.043

Updated 9 months ago

loris • Science 54%

Loris: Database and Analysis application for a Drosophila Lab (or any lab)

Updated 9 months ago

etymdb • Science 44%

[LREC 2020] EtymDB, an Etymological DataBase (v2.1)

Updated 9 months ago

hfcommunity • Science 52%

HFCommunity offers an offline, up-to-date relational database built from the data available at the Hugging Face Hub, providing queryable data about the repositories hosted in the Hub

Updated 9 months ago

zion • Science 54%

A scalable Thing Description Directory

Updated 9 months ago

nl2query • Science 44%

A framework for converting natural language text inputs to corresponding Pandas, MongoDB, Kusto and Neo4j (Cypher) queries.

Updated 9 months ago

project • Science 44%

The monorepo containing all code for the QED project

Updated 9 months ago

architxt • Science 44%

ArchiTXT is an open source Python library that transforms unstructured text into structured, searchable, and AI-ready data. It enables automated database generation and seamless data integration.