Pooch
Pooch: A friend to fetch your data files - Published in JOSS (2020)
PyCM
PyCM: Multiclass confusion matrix library in Python - Published in JOSS (2018)
Retriever
Retriever: Data Retrieval Tool - Published in JOSS (2017)
Gibbs Sea Water Oceanographic Toolbox of TEOS-10 implemented in Rust
Gibbs Sea Water Oceanographic Toolbox of TEOS-10 implemented in Rust - Published in JOSS (2024)
ytree
ytree: A Python package for analyzing merger trees - Published in JOSS (2019)
maidr-legacy
[DEPRECATED prototype] Multimodal Access and Interactive Data Representation
@uwdata/mosaic-core
An extensible framework for linking databases and interactive views.
pydata-wrangler
Wrangle messy numerical, image, and text data into consistent well-organized formats
pyprep
PyPREP: A Python implementation of the Preprocessing Pipeline (PREP) for EEG data
spanishoddata
Access high-quality, open-access national datasets on movement patterns derived from mobile phone data.
datahugger
One downloader for many scientific data and code repositories! DOI 👐 Data
pandas-ai
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
asreview-makita
Workflow generator for simulation studies using the command line interface of ASReview LAB
academic-observatory-workflows
Telescopes, Workflows and Data Services for the Academic Observatory
interactive_data_editor
Software for interactively editing data in a graphical manner
oaebu-workflows
Telescopes, Workflows and Data Services for the 'Book Analytics Dashboard Project (2022-2025)', building upon the project 'Developing a Pilot Data Trust for Open Access eBook Usage (2020-2022)'
enhancing_reaxff
Jupyter notebooks used for retraining the ReaxFF force field for the inorganic compound LiF.
cbsodata
Unofficial Statistics Netherlands (CBS) open data API client for Python
constrain
Control Strainer (ConStrain) is a data-driven knowledge-integrated framework that automatically verifies that building system controls function as intended.
bio_data_guide
Standardizing Marine Biological Data Working Group - An open community to facilitate the mobilization of biological data to OBIS.
@stdlib/datasets-spache-revised
A list of simple American-English words (revised Spache).
@stdlib/datasets-harrison-boston-house-prices
A dataset derived from information collected by the US Census Service concerning housing in Boston, Massachusetts (1978).
datasets-herndon-venus-semidiameters
Fifteen observations of the vertical semidiameter of Venus, made by Lieutenant Herndon, with the meridian circle at Washington, in the year 1846.
@stdlib/datasets-harrison-boston-house-prices-corrected
A (corrected) dataset derived from information collected by the US Census Service concerning housing in Boston, Massachusetts (1978).
@stdlib/datasets-pace-boston-house-prices
A (corrected) dataset derived from information collected by the US Census Service concerning housing in Boston, Massachusetts (1978).
datasets-minard-napoleons-march
Data for Charles Joseph Minard's cartographic depiction of Napoleon's Russian campaign of 1812.
harmony_examples
Example Jupyter notebooks and R scripts applying Harmony to real research problems
bluecloud-plankton
Spatial interpolation of plankton data using a neural network
compound_radar_waveforms-data
Data used by NTIA/ITS TR-23-566 Examining the Effects of Resolution Bandwidth when Measuring Compound Radar Waveforms.
open-data-pipeline
A pipeline for processing, enhancing, and sharing open datasets.
phenodata
An acquisition and processing toolkit for open access phenology data.