GitHub
@stdlib/random-base-minstd
A linear congruential pseudorandom number generator (LCG) based on Park and Miller.
https://github.com/ornladios/adios2
Next generation of ADIOS developed in the Exascale Computing Program
Swiftest
Swiftest: An \textit{N}-body Integrator for Gravitational Systems - Published in JOSS (2023)
com.eldrix/hermes
A library and microservice implementing the health and care terminology SNOMED CT with support for cross-maps, inference, fast full-text search, autocompletion, compositional grammar and the expression constraint language.
stress-ng
This is the stress-ng upstream project git repository. stress-ng will stress test a computer system in various selectable ways. It was designed to exercise various physical subsystems of a computer as well as the various operating system kernel interfaces.
iml
iml: An R package for Interpretable Machine Learning - Published in JOSS (2018)
aif360
A comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.
eval-suite
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.
randomNames
Function to generate random gender and ethnicity correct first and/or last names. Names are chosen proportionally based upon their probability of appearing in a large scale data base of real names.
weaviate
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
itslive
NOTE: this book is no longer maintained. Please go to 'Cloud-native geospatial data cube workflows,' linked below, instead.
hires-literature
All the literature on high-resolution spectroscopy of exoplanet atmospheres — theory and observation.
fmriprep
fMRIPrep is a robust and easy-to-use pipeline for preprocessing of diverse fMRI data. The transparent workflow dispenses of manual intervention, thereby ensuring the reproducibility of the results.
proteomics-sample-metadata
The Proteomics sample metadata: Standard for experimental design annotation in proteomics datasets
gap
Main development repository for GAP - Groups, Algorithms, Programming, a System for Computational Discrete Algebra
celeritas
Celeritas is a new Monte Carlo transport code designed to accelerate scientific discovery in high energy physics by improving detector simulation throughput and energy efficiency using GPUs.
https://github.com/clima/climaocean.jl
🌎 Regional-to-global coupled ocean and sea ice simulations based on Oceananigans
django-ddm
Data Donation Module: A Django application to setup and manage data donation projects.
public_open_source_data_science
A repository of open source data science projects for social good
giantkelpdynamics
Dynamical model for the motion of giant kelp (Macrocystis pyrifera). Accompanies "A model of tidal flow and tracer release in a giant kelp forest"(Strong-Wright and Taylor, In review)
ncse-seminar-2022
National Centre for Statistical Ecology seminar, Feb 9th, 2022
airs
The AIRS Code Collection enables data processing and analysis for remote sensing observations captured by NASA's Atmospheric InfraRed Sounder.
desarrollo_sector_pesquero_acuicola
Este trabajo identifica y analiza las oportunidades más relevantes para la evolución futura del sector pesquero
pytacite
A flexible and lightweight Python interface to the DataCite database (datacite.org)
minesweepears
MineSweepears Island is a hex-based puzzle game inspired by Minesweeper, where players use spatial audio cues to detect hidden mines. Created for Ludum Dare 32, the game blends strategy with innovative sound-based gameplay.
macrophage-regulation
Integrated time-series analysis and high-content CRISPR screening delineate the dynamics of macrophage immune regulation
nearoptimalalternatives.jl
A package implementing modelling to generate alternatives (MGA) to find near optimal alternatives to the optimal solution.
sc2-benchmark
[TMLR] "SC2 Benchmark: Supervised Compression for Split Computing"
Eclipse Golo
Eclipse Golo - Published in JOSS (2016)
PoissonRandom
Fast Poisson Random Numbers in pure Julia for scientific machine learning (SciML)
https://github.com/bashtage/linearmodels
Additional linear models including instrumental variable and panel data models that are missing from statsmodels.
data-organisation-for-better-analysis
Data Organization in Spreadsheets to Ease Further Processing
alpaqa
Library for nonconvex constrained optimization using the augmented Lagrangian method and the matrix-free PANOC algorithm.
acro
Tools for the Semi-Automatic Checking of Research Outputs. These are tools for researchers to use as drop-in replacements for common analysis commands.
https://github.com/aureme/esmecata
From taxonomic affiliations to annotated consensus proteomes using UniProt database.
@stdlib/blas-ext-base-gsumpw
Calculate the sum of strided array elements using pairwise summation.
tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
nempi
Nested Effects Models based Perturbation Inference - https://doi.org/10.1093/bioinformatics/btab113 - https://bioconductor.org/packages/nempi
zntrack
Create, visualize, run & benchmark DVC pipelines in Python & Jupyter notebooks.
https://github.com/biocore/q2-greengenes2
A QIIME 2 plugin for interaction with the Greengenes2 database
pybop
A parameterisation and optimisation package for battery models.
https://github.com/ydataai/ydata-synthetic
Synthetic data generators for tabular and time-series data
gstat
Spatial and spatio-temporal geostatistical modelling, prediction and simulation
https://github.com/alan-turing-institute/pymc3
Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine Learning with Theano
dynamicsdm
An R package for species geographical distribution and abundance modelling at high spatiotemporal resolution
osmogrid
OSMoGrid is a tool to generate life like electrical grid models based on publicly available data, mainly OpenStreetMap. The tool puts a special focus on low voltage grids.
lightning-uq-box
Lightning-UQ-Box: Uncertainty Quantification for Neural Networks with PyTorch and Lightning
https://github.com/broadinstitute/str-analysis
Scripts and utilities related to analyzing short tandem repeats (STRs).
@stdlib/random-base-improved-ziggurat
Normally distributed pseudorandom numbers using the improved Ziggurat method.
constrain
Control Strainer (ConStrain) is a data-driven knowledge-integrated framework that automatically verifies that building system controls function as intended.
pbxplore
A suite of tools to explore protein structures with Protein Blocks :snake:
Potnia
Potnia: A Python library for the conversion of transliterated ancient texts to Unicode - Published in JOSS (2025)
grounded-segment-anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything