deafrica-tools
Repository for Digital Earth Africa Sandbox, including: Jupyter notebooks, scripts, tools and workflows for geospatial analysis with Open Data Cube and xarray
Water Systems Integrated Modelling framework, WSIMOD
Water Systems Integrated Modelling framework, WSIMOD: A Python package for integrated modelling of water quality and quantity across the water cycle - Published in JOSS (2023)
https://github.com/galsim-developers/galsim
The modular galaxy image simulation toolkit. Documentation:
edt
Euclidean distance & signed distance transform for multi-label 3D anisotropic images using marching parabolas.
twoaxistracking
twoaxistracking is a python package for simulating two-axis tracking solar collectors, particularly self-shading.
ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
hyr-sense
NASA-SBG and NSF-ESIIL "HYR-SENSE: Hyperspectral and Thermal Remote Sensing for Environmental Justice" program. Participants will gain hands-on experience with hyperspectral and thermal imaging remote sensing technology and its applications for environmental justice issues.
https://github.com/airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
spinnmachine
A python module which contains a representation of the SpiNNaker Machine
https://github.com/chaoss/grimoirelab-perceval
Send Sir Perceval on a quest to retrieve and gather data from software repositories.
gval
A high-level Python framework to evaluate the skill of geospatial datasets by comparing candidates to benchmark maps producing agreement maps and metrics.
countryinfo
A python module for returning data about countries, ISO info and states/provinces within them.
https://github.com/uxlfoundation/scikit-learn-intelex
Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
clima
The CBE Clima Tool is a web-based application built to support the need of architects and engineers interested in climate-adapted design. It allows users to analyze the climate data of more than 27,500 locations worldwide using the data contained in EPW files.
https://github.com/lancedb/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
scikit-build-core
A next generation Python CMake adaptor and Python API for plugins
hypertools
A Python toolbox for gaining geometric insights into high-dimensional data
Spine-Toolbox
Spine Toolbox is an open source Python package to manage data, scenarios and workflows for modelling and simulation. You can have your local workflow, but work as a team through version control and SQL databases.
d3heatmap
d3heatmap is a Python package to create interactive heatmaps based on d3js.
lidar
lidar: A Python package for delineating nested surface depressions from digital elevation data - Published in JOSS (2021)
h2o
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
marimo
Transform data, train models, and run SQL with marimo — feels like a next-gen reactive notebook, stored as Git-friendly reproducible Python. Deploy as scripts, pipelines, endpoints, and apps. All from an AI-native editor (or your own).
flowkit
A Python toolkit for flow cytometry analysis supporting GatingML and FlowJo workspaces
basedosdados
⚙️ Código de manutenção do datalake (metadados e pacotes de acesso) | 📖 Docs: https://basedosdados.org/docs/home
pyscipopt-ml
Python interface to automatically formulate Machine Learning models into Mixed-Integer Programs
https://github.com/flyteorg/flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
spinnakergraphfrontend
Front end for SpiNNaker software which only uses a graph
classeval
Evaluation of supervised predictions for two-class and multi-class classifiers
https://github.com/mikekeith52/scalecast
The practitioner's forecasting library
parshift
Python package based on Gibson's framework (2003) for turn-taking in group conversation analysis.
pycomplexheatmap
PyComplexHeatmap: A Python package to plot complex heatmap (clustermap)
Pybotics
Pybotics: Python Toolbox for Robotics - Published in JOSS (2019)
dlib
A toolkit for making real world machine learning and data analysis applications in C++
pelican
Static site generator that supports Markdown and reST syntax. Powered by Python.
basic-memory
AI conversations that actually remember. Never re-explain your project to Claude again. Local-first, integrates with Obsidian. Join our Discord: https://discord.gg/tyvKNccgqN
scikit-surgeryvtk
image fusion or overlay, for augmented reality apps, using VTK and OpenCV for a calibrated overlay.
MHKiT-Python
MHKiT-Python provides the marine renewable energy (MRE) community tools for data processing, visualization, quality control, resource assessment, and device performance.
GSAreport
GSAreport: Easy to Use Global Sensitivity Reporting - Published in JOSS (2022)
https://github.com/allenai/allenact
An open source framework for research in Embodied-AI from AI2.
surface-water-network
A Python package to create and analyze surface water networks.