public_open_source_data_science
A repository of open source data science projects for social good
zntrack
Create, visualize, run & benchmark DVC pipelines in Python & Jupyter notebooks.
pbxplore
A suite of tools to explore protein structures with Protein Blocks :snake:
sumo
sumo: Command-line tools for plotting and analysis of periodic *ab initio* calculations - Published in JOSS (2018)
stream-learn
stream-learn is an open-source Python library for difficult data stream analysis.
wiki-entity-summarization-preprocessor
Convert Wikidata and Wikipedia raw files to filterable formats, with a focus on marking Wikidata as summaries based on their Wikipedia abstracts.
Integrated hydrologic model development and postprocessing for GSFLOW using pyGSFLOW
Integrated hydrologic model development and postprocessing for GSFLOW using pyGSFLOW - Published in JOSS (2022)
https://github.com/bluebrain/brayns
Visualizer for large-scale and interactive ray-tracing of neurons
auto-fox
A library for analyzing potential energy surfaces (PESs) and using the resulting PES descriptors for constructing forcefield parameters.
pyjams
A general Python package with miscellaneous utility functions used in several other packages.
https://github.com/timeseriesai/tsai
State-of-the-art deep learning library for time series and sequences in PyTorch / fastai
https://github.com/wmd-group/pdyna
Python package to analyse the structural dynamics of perovskites
https://github.com/pycalphad/scheil
A Scheil-Gulliver simulation tool using pycalphad.
https://github.com/rerun-io/rerun
Visualize streams of multimodal data. Free, fast, easy to use, and simple to integrate. Built in Rust.
https://github.com/pycalphad/pycalphad
CALPHAD tools for designing thermodynamic models, calculating phase diagrams and investigating phase equilibria.
y-tal
(Your)-Transient Auxiliary Library - Toolkit for simulation and analysis of time-resolved light transport captures - pip install y-tal
ant_biogenic_structures
Habitat structural complexity of a biogenic reef in Ellis Fjord, Antarctica
pyvaporation
The solution for modelling pervaporation membrane performance based on experimental data
https://github.com/amenra/ranx
⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍
centerline-width
A Python package to find the centerline and width of rivers based on the latitude and longitude of the right and left bank
unifmu
A universal mechanism for implementing Functional Mock-up Units (FMUs) in various languages
easyfea
EasyFEA is a user-friendly Python library that simplifies finite element analysis.
itkwidgets
An elegant Python interface for visualization on the web platform to interactively generate insights into multidimensional images, point sets, and geometry.
SciMLTutorials
Tutorials for doing scientific machine learning (SciML) and high-performance differential equation solving with open source software.
imbalanced-learn
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
cuda-quantum
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
cna
Covarying neighborhood analysis (CNA) is a method for finding structure in, and conducting association analysis with, multi-sample single-cell datasets.
diffsims
An open-source Python library providing utilities for simulating diffraction
https://github.com/antsx/antspy
A fast medical imaging analysis library in Python with algorithms for registration, segmentation, and more.
https://github.com/google-research/retvec
RETVec is an efficient, multilingual, and adversarially-robust text vectorizer.
ads2inspire
Replace ADS citations with the appropriate INSPIRE ones in latex and bibtex
eyepy
A Python package to read, analyse and visualize OCT and fundus data from various sources.
pytextclassifier
pytextclassifier is a toolkit for text classification. It provides implementations of LR, XGBoost, TextCNN, FastText, TextRNN, BERT and other classification models, ready to use out of the box.
thinc
🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
https://github.com/fwilliams/point-cloud-utils
An easy-to-use Python library for processing and manipulating 3D point clouds and meshes.
hgboost
hgboost is a python package for hyper-parameter optimization for xgboost, catboost or lightboost using cross-validation, and evaluating the results on an independent validation set. hgboost can be applied for classification and regression tasks.
omc3
Python 3 codes for beam optics measurements and corrections in circular particle accelerators
https://github.com/facebookresearch/habitat-lab
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
summertime
An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo
crrt-route-finder
Algorithm to find promising options for combined road-rail transportation in a logistics network
distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
ampligone
A tool to accurately remove primer sequences from NGS reads in an amplicon experiment
tclf
A scikit-learn compatible classifier to perform trade classification in Python.
pyqtgraph
Fast data visualization and GUI tools for scientific / engineering applications
Capytaine
Capytaine: a Python-based linear potential flow solver - Published in JOSS (2019)
pyemu
Python modules for model-independent uncertainty analyses, data-worth analyses, and interfacing with PEST(++)
kinisi
kinisi: Bayesian analysis of mass transport from molecular dynamics simulations - Published in JOSS (2024)
beeai-framework
Build production-ready AI agents in both Python and TypeScript.
lasio
Python library for reading and writing well data using Log ASCII Standard (LAS) files
translate-shell
Translate text with Google, Bing, youdaozhiyun, haici, stardict, OpenAI, a local large language model, etc. at the same time from the CLI, GUI (GNU/Linux, Android, macOS and Windows), REPL, Python, shell and Vim.