rdmo
A tool to support the planning, implementation, and organization of research data management.
gis-python-power
Demonstration of the power of Python for GIS and Geosciences.
guardrails
VSCode extension to help developers set up guardrails around their functions, by helping them disambiguate purpose statements.
pytorch-benchmark
Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption
llms-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
reproducibility-icwsm25
Materials for the 2025 ICWSM workshop sessions on Computational Reproducibility and the Methods Hub
eprints2archives
Send records from an EPrints server to the Internet Archive and other web archives
glenoidplanefitting
A tool kit for performing Glenoid version measurement for shoulder reconstruction surgery
visualroberta
The first public Vietnamese visual linguistic foundation model(s)
synopticpy
Retrieve mesonet weather data as Polars DataFrames from Synoptic's Weather API.
public_open_source_data_science
A repository of open source data science projects for social good
zntrack
Create, visualize, run & benchmark DVC pipelines in Python & Jupyter notebooks.
wiki-entity-summarization-preprocessor
Convert Wikidata and Wikipedia raw files to filterable formats with a focus of marking Wikidata as summaries based on their Wikipedia abstracts.
y-tal
(Your)-Transient Auxiliary Library - Toolkit for simulation and analysis of time-resolved light transport captures - pip install y-tal
ant_biogenic_structures
Habitat structural complexity of a biogenic reef in Ellis Fjord, Antarctica
pyvaporation
The solution for modelling pervaporation membrane performance based on experimental data
centerline-width
A Python package to find the centerline and width of rivers based on the latitude and longitude of the right and left bank
unifmu
A universal mechanism for implementing Functional Mock-up Units (FMUs) in various languages
easyfea
EasyFEA is a user-friendly Python library that simplifies finite element analysis.
SciMLTutorials
Tutorials for doing scientific machine learning (SciML) and high-performance differential equation solving with open source software.
https://github.com/google-research/retvec
RETVec is an efficient, multilingual, and adversarially-robust text vectorizer.
eyepy
A python package to read, analyse and visualize OCT and fundus data from various sources.
pytextclassifier
pytextclassifier is a toolkit for text classification. 文本分类,LR,Xgboost,TextCNN,FastText,TextRNN,BERT等分类模型实现,开箱即用。
thinc
🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
hgboost
hgboost is a python package for hyper-parameter optimization for xgboost, catboost or lightboost using cross-validation, and evaluating the results on an independent validation set. hgboost can be applied for classification and regression tasks.
summertime
An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo
crrt-route-finder
Algorithm to find promising options for combined road-rail transportation in a logistics network
distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
ampligone
A tool in order to accurately remove primer sequences from NGS reads in an amplicon experiment
tclf
A scikit-learn compatible classifier to perform trade classification in Python.
beeai-framework
Build production-ready AI agents in both Python and Typescript.
lasio
Python library for reading and writing well data using Log ASCII Standard (LAS) files
translate-shell
Translate text by google, bing, youdaozhiyun, haici, stardict, openai, large language model of local machine, etc at same time from CLI, GUI (GNU/Linux, Android, macOS and Windows), REPL, python, shell and vim.
twoaxistracking
twoaxistracking is a python package for simulating two-axis tracking solar collectors, particularly self-shading.
spinnmachine
A python module which contains a representation of the SpiNNaker Machine
python-ecology-lesson
Data Analysis and Visualization in Python for Ecologists
gval
A high-level Python framework to evaluate the skill of geospatial datasets by comparing candidates to benchmark maps producing agreement maps and metrics.
countryinfo
A python module for returning data about countries, ISO info and states/provinces within them.
scikit-build-core
A next generation Python CMake adaptor and Python API for plugins
d3heatmap
d3heatmap is a Python package to create interactive heatmaps based on d3js.
basedosdados
⚙️ Código de manutenção do datalake (metadados e pacotes de acesso) | 📖 Docs: https://basedosdados.org/docs/home
pyscipopt-ml
Python interface to automatically formulate Machine Learning models into Mixed-Integer Programs
spinnakergraphfrontend
Front end for SpiNNaker software which only uses a graph
classeval
Evaluation of supervised predictions for two-class and multi-class classifiers
parshift
Python package based on Gibson's framework (2003) for turn-taking in group conversation analysis.