Visualizations with statistical details
Visualizations with statistical details: The 'ggstatsplot' approach - Published in JOSS (2021)
Python class defining a machine learning dataset ensuring key-based correspondence and maintaining integrity
Python class defining a machine learning dataset ensuring key-based correspondence and maintaining integrity - Published in JOSS (2017)
awesome-conformal-prediction
A professionally curated list of awesome Conformal Prediction videos, tutorials, books, papers, PhD and MSc theses, articles and open-source libraries.
GeoStats.jl -- High-performance geostatistics in Julia
GeoStats.jl -- High-performance geostatistics in Julia - Published in JOSS (2018)
https://github.com/alan-turing-institute/clevercsv
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
public_open_source_data_science
A repository of open source data science projects for social good
https://github.com/chris-prener/testdriver
R package with data for teaching Statistics and Data Science
basic_statistics
Repository for teaching basics of statistics for machine learning
climate-change-data
:earth_africa: A curated list of APIs, open data and ML/AI projects on climate change
NYISOToolkit
Access data, statistics, and visualizations for New York's electricity grid.
https://github.com/mightymetrika/npboottprm
Nonparametric Bootstrap Test with Pooled Resampling
tree.interpreter
Decision tree interpreter for randomForest/ranger as described in
https://github.com/nup002/pymjc
A python implementation of the Minimum Jump Cost dissimilarity measure.
https://github.com/cedrickchee/kaggle-facial-detection
Facial keypoints detection challenge tutorial and solution for Singapore Kaggle ML Challenge meetup.
cRegulome
An R package to access, manage and visualize regulome (microRNA/transcription factors)-gene correlations in cancer
machine-translation-for-african-languages
This repository focuses on developing machine translation and NLP tools specifically for African languages. Join us in addressing the challenges and opportunities in this vital area of language technology! 🛠️🌍
getting-_started_data_science
Resources to get started in data science and teach yourself data science
renku
Renku provides a platform and tools for reproducible and collaborative data analysis.
dataverseur
🫖 A dataverse API R wrapper to enhance the deposit procedure using only R variable declarations
python4datascience
Teaching materials for the cusy training courses on Python-based data science workflows: https://cusy.io/en/seminars
https://github.com/ibridges-for-irods/ibridges
A wrapper around the python-irodsclient to allow for easy interaction with iRODS servers.
https://github.com/emptymalei/mini-lab
Some code snippets used to explain stuff to myself in my personal data science wiki
https://github.com/romerusersgroup/romerusersgroup.github.io
Local Chapter of R User Groups