VeridicalFlow
VeridicalFlow: a Python package for building trustworthy data science pipelines with PCS - Published in JOSS (2022)
toughio
toughio: Pre- and post-processing Python library for TOUGH - Published in JOSS (2020)
SeqTools
SeqTools: A python package for easy transformation, combination and evaluation of large datasets. - Published in JOSS (2018)
Siril
Siril: An Advanced Tool for Astronomical Image Processing - Published in JOSS (2024)
contextualspellcheck
✔️Contextual word checker for better suggestions (not actively maintained)
pyprep
PyPREP: A Python implementation of the Preprocessing Pipeline (PREP) for EEG data
boxsers
Python package that provides a full range of functionality to process and analyze vibrational spectra (Raman, SERS, FTIR, etc.).
dmriprep
dMRIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data. The transparent workflow dispenses with manual intervention, thereby ensuring the reproducibility of the results.
english-text-normalization
Command-line interface (CLI) and library to normalize English texts.
MODIStsp
An "R" package for automatic download and preprocessing of MODIS Land Products Time Series
prospectr
R package: Misc. Functions for Processing and Sample Selection of Spectroscopic Data
rgcxgc
An R package for preprocessing and Multivariate Analysis of Two-dimensional Gas Chromatography Data
https://github.com/bids-apps/freesurfer
BIDS app wrapping recon-all from FreeSurfer
jaconv
Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku
https://github.com/bencardoen/datacurator.jl
A scalable Julia package to transparently validate and transform large biomedical datasets using human readable recipes that are translated to machine verifiable templates.
docutron
Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.
https://github.com/bids-apps/hcppipelines
A BIDS App for minimal preprocessing using the HCP Pipelines
pybear
pybear is a Python computing library that augments data analytics functionality found in popular packages that use the scikit-learn API, such as scikit-learn and xgboost.
tubecleanr
(Mini) R package for preprocessing YouTube comment data collected with tuber or vosonSML
ea34568e-d86e-4720-be2f-3f826f66a26c
Describing a pipeline to preprocess NOAA gridded rainfall reanalysis dataset
https://github.com/bids-apps/afni_proc
Prototype AFNI BIDS App implementing participant-level preprocessing with afni_proc.py
https://github.com/omigutin/img_ops
A minimal, device-aware image ops library built on top of OpenCV and NumPy.
cleaned-swansf-dataset
The SWAN-SF dataset is now fully preprocessed, optimized, and ready for binary classification tasks. Our team is excited to release the enhanced version of the SWAN-SF dataset across all five partitions.
eyeris
🧠 Advanced reproducible pupillometry preprocessing in R | Interactive reports, BIDS-compliant, High-throughput database tooling out-of-the-box | Actively developed by cognitive neuroscientists at Stanford University
qub-hri
Preprocessing Repository of QUB-Perception of Human Engagement in Assembly Operations Dataset
https://github.com/bids-apps/cpac
BIDS Application for the Configurable Pipeline for the Analysis of Connectomes (C-PAC)
nifreeze
A flexible framework for volume-wise artifact estimation and correction across multiple 4D neuroimaging modalities (diffusion MRI, functional MRI, and PET)
https://github.com/darkstarstrix/datavolt
Reusable data engineering toolkit. My personal data infrastructure.
https://github.com/andrei-vataselu/data-science-snippets
🧰 Essential EDA and Data Cleaning Helpers for Any DataFrame. This collection of functions is designed to accelerate exploratory data analysis (EDA), quickly surface data quality issues, and offer high-level insights into the structure and content of your dataset.
semantic-outlier-removal
Code and data for SORE (ACL 2025), a semantic boilerplate remover.
portagetextprocessing
Text processing tools that came out of the Portage SMT project — Outils de traitement de texte issus du projet Portage de TAS