Scientific Software
Updated 6 months ago

Feature-engine — Peer-reviewed • Rank 24.3 • Science 93%

Feature-engine: A Python package for feature engineering for machine learning - Published in JOSS (2021)

Mathematics Economics
Scientific Software · Peer-reviewed
Scientific Software
Updated 6 months ago

fseval — Peer-reviewed • Rank 7.8 • Science 98%

fseval: A Benchmarking Framework for Feature Selection and Feature Ranking Algorithms - Published in JOSS (2022)

Scientific Software
Updated 6 months ago

UBayFS — Peer-reviewed • Rank 8.0 • Science 95%

UBayFS: An R Package for User Guided Feature Selection - Published in JOSS (2023)

Scientific Software
Updated 6 months ago

RENT — Peer-reviewed • Rank 4.3 • Science 93%

RENT: A Python Package for Repeated Elastic Net Feature Selection - Published in JOSS (2021)

Updated 5 months ago

CAST • Rank 14.7 • Science 59%

Developer Version of the R package CAST: Caret Applications for Spatio-Temporal models

Updated 5 months ago

deepfastmlu • Rank 5.2 • Science 57%

Machine learning utilities to help speed up the prototyping process.

Updated 5 months ago

nestfs • Rank 6.4 • Science 49%

An R package for cross-validated (nested) forward selection

Updated 6 months ago

zoish • Rank 8.1 • Science 44%

Zoish is a Python package that streamlines machine learning by leveraging SHAP values for feature selection and interpretability, making model development more efficient and user-friendly

Updated 6 months ago

pyimpetus • Rank 11.0 • Science 41%

PyImpetus is a Markov Blanket based feature subset selection algorithm that considers features both separately and together as a group in order to provide not just the best set of features but also the best combination of features

Updated 6 months ago

upgini • Rank 17.5 • Science 26%

Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs

Updated 5 months ago

https://github.com/microsoft/finnts • Rank 17.5 • Science 26%

Microsoft Finance Time Series Forecasting Framework (FinnTS) is a forecasting package that utilizes cutting-edge time series forecasting and parallelization on the cloud to produce accurate forecasts for financial data.

Updated 5 months ago

skrebate • Rank 17.9 • Science 20%

A scikit-learn-compatible Python implementation of ReBATE, a suite of Relief-based feature selection algorithms for Machine Learning.

Updated 5 months ago

FSelectorRcpp • Rank 14.1 • Science 23%

Rcpp (free of Java/Weka) implementation of FSelector entropy-based feature selection algorithms with a sparse matrix support

Updated 5 months ago

pymrmr • Rank 13.7 • Science 23%

Python3 binding to mRMR Feature Selection algorithm (currently not maintained)

Updated 5 months ago

https://github.com/critical-infrastructure-systems-lab/matlab_iterative_input_selection • Rank 3.0 • Science 33%

MatLab implementation of the Iterative Input Selection (IIS) algorithm proposed by Galelli and Castelletti (2013).

Updated 5 months ago

https://github.com/critical-infrastructure-systems-lab/iterative_input_selection • Rank 2.7 • Science 33%

MatLab / C implementation of the Iterative Input Selection (IIS) algorithm proposed by Galelli and Castelletti (2013). The underlying model (i.e. Extremely Randomized Trees) relies on C code that makes it more computationally efficient.

Updated 5 months ago

lassopv • Rank 7.1 • Science 20%

Nonparametric P-Value Estimation for Predictors in Lasso

Updated 6 months ago

fcbf • Rank 3.4 • Science 18%

Categorical feature selection based on information theoretical considerations

Updated 5 months ago

SPSP • Rank 6.7 • Science 13%

A novel approach for feature selection based on the entire solution paths rather than the choice of a single tuning parameter, which significantly improves the accuracy of the selection.

Updated 5 months ago

https://github.com/ajayarunachalam/msda • Rank 4.9 • Science 10%

Library for multi-dimensional, multi-sensor, uni/multivariate time series data analysis, unsupervised feature selection, unsupervised deep anomaly detection, and prototype of explainable AI for anomaly detector

Updated 5 months ago

https://github.com/biodataanalysisgroup/kmeranalyzer • Science 10%

An alignment-free method capable of processing and counting k-mers in a reasonable time, while evaluating multiple values of the k parameter concurrently.

Updated 5 months ago

boso • Science 13%

Bilevel Optimization Selector of Operators

Updated 6 months ago

fbp • Science 62%

Frequency Based Pruning (FBP) is a feature selection algorithm based upon maximizing the Youden J statistic. FBP intelligently enumerates through combinations of features, using the frequency of smaller patterns to prune away large regions of the solution space.

Updated 6 months ago

behavioralproject • Science 44%

Classify TD vs ASD according to SRS behavioral report severity score. ABIDE II data set is utilized for training and testing. Freesurfer v6 is utilized for sMRI volumes preprocessing and features extraction.

Updated 6 months ago

sentetik • Science 44%

Synthetic data generation package to balance imblanaced datasets

Updated 6 months ago

gsta • Science 57%

GSTA: Gated Spatial-Temporal Attention Approach for Travel Time Prediction

Updated 6 months ago

fedfs • Science 36%

Federated Feature Selection as in the paper https://arxiv.org/abs/2109.11323

Updated 6 months ago

l0 • Science 26%

Scalable reinforcement learning pipeline for general-purpose agents. Build and train agents using the Notebook Agent (NB-Agent) in a flexible environment. 🐙🚀

Updated 5 months ago

https://github.com/dangeospatial/gee_wetland_classification • Science 13%

A sample workflow for classifying wetlands in Google Earth Engine. Uses data from multiple sources.

Updated 6 months ago

stabilityselection • Science 57%

Perform stability selection in matlab using a variety of feature selection methods

Updated 6 months ago

compressedsensing.jl • Science 28%

Contains a wide-ranging collection of compressed sensing and feature selection algorithms. Examples include matching pursuit algorithms, forward and backward stepwise regression, sparse Bayesian learning, and basis pursuit.

Updated 6 months ago

csfs • Science 52%

A cut-and-solve based feature selection for continous data

Updated 5 months ago

https://github.com/critical-infrastructure-systems-lab/multi-objective-feature-selection • Science 23%

MatLab implementation of W-QEISS, F-QEISS and W-MOSS: three algorithms for the selection of (quasi) equally informative subsets

Scientific Software
Updated 6 months ago

PyCCEA — Peer-reviewed • Science 93%

PyCCEA: A Python package of cooperative co-evolutionary algorithms for feature selection in high-dimensional data - Published in JOSS (2025)

Updated 5 months ago

https://github.com/desbordante/desbordante-core • Science 49%

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.