Scientific Software
Updated 6 months ago

pytorch-widedeep — Peer-reviewed • Rank 17.8 • Science 100%

pytorch-widedeep: A flexible package for multimodal deep learning - Published in JOSS (2023)

Mathematics Engineering Artificial Intelligence and Machine Learning (45%)
Scientific Software · Peer-reviewed
Updated 6 months ago

autopytorch • Rank 15.6 • Science 77%

Automatic architecture search and hyperparameter optimization for PyTorch

Updated 6 months ago

dataset-phenotypes • Rank 4.9 • Science 72%

Preparatory scripts for BIDS tabular phenotypic data in large neuroimaging datasets.

Updated 6 months ago

openml • Rank 20.3 • Science 54%

OpenML's Python API for a World of Data and More 💫

Updated 5 months ago

https://github.com/vaexio/vaex • Rank 24.4 • Science 36%

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

Updated 6 months ago

caspr • Rank 5.5 • Science 54%

CASPR is a deep learning framework applying transformer architecture to learn and predict from tabular data at scale.

Updated 6 months ago

tinto-prueba • Rank 5.4 • Science 49%

TINTO: Software to convert Tidy Data into Image for Classification with 2-Dimensional Convolutional Neural Networks

Updated 6 months ago

taulu • Rank 8.4 • Science 44%

Taulu is a Python package designed to segment tabular data in scanned or photographed documents.

Updated 5 months ago

ktrain • Rank 18.0 • Science 33%

ktrain is a Python library that makes deep learning and AI more accessible and easier to apply

Updated 5 months ago

https://github.com/atrcheema/ai4water • Rank 9.5 • Science 36%

framework for developing machine (and deep) learning models for structured data

Updated 5 months ago

https://github.com/root-11/tablite • Rank 11.7 • Science 13%

multiprocessing enabled out-of-memory data analysis library for tabular data.

Updated 5 months ago

https://github.com/astrazeneca/subtab • Rank 5.0 • Science 10%

The official implementation of the paper, "SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning"

Updated 5 months ago

https://github.com/ajayarunachalam/msda • Rank 4.9 • Science 10%

Library for multi-dimensional, multi-sensor, uni/multivariate time series data analysis, unsupervised feature selection, unsupervised deep anomaly detection, and prototype of explainable AI for anomaly detector

Updated 5 months ago

https://github.com/desbordante/desbordante-core • Science 49%

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

Updated 5 months ago

https://github.com/calgo-lab/tab_err • Science 26%

Fully-controlled realistic error generation for tabular data.

Updated 6 months ago

talent • Science 67%

A comprehensive toolkit and benchmark for tabular data learning, featuring 30+ deep methods, more than 10 classical methods, and 300 diverse tabular datasets.

Updated 6 months ago

dpl • Science 44%

[NeurIPS 2023] Multi-fidelity hyperparameter optimization with deep power laws that achieves state-of-the-art results across diverse benchmarks.

Updated 6 months ago

folktexts • Science 52%

Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!

Updated 6 months ago

ts3l • Science 67%

A PyTorch Lightning-based library for self- and semi-supervised learning on tabular data.

Updated 6 months ago

nemo • Science 44%

a Stellar Dynamics Toolbox (Not Everybody Must Observe)

Updated 5 months ago

https://github.com/cosbidev/naim • Science 23%

Official implementation for the paper ``Not Another Imputation Method: A Transformer-based Model for Missing Values in Tabular Datasets´´

Updated 5 months ago

https://github.com/computationalproteomics/omicloupe • Science 10%

Understanding expression across comparisons and datasets through interactive visualization

Updated 5 months ago

https://github.com/clearbox-ai/clearbox-synthetic-kit • Science 26%

Clearbox AI's all-in-one solution for generation and evaluation of synthetic tabular and time-series data.

Updated 5 months ago

https://github.com/sebhaan/tabpfgen • Science 26%

TabPFGen: Synthetic Tabular Data Generation with TabPFN