Scientific Software
Updated 6 months ago

visdat — Peer-reviewed • Rank 21.1 • Science 95%

visdat: Visualising Whole Data Frames - Published in JOSS (2017)

Scientific Software
Updated 6 months ago

FSharpGephiStreamer — Peer-reviewed • Rank 13.3 • Science 98%

FSharpGephiStreamer: An idiomatic bridge between F# and network visualization - Published in JOSS (2019)

Scientific Software
Updated 6 months ago

Arabica — Peer-reviewed • Rank 12.3 • Science 93%

Arabica: A Python package for exploratory analysis of text data - Published in JOSS (2024)

Scientific Software
Updated 6 months ago

SmartEDA — Peer-reviewed • Rank 11.7 • Science 93%

SmartEDA: An R Package for Automated Exploratory Data Analysis - Published in JOSS (2019)

Scientific Software
Updated 6 months ago

Powering single-cell analyses in the browser with WebAssembly — Peer-reviewed • Rank 6.6 • Science 98%

Powering single-cell analyses in the browser with WebAssembly - Published in JOSS (2023)

Scientific Software
Updated 6 months ago

Plotrr — Peer-reviewed • Rank 2.5 • Science 95%

Plotrr: Functions for making visual exploratory data analysis with nested data easier. - Published in JOSS (2017)

Updated 6 months ago

GenomicSuperSignature • Rank 14.0 • Science 59%

Interpretation of RNAseq experiments through robust, efficient comparison to public databases

Updated 6 months ago

skimpy • Rank 17.0 • Science 44%

skimpy is a lightweight tool that provides summary statistics about variables in data frames within the console.

Scientific Software
Updated 6 months ago

edarf — Peer-reviewed • Rank 5.6 • Science 49%

edarf: Exploratory Data Analysis using Random Forests - Published in JOSS (2016)

Updated 5 months ago

lux • Rank 20.0 • Science 33%

Automatically visualize your pandas dataframe via a single print! 📊 💡

Updated 5 months ago

vtree • Rank 12.8 • Science 26%

An R package for calculating and drawing variable trees

Updated 6 months ago

geothermal_esda • Rank 0.7 • Science 28%

This repository contains exploratory spatial data analysis (ESDA) functions and scripts. These functions are designed for geothermal spatial datasets, and are applicable to other spatial datasets.

Updated 6 months ago

mltools • Rank 16.2 • Science 10%

Exploratory and diagnostic machine learning tools for R

Updated 6 months ago

mmpf • Rank 13.3 • Science 10%

Monte-Carlo methods for prediction functions

Updated 5 months ago

https://github.com/desbordante/desbordante-core • Science 49%

Desbordante is a high-performance data profiler that can discover many different patterns in data using various algorithms. It also lets you run data-cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

Updated 5 months ago

https://github.com/dadananjesha/eda-case-study • Science 13%

EDA Case Study is an exploratory data analysis project designed to uncover insights from a dataset through thorough visualization and statistical analysis.

Updated 6 months ago

tutorials-early • Science 44%

Tutorials on reading, cleaning, and validating case data, and on converting line-list data to incidence for visualizing epidemic curves.

Updated 5 months ago

expandar • Science 13%

R Package for Interactive Panel Data Exploration

Updated 5 months ago

https://github.com/nagapv/edexplore • Science 13%

A simple widget for interactive EDA / QA. Works on top of Pandas [in Jupyter Notebook] using IPyWidgets with a sprinkle of Regex.

Updated 5 months ago

bicausality • Science 23%

A framework to infer causality on binary data using techniques from frequent pattern mining and estimation statistics. Given a set of individual vectors S={x}, where x(i) is a realized value of binary variable i, the framework infers empirical causal relations between binary variables i, j from S in the form of a causal graph G=(V,E).

Updated 6 months ago

movis • Science 49%

MOVIS: A Multi-Omics Software Solution for Multi-modal Time-Series Clustering, Embedding, and Visualizing Tasks, by Aleksandar Anžel, Dominik Heider, and Georges Hattab

Updated 6 months ago

electric_vehicles • Science 44%

Undergraduate data science project at Mackenzie