Updated 6 months ago

cytodataframe • Rank 4.3 • Science 77%

An in-memory data analysis format for single-cell profiles alongside their corresponding images and segmentation masks.

Updated 6 months ago

eland • Rank 20.0 • Science 26%

Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch

Updated 6 months ago

https://github.com/juliagraphs/graphdataframebridge.jl • Rank 7.5 • Science 26%

Tools for interoperability between DataFrame objects and LightGraphs and MetaGraphs objects

Updated 6 months ago

https://github.com/arbaznazir/datalineagepy • Rank 5.0 • Science 26%

86% faster data lineage tracking for pandas DataFrames with zero infrastructure. Real-time monitoring, ML anomaly detection, and enterprise compliance features.

Updated 6 months ago

https://github.com/rumbledb/rumble • Science 36%

⛈️ RumbleDB 2.0.0 "Lemon Ironwood" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more

Updated 6 months ago

introduction_to_r • Science 18%

This workshop provides a basic understanding of R