Updated 6 months ago
cytodataframe
An in-memory data analysis format for single-cell profiles alongside their corresponding images and segmentation masks.
Updated 6 months ago
eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Updated 6 months ago
https://github.com/juliagraphs/graphdataframebridge.jl
Tools for interoperability between DataFrame objects and LightGraphs and MetaGraphs objects
Updated 6 months ago
https://github.com/arbaznazir/datalineagepy
86% faster data lineage tracking for pandas DataFrames with zero infrastructure. Real-time monitoring, ML anomaly detection, and enterprise compliance features.
Updated 6 months ago
https://github.com/rumbledb/rumble
⛈️ RumbleDB 2.0.0 "Lemon Ironwood" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more