traffic, a toolbox for processing and analysing air traffic data
traffic, a toolbox for processing and analysing air traffic data - Published in JOSS (2019)
heat
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
hatchet
Graph-indexed Pandas DataFrames for analyzing hierarchical performance data
arx
ARX is a comprehensive open source data anonymization tool aiming to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks and it supports well-known privacy models, such as k-anonymity, l-diversity, t-closeness and differential privacy.
pfb-toolkit
A python package to build portable Energy Management and Information Systems (EMIS) application in buildings.
https://github.com/lancedb/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
https://github.com/girder/girder
A data management platform for the web, developed by Kitware
https://github.com/countly/countly-sdk-web
Countly Product Analytics SDK for websites and web applications
https://github.com/llnl/callflow
Visualization tool for analyzing call trees and graphs
canvasXpress
CanvasXpress: A JavaScript Library for Data Analytics with Full Audit Trail Capabilities.
contextual-anomaly-detector
Contextual anomaly detection tool application in building energy field based on Matrix Profile algorithm
https://github.com/data-prompt-query/dpq
dpq is an open-source python library that makes prompt-based data transformations and feature engineering easy
https://github.com/ambiqai/physiokit
A Python toolkit to process raw ambulatory bio-signals.
https://github.com/alan-turing-institute/semaida
Semantic Technologies for the AIDA project
https://github.com/dsnchz/solid-uplot
SolidJS wrapper for uPlot — an ultra-fast, tiny time-series & charting library with a SolidJS enhanced plugin system
https://github.com/alpha-unito/pico
A C++ framework for data analytics pipelines
https://github.com/desbordante/desbordante-core
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
https://github.com/fasttrees/fasttrees
A fast and frugal tree classifier for sklearn
https://github.com/fgazzelloni/actex_october2024
ACTEX Learning: Introduction to R and Basic Data Analysis
predictive-analytics
A self-driven exploratory report on predictive data analytics employing a range of machine learning techniques