cuallee
cuallee: A Python package for data quality checks across multiple DataFrame APIs - Published in JOSS (2024)
UTDEventData
UTDEventData: An R package to access political event data - Published in JOSS (2019)
https://github.com/brandmaier/semtree
Recursive Partitioning for Structural Equation Models
reductstore
High Performance Storage and Streaming Solution for Data Acquisition Systems
ustore
Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang 🗄️
https://github.com/vaexio/vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
grapetree
GrapeTree is a fully interactive, tree visualization program, which supports facile manipulations of both tree layout and metadata. Click the first link to launch: https://achtman-lab.github.io/GrapeTree/MSTree_holder.html
https://github.com/ganweisoft/gateway
Gateway is a high-performance, centralized communication and scheduling module for various device plugins. It uniformly converts heterogeneous data into standardized models and delivers core functionalities such as real-time data storage, alarm triggering, linkage control, and task planning.
https://github.com/citiususc/polypus
Polypus: a Big Data Self-Deployable Architecture for Microblogging Text Extraction and Real-Time Sentiment Analysis
https://github.com/bigbio/quantms-utils
A python library with scripts and helpers classes for quantms workflow
https://github.com/cloudslab/dspark
Source code for the work "dSpark: Deadline-Based Resource Allocation for Big Data Applications in Apache Spark" published in IEEE e-Science 2017
resilienceatlas
Resilience Atlas - Evidence-based decision-making around resilience