Scientific Software
Updated 6 months ago

cuallee — Peer-reviewed • Rank 18.9 • Science 98%

cuallee: A Python package for data quality checks across multiple DataFrame APIs - Published in JOSS (2024)

Scientific Software
Updated 6 months ago

UTDEventData — Peer-reviewed • Rank 4.7 • Science 95%

UTDEventData: An R package to access political event data - Published in JOSS (2019)

Scientific Software · Peer-reviewed
Updated 6 months ago

io.jhdf:jhdf • Rank 21.2 • Science 77%

A pure Java HDF5 library

Updated 6 months ago

legend-pydataobj • Rank 15.3 • Science 77%

LEGEND Python Data Objects

Updated 6 months ago

arvados • Rank 23.1 • Science 54%

An open source platform for managing and analyzing biomedical big data

Updated 6 months ago

reductstore • Rank 17.6 • Science 52%

High Performance Storage and Streaming Solution for Data Acquisition Systems

Updated 6 months ago

ustore • Rank 15.3 • Science 54%

Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang 🗄️

Updated 5 months ago

https://github.com/vaexio/vaex • Rank 24.4 • Science 36%

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

Updated 6 months ago

grapetree • Rank 12.0 • Science 23%

GrapeTree is a fully interactive, tree visualization program, which supports facile manipulations of both tree layout and metadata. Click the first link to launch: https://achtman-lab.github.io/GrapeTree/MSTree_holder.html

Updated 5 months ago

https://github.com/ganweisoft/gateway • Rank 7.5 • Science 26%

Gateway is a high-performance, centralized communication and scheduling module for various device plugins. It uniformly converts heterogeneous data into standardized models and delivers core functionalities such as real-time data storage, alarm triggering, linkage control, and task planning.

Updated 5 months ago

https://github.com/citiususc/polypus • Rank 0.0 • Science 13%

Polypus: a Big Data Self-Deployable Architecture for Microblogging Text Extraction and Real-Time Sentiment Analysis

Updated 6 months ago

biglasso • Science 49%

biglasso: Extending Lasso Model Fitting to Big Data in R

Updated 5 months ago

https://github.com/bigbio/quantms-utils • Science 39%

A python library with scripts and helpers classes for quantms workflow

Updated 6 months ago

shantay • Science 44%

Trying to make sense of the EU's DSA Transparency DB

Updated 5 months ago

https://github.com/cloudslab/dspark • Science 10%

Source code for the work "dSpark: Deadline-Based Resource Allocation for Big Data Applications in Apache Spark" published in IEEE e-Science 2017

Updated 6 months ago

resilienceatlas • Science 26%

Resilience Atlas - Evidence-based decision-making around resilience