Updated 9 months ago

wradlib • Rank 19.3 • Science 77%

Weather radar data processing - Python package

Updated 9 months ago

com.wgzhao.addax:addax-all • Rank 15.4 • Science 44%

A fast and versatile ETL tool that can transfer data between RDBMS and NoSQL seamlessly

Updated 9 months ago

https://github.com/dadananjesha/spark-streaming • Science 13%

Spark Streaming KPI Processing is a real-time data processing application built with Apache Spark Streaming

Updated 9 months ago

https://github.com/dadananjesha/redshift-etl-project • Science 13%

The project covers the complete data pipeline: importing data from an RDS source into HDFS using Sqoop, processing it with Spark, and executing analytical queries on an AWS Redshift cluster.

Updated 9 months ago

https://github.com/rumbledb/rumble • Science 36%

⛈️ RumbleDB 2.0.0 "Lemon Ironwood" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more

Updated 9 months ago

https://github.com/a-imantha/mahout-tutorial • Science 26%

Building a Recommender with Apache Mahout on Amazon Elastic MapReduce (EMR) Tutorial