Updated 6 months ago

pysparklyr • Science 26%

Extension to {sparklyr} that allows you to interact with Spark & Databricks Connect

Updated 5 months ago

https://github.com/dadananjesha/azuredataengine • Science 13%

AzureDataEngine is a robust, scalable batch processing data architecture built on the Azure platform. It efficiently extracts, transforms, and loads massive datasets for machine learning applications, leveraging Azure Blob Storage, PostgreSQL, Databricks, and Key Vault to ensure reliability and maintainability.

Updated 5 months ago

https://github.com/data-miner00/spark • Science 26%

A laboratory to carry out experiments with PySpark