Updated 5 months ago
https://github.com/cdcgov/cdh-lava-react
CDC Data Hub Lifecycle, Analysis & Visualization Accelerator (LAVA) REACT Components based on machine readable requirements.
Updated 6 months ago
pysparklyr
Extension to {sparklyr} that allows you to interact with Spark & Databricks Connect
Updated 5 months ago
https://github.com/johnsnowlabs/johnsnowlabs
Gateway into the John Snow Labs Ecosystem
Updated 5 months ago
https://github.com/dadananjesha/azuredataengine
AzureDataEngine is a robust, scalable batch processing data architecture built on the Azure platform. It efficiently extracts, transforms, and loads massive datasets for machine learning applications, leveraging Azure Blob Storage, PostgreSQL, Databricks, and Key Vault to ensure reliability and maintainability.
Updated 5 months ago
https://github.com/data-miner00/spark
A laboratory to carry out experiments with PySpark