Fruitbat
Fruitbat: A Python Package for Estimating Redshifts of Fast Radio Bursts - Published in JOSS (2019)
https://github.com/airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
https://github.com/growthbook/growthbook
Open Source Feature Flagging and A/B Testing Platform
https://github.com/elementary-data/elementary
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
https://github.com/awslabs/aws-orbit-workbench
A Data Platform built for AWS, powered by Kubernetes.
dark_matter_flow_dataset
Dark matter flow dataset from cosmological N-body simulation
https://github.com/dadananjesha/redshift-etl-project
The project covers the complete data pipeline—from importing data from an RDS source to HDFS using Sqoop, processing data with Spark, to executing analytical queries on an AWS Redshift cluster.