real-state-prices-uk

Falla-Gutierrez, H. (2024). Dashoard: housing prices in England and Wales 2023 (Version 1.0) [Computer software]

https://github.com/gutfalla/real-state-prices-uk

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.1%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Falla-Gutierrez, H. (2024). Dashoard: housing prices in England and Wales 2023 (Version 1.0) [Computer software]

Basic Info
  • Host: GitHub
  • Owner: gutfalla
  • Language: Python
  • Default Branch: main
  • Homepage:
  • Size: 26.4 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created almost 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme Citation

README.md

Alt Text

1. Introduction

This repository uses data provided by the HM Land Registry to explore the distribution of real estate prices in two countries of the United Kingdom. I use the statistical dataset "Price Paid Data" which contains information on the amounts paid for properties in England and Wales. You can access the original file here: http://prod.publicdata.landregistry.gov.uk.s3-website-eu-west-1.amazonaws.com/pp-2023.csv

I used codespaces in all the stages of this project.

Alt Text

2. Methodology

2.1 Pipelines

I ran Mage on a docker container for the orchestration. I create a loader for retrieving the data directly from the HM Land Registry and submit it to Google Cloud Service (my data lake). The language employed for such a task was R.

https://github.com/gutfalla/Real-state-prices-UK/blob/main/realstatepricesuk/dataloaders/pricepaiddata_2023.r

Then, I created a transformer for counting missed values and eliminating them. The language employed was R.

https://github.com/gutfalla/Real-state-prices-UK/blob/main/realstatepricesuk/transformers/transformppd2023.r

Afterwards, I create a data exporter for sending the transformed data set to BigQuery (my cloud warehouse).

https://github.com/gutfalla/Real-state-prices-UK/blob/main/realstatepricesuk/dataexporters/exportppd2023bigquery.sql

Finally, I created a trigger for running all the pipelines on the 20th of each month at 23:59. The reason is that HM Land Registry updates the information at this date and time. I tried to guarantee the freshness of the data by doing that.

Alt Text

2.2 Transformation in warehouse

I employed DBT for the development, testing and documentation. I connected DBT to the Big Query database containing all the data generated in the previous pipelines. I developed a schema adding multiple tests to variables as a data engineering good practice. The final schema was: https://github.com/gutfalla/Real-state-prices-UK/blob/main/models/staging/schema.yml .

The query I ran in Big Query was: https://github.com/gutfalla/Real-state-prices-UK/blob/main/models/staging/stg_2023.sql .

I added a second trigger so the freshness of the information in the dashboard is guaranteed. The trigger runs all the transformations and SQL queries in the data warehouse on the 21st of each month at 01:00

Alt Text

2.3 Dashboard

I employed Google Lookerstudio for the generation of dashboards for the analysis.

I included on page 1 a heat map built with the data of prices. The map does support zooming and has a controller for filtering by price. It contains other filters, such as county, city, etc.

I included On page 2 pie charts for the condition of the property and the kind of property, as well as a time series with the number of sales per day.

You can access the dashboards by following the link below:

https://lookerstudio.google.com/reporting/4654857a-295e-4b0a-9720-47906b0afecd

Contact

I am established and looking for opportunities in London. If you want to contact me or have more information about my profile, please reach me through LinkedIn.

https://www.linkedin.com/in/hd-falla-gutierrez

Have a nice week!

Owner

  • Login: gutfalla
  • Kind: user

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Falla-Gutierrez
    given-names: Hernan
    orcid: https://orcid.org/0000-0001-5506-2987
title: "Dashoard: housing prices in England and Wales 2023"
version: 1.0
date-released: 2024-04-08

GitHub Events

Total
Last Year

Dependencies

Dockerfile docker
  • mageai/mageai latest build
docker-compose.yml docker
  • dpage/pgadmin4 latest
  • mageai/mageai latest
  • postgres 14
real_state_prices_uk/requirements.txt pypi