real-state-prices-uk
Falla-Gutierrez, H. (2024). Dashoard: housing prices in England and Wales 2023 (Version 1.0) [Computer software]
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (12.1%) to scientific vocabulary
Repository
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md

1. Introduction
This repository uses data provided by HM Land Registry to explore the distribution of real estate prices in England and Wales, two countries of the United Kingdom. I used the statistical dataset "Price Paid Data", which records the amounts paid for properties in England and Wales. You can access the original file here: http://prod.publicdata.landregistry.gov.uk.s3-website-eu-west-1.amazonaws.com/pp-2023.csv
I used GitHub Codespaces at every stage of this project.

2. Methodology
2.1 Pipelines
I ran Mage in a Docker container for orchestration. I created a data loader that retrieves the data directly from HM Land Registry and uploads it to Google Cloud Storage (my data lake). The loader is written in R.
https://github.com/gutfalla/Real-state-prices-UK/blob/main/realstatepricesuk/dataloaders/pricepaiddata_2023.r
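The linked loader is written in R; the following is only a minimal Python sketch of the same idea (download the CSV, attach column names, return rows). It assumes the yearly file has no header row and is UTF-8 encoded. The snake_case column names, `fetch_ppd_text`, `parse_ppd` and the two sample rows are hypothetical and made up for illustration; the upload to the data lake is not shown.

```python
import csv
import io
import urllib.request

# Assumption: the file is headerless, so names are attached here.
# The snake_case labels are invented for this sketch.
PPD_COLUMNS = [
    "transaction_id", "price", "date_of_transfer", "postcode",
    "property_type", "old_new", "duration", "paon", "saon", "street",
    "locality", "town_city", "district", "county",
    "ppd_category_type", "record_status",
]

PPD_2023_URL = (
    "http://prod.publicdata.landregistry.gov.uk"
    ".s3-website-eu-west-1.amazonaws.com/pp-2023.csv"
)


def fetch_ppd_text(url: str = PPD_2023_URL) -> str:
    """Download the raw CSV as text (needs network; not called below)."""
    with urllib.request.urlopen(url) as response:
        return response.read().decode("utf-8")


def parse_ppd(text: str) -> list[dict[str, str]]:
    """Turn headerless CSV text into one dict per transaction."""
    reader = csv.reader(io.StringIO(text))
    return [dict(zip(PPD_COLUMNS, row)) for row in reader if row]


# Two made-up rows in the assumed layout, so the sketch runs offline.
SAMPLE_TEXT = '''"{SKETCH-0001}","250000","2023-01-15 00:00","AB1 2CD","T","N","F","12","","HIGH STREET","","LONDON","CAMDEN","GREATER LONDON","A","A"
"{SKETCH-0002}","410000","2023-02-03 00:00","EF3 4GH","D","Y","F","7","","MILL LANE","","CARDIFF","CARDIFF","CARDIFF","A","A"
'''

rows = parse_ppd(SAMPLE_TEXT)
print(len(rows), rows[0]["price"], rows[1]["town_city"])
```

To run it against the real file, `parse_ppd(fetch_ppd_text())` would replace the sample text.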
Then, I created a transformer that counts missing values and removes them. The transformer is also written in R.
https://github.com/gutfalla/Real-state-prices-UK/blob/main/realstatepricesuk/transformers/transformppd2023.r
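As an illustration of what "counting missing values and removing them" can look like, here is a small Python sketch (the linked transformer is in R and may work differently). It treats empty or whitespace-only fields as missing and drops any row that lacks a value in a chosen set of columns. The function names, the choice of required columns and the sample rows are all assumptions made for this example.

```python
def is_missing(value) -> bool:
    """Treat None, empty strings and whitespace-only strings as missing."""
    return value is None or str(value).strip() == ""


def count_missing(rows, columns):
    """Return {column: number of rows where that column is missing}."""
    return {
        col: sum(1 for row in rows if is_missing(row.get(col)))
        for col in columns
    }


def drop_missing(rows, required):
    """Keep only the rows that have a value in every required column."""
    return [
        row for row in rows
        if not any(is_missing(row.get(col)) for col in required)
    ]


# Made-up rows for illustration only.
sample_rows = [
    {"price": "250000", "postcode": "AB1 2CD", "county": "GREATER LONDON"},
    {"price": "", "postcode": "EF3 4GH", "county": "CARDIFF"},
    {"price": "180000", "postcode": " ", "county": "KENT"},
]

required_columns = ["price", "postcode"]  # hypothetical choice
missing_counts = count_missing(sample_rows, required_columns)
clean_rows = drop_missing(sample_rows, required_columns)
print(missing_counts, len(clean_rows))
```

Restricting the check to a list of required columns is a design choice of this sketch: dropping rows on any empty field would also discard records whose optional address fields are simply blank.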
Afterwards, I created a data exporter that sends the transformed dataset to BigQuery (my cloud data warehouse).
https://github.com/gutfalla/Real-state-prices-UK/blob/main/realstatepricesuk/dataexporters/exportppd2023bigquery.sql
Finally, I created a trigger that runs all the pipelines on the 20th of each month at 23:59, because HM Land Registry updates the information at that date and time. Running the pipelines right after the update keeps the data fresh.

2.2 Transformation in warehouse
I used dbt for development, testing and documentation. I connected dbt to the BigQuery database containing all the data generated by the previous pipelines. Following good data engineering practice, I developed a schema that adds multiple tests to the variables. The final schema is: https://github.com/gutfalla/Real-state-prices-UK/blob/main/models/staging/schema.yml .
The query I ran in BigQuery is: https://github.com/gutfalla/Real-state-prices-UK/blob/main/models/staging/stg_2023.sql .
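The linked schema file defines the actual tests. To make concrete what schema tests of this kind assert, the Python sketch below mimics three of dbt's built-in generic tests (`not_null`, `unique`, `accepted_values`) on in-memory rows: each function returns the offending rows or values, and an empty result means the test passes. Which tests and columns the real schema uses is not stated here, so the column names, the sample rows and the property-type codes are assumptions for illustration.

```python
from collections import Counter


def not_null(rows, column):
    """Rows that fail a not-null check on `column`."""
    return [row for row in rows if row.get(column) is None]


def unique(rows, column):
    """Values that appear more than once in `column` (nulls are ignored)."""
    counts = Counter(
        row[column] for row in rows if row.get(column) is not None
    )
    return sorted(value for value, n in counts.items() if n > 1)


def accepted_values(rows, column, values):
    """Rows whose non-null value in `column` is outside the allowed set."""
    return [
        row for row in rows
        if row.get(column) is not None and row[column] not in values
    ]


# Made-up rows for illustration only.
sample_rows = [
    {"transaction_id": "a1", "price": 250000, "property_type": "T"},
    {"transaction_id": "a2", "price": None, "property_type": "D"},
    {"transaction_id": "a2", "price": 180000, "property_type": "X"},
]

# Assumed codes: detached, semi-detached, terraced, flat, other.
PROPERTY_TYPES = {"D", "S", "T", "F", "O"}

null_prices = not_null(sample_rows, "price")
duplicate_ids = unique(sample_rows, "transaction_id")
bad_types = accepted_values(sample_rows, "property_type", PROPERTY_TYPES)
print(len(null_prices), duplicate_ids, len(bad_types))
```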
I added a second trigger to keep the information in the dashboard fresh. It runs all the transformations and SQL queries in the data warehouse on the 21st of each month at 01:00.
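The two monthly schedules (20th at 23:59 for the pipelines, 21st at 01:00 for the warehouse transformations) correspond to the standard five-field cron expressions `59 23 20 * *` and `0 1 21 * *`. Whether the triggers are actually configured with cron expressions, and in which time zone, is not stated, so the Python sketch below only illustrates the timing. The helper `next_monthly_run` is hypothetical.

```python
from datetime import datetime, timedelta


def next_monthly_run(now: datetime, day: int, hour: int, minute: int) -> datetime:
    """Next occurrence of `day` at `hour:minute` strictly after `now`.

    Only safe for day <= 28, which holds for the 20th and the 21st.
    """
    candidate = now.replace(
        day=day, hour=hour, minute=minute, second=0, microsecond=0
    )
    if candidate > now:
        return candidate
    # Already passed this month: roll over to the same day next month.
    if now.month == 12:
        return candidate.replace(year=now.year + 1, month=1)
    return candidate.replace(month=now.month + 1)


now = datetime(2024, 4, 8, 12, 0)  # arbitrary reference time
load_run = next_monthly_run(now, day=20, hour=23, minute=59)
transform_run = next_monthly_run(now, day=21, hour=1, minute=0)
gap = transform_run - load_run
print(load_run, transform_run, gap)
```

With these times the transformations start 61 minutes after the pipelines, so this ordering only guarantees fresh data in the dashboard if the load finishes within that window.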

2.3 Dashboard
I used Google Looker Studio to build the dashboards for the analysis.
Page 1 contains a heat map built from the price data. The map supports zooming and has a control for filtering by price, along with other filters such as county and city.
Page 2 contains pie charts for the condition of the property and the kind of property, as well as a time series of the number of sales per day.
You can access the dashboards by following the link below:
https://lookerstudio.google.com/reporting/4654857a-295e-4b0a-9720-47906b0afecd
Contact
I am based in London and looking for opportunities there. If you would like to contact me or learn more about my profile, please reach me through LinkedIn.
https://www.linkedin.com/in/hd-falla-gutierrez
Have a nice week!
Owner
- Login: gutfalla
- Kind: user
- Repositories: 1
- Profile: https://github.com/gutfalla
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Falla-Gutierrez
    given-names: Hernan
    orcid: https://orcid.org/0000-0001-5506-2987
title: "Dashoard: housing prices in England and Wales 2023"
version: 1.0
date-released: 2024-04-08
Dependencies
- mageai/mageai latest build
- dpage/pgadmin4 latest
- mageai/mageai latest
- postgres 14