https://github.com/aavilacar/santander-bikes

Analysing the data at the Santander Cycle Hire Scheme dataset (https://console.cloud.google.com/marketplace/product/greater-london-authority/london-bicycles)

https://github.com/aavilacar/santander-bikes

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.6%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Analysing the data at the Santander Cycle Hire Scheme dataset (https://console.cloud.google.com/marketplace/product/greater-london-authority/london-bicycles)

Basic Info
  • Host: GitHub
  • Owner: aavilacar
  • License: unlicense
  • Language: Jupyter Notebook
  • Default Branch: main
  • Size: 8.96 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 2 years ago · Last pushed over 2 years ago
Metadata Files
Readme License

README.md

santander-bikes

This repository includes the ongoing project of analysing the BigQuery Santander Cycle Hire Scheme dataset to gather some insights and provide useful recommendations to the Greater London Authority. The breakdown provided below depicts the tasks completed so far and the associated files used in attempting those tasks: 1. Incorporating borough and ward information to each station: Using the Statistical GIS Boundary Files for London dataset, containing geographical boundaries for Greater London, the borough and ward of each station in the 'londonbicycles.cyclestations' table was determined. The Jupyter notebook at santander_bikes.ipynb outlines the steps taken to achieve the task. 2. Queries on the main dataset: The main 'londonbicycles.cyclehire' table, encompassing more than 80 million hires between 2015 and 2023, was then explored with BigQuery, aiming to gain a better understanding of the data. All the queries of the project, from cleaning to data aggreation and manipulation, can be found in the queries.sql file. 3. Tableau visualisations: The succeeding visualisations were then constructed, illustrating the bike movement among boroughs, the station locations, and the average daily bike change per year at six stations. 4. Relocation recommendations: The relocation_plan.ipynb Jupyter notebook includes the construction of an algorithm to plan the daily manual relocation of the bikes in 2023. This is important to ensure the smooth operation of the service, as most stations have an imbalanced daily influx-outflux of bikes. Following the mean daily influxes from 2022 and assuming we can expect similar patterns for 2023, eventually some stations will fill up while others will empty. This will disrupt some trips due to a lack of bikes at pick-up or free docks at drop-off, causing user dissatisfaction and a potential loss of users. Therefore, manually relocating the bikes is essential.




Owner

  • Name: Alejandro Avila Carrión
  • Login: aavilacar
  • Kind: user

Analyst at SDG Group Spain

GitHub Events

Total
Last Year