Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
✓DOI references
Found 1 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.0%) to scientific vocabulary
Repository
Building models for UFP and/or BC in EXPANSE.
Basic Info
- Host: GitHub
- Owner: co822ee
- License: gpl-3.0
- Language: R
- Default Branch: main
- Size: 72.3 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
airview_ufp
Version 0.1.0
Building models for UFP in EXPANSE.
Project organization
``` . ├── .gitignore ├── CITATION.md ├── LICENSE.md ├── README.md ├── config <- Configuration files (HW) ├── data <- All project data, ignored by git │ ├── processed <- The final, canonical data sets for modeling. (PG) │ ├── raw <- The original, immutable data dump. (RO) │ └── temp <- Intermediate data that has been transformed. (PG) ├── results │ ├── figures <- Figures for the manuscript or reports (PG) │ └── output <- Other output for the manuscript or reports (PG) └── src <- Source code for this project (HW)
```
src
Pipeline for creating other city-wide data (v2)
Based upon the previous data processing step for Basel, this pipeline was built for processing raw mobile data points (to convert data points to 60m road segments by snapping the data points to the nearest road segment). Detailed steps can be found in the description in the previous section.
process_pipeline.R: to process the raw ts file to mean of mean
raw ts snapped to closest road segments:
in_dir <- "data/processed/"
fnames <- list.files(in_dir, '_rawPtsLines_v3.gpkg')
src/00_process/01_process_predictors.R: to process the predictor variables' values (extracted and downloaded from GEE) for all cities
src/00_process/01_eda_stat_meanOfmean.R: to combinne the predictor vairables and means of means for all cities
- output:
data/temp/stat_new/meanOfmean_predictors_cities.Rdata-> This output is being used for developing LUR models, further processed by a function scriptsrc/01_LUR/fun_read_mobileData.R.
src/00_process/02_process_stat.R: to generate some statistics of the mobile data (using the mean of means)
- output:
data/temp/stat_new/campaign_stat_cities.Rdata - output:
data/temp/stat_new/campaign_stat_cities2.Rdata - output:
data/temp/stat_new/campaignRoadInfo_cities.Rdata
src/00_process/03_process_stat.R: to process the mean of means and predictor variables for all cities
src/deconvolution/00_test_deconv.R: to generate the deconvolution data for the mobile data (from the 1-s data)
- output:
data/processed/deconv[city].Rdata1 - output:
results/output/deconv_cor_[city].csv
Mobile data
filename | description | R script creating the file --- | --- | --- data/processed/finalmobileData/[city]stat2v3.rds | line segments with mean of mean | processpipeline.R, fun04prodStat.R data/temp/statnew/meanOfmeanpredictorscities.Rdata | datafram with mean of mean and predictor variables | 01edastatmeanOfmean.R newStatpathfname <- "data/temp/statnew/campaignstatcities.Rdata" | dataframe with mean (on driving days and road segment UIDs) with all cities included and information of campaigns available | 01processstat.R campaignRoadInfopathfname <- "data/temp/statnew/campaignRoadInfocities.Rdata" | dataframe with information on whether a road segment in a city has mobile data measured in both campaigns or either of the campaign | 01process_stat.R
Predictor variables
```
predictorpathfname <- paste0('data/temp/roadcentroidPts/ptsPredictors', city, '.Rdata')
# ==> output predictor variables (cleaned)
spatialPtspathfname <- paste0('data/temp/roadcentroidPts/ptsUID', city, '.Rdata')
# ==> output centroid points (sf dataframe) with UID and city available
spatialLinespathfname <- paste0('data/temp/roadcentroidPts/linesUID_', city, '.Rdata')
# ==> output line segment (sf dataframe) with UID and city available
```
01_process_stat.R
data/temp/stat_new/campaign_stat_cities.Rdata: contains the averages on driving days and road segments from all cities.
01_eda_stat_meanOfmean.R
data/temp/stat_new/meanOfmean_predictors_cities.Rdata: contains all the mean of mean and predictor variables
Visualization
vis_mobileData.R: Provides visualizations specific to mobile data analysis, including trends and statistical summaries across different cities.
vis_paper_3-3_modelPerformance_externalValidation.R: Generates visualizations for model performance and external validation, focusing on geographical and statistical data representations across various international datasets.
External evaluation
vis_external_validation_new.r: Visualizes external validation results, including scatter plots and error matrices for different models and locations.
vis_paper_3-3_modelPerformance_externalValidation.R: This script is responsible for generating comprehensive visualizations that assess the measurements from the external evaluation data. It focuses on presenting both geographical and statistical data representations, which are crucial for understanding model behavior across different international datasets (e.g. Switzerland, Rome, Amsterdam, Augsburg, Claire). The code for vusalizing the results was outdated (which were commented out). The updated code for creating visual for external validation can be found in the visexternalvalidation_new.r.
src/01_LUR
03_LUR_5fold_v1.r: 5-fold CV using different subset of training and validation (city-specific, pooled, pooled deconv)
01_process_predictors.R: Processes predictor data files from GEE for various cities.
03_LUR_5fold_v1.r: Executes a 5-fold cross-validation strategy to evaluate LUR models, accommodating different subsets of training and validation data, including city-specific, pooled, and pooled deconvolution scenarios.
03_LUR_city.R: Analyzes and models local urban regression (LUR) specifically tailored to individual city datasets.
03_LUR_global_LOAOCV_v1.R: Implements a global Leave-One-Area-Out Cross-Validation (LOAOCV) strategy for LUR models to assess model robustness across different geographic areas.
04_LUR_global-model_performance.R: Evaluates the performance of global LUR models, providing insights into their effectiveness and accuracy across multiple regions.
src/02_vis
eda_ts_decomposition.R: process and explore making the model temporally detrended.
License
This project is licensed under the terms of the GNU License
Citation
Owner
- Name: Youchen Shen
- Login: co822ee
- Kind: user
- Location: Utrecht, the Netherlands
- Company: Utrecht University
- Repositories: 1
- Profile: https://github.com/co822ee
Citation (CITATION.md)
Please cite this project as follows: Youchen Shen; Jules Kerckhoffs; Massimo Stafoggia; Federica Nobile; Giorgio Cattani; Annette Peters; Kathrin Wolf; Josef Cyrys; Simonas Kecorius ; Cathryn Tonne; Magdalena Janc; Kinga Polanska; Marloes Eeftens; Medea Imboden; Benjamin Flückiger; Konstantina Dimakopoulou; Klea Katsouyanni; Evangelia Samoli; Kees de Hoogh; Gerard Hoek; Roel Vermeulen, airview_ufp - version 0.1.0 https://github.com/co822ee/airview_ufp. Shen, Y., Kerckhoffs, J., Stafoggia, M., Nobile, F., Cattani, G., Peters, A., Wolf, K., Cyrys, J., Kecorius, S., Tonne, C., Janc, M., Polanska, K., Eeftens, M., Imboden, M., Flückiger, B., Dimakopoulou, K., Katsouyanni, K., Samoli, E., Hoogh, K. de, Hoek, G., Vermeulen, R., 2025. Europe-wide spatial models for ultrafine particles in particle number concentrations based on mobile monitoring (in preparation).
GitHub Events
Total
- Release event: 1
- Push event: 5
- Create event: 3
Last Year
- Release event: 1
- Push event: 5
- Create event: 3