Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.8%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Basic Info
  • Host: GitHub
  • Owner: eleoracca
  • License: agpl-3.0
  • Language: Python
  • Default Branch: master
  • Homepage:
  • Size: 62.5 KB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created 11 months ago · Last pushed 8 months ago
Metadata Files
Readme License Citation

README.md

IAQ-AirCareUniTo

Repository for the analysis of data downloaded from the AirCare sensors placed at UniTo.

Setup

To start using this framework, Python and R are required. Please install their dependencies before starting to work with the repository. Then, to correctly set up your working environment, clone the repository and run the following commands in the main folder: NOTE: the scripts were tested in a Linux environment, therefore on other operative systems they might not work.

console git clone https://github.com/eleoracca/IAQ-AirCareUniTo.git cd IAQ-AirCareUniTo bash setup.sh

This will create all the necessary directories and set up the Python and R virtual environments with the required libraries to run the analysis and visualization scripts.

Downloading the data

To download the data, Firefox must be installed. On Linux, the Snap version of Firefox will not work. To start the download, run the following commands from the main folder of the Git repository on your PC:

console source env/bin/activate python scraping/scraper.py deactivate

The script will prompt you with the necessary actions and comments during the execution. Please keep in mind that every time you start the scraper, you need to change the default download folder to the raw-data folder of this repository; otherwise, the data will be saved elsewhere. Also, be sure not to minimize the browser window during the download, as this may cause the download to fail. A compatibility issue was detected and solved by updating one of the Firefox drivers (geckodriver).

After the download is complete, to reorganize the files, run the following command from the main folder of the Git repository on your PC:

console bash scraping/moving.sh

This script will delete any duplicate files and group the files from the same sensors by name into subfolders created by the script itself.

Merging data files

After the download, to improve the usability of the data, it is possible to use the data_extraction.R script in the analysis subfolder. You can execute it through RStudio or from the terminal using the following commands:

console cd analysis Rscript data_extraction.R

Make sure to execute the script from the analysis subfolder. At the end of the script execution, in the directory IAQ-AirCareUniTo/data, you will find one CSV file for each sensor containing all the data downloaded from the site, extracted from the XLS files in each sensor's directory.

Owner

  • Name: Eleonora Racca
  • Login: eleoracca
  • Kind: user

Citation (CITATION.cff)

cff-version: 1.2.0
title: IAQ-AirCareUniTo
message: >-
  If you use this software, please cite it using the
  metadata from this file.
type: software
authors:
  - given-names: Eleonora
    family-names: Racca
    email: e.racca@unito.it
    affiliation: Università degli Studi di Torino
    orcid: 'https://orcid.org/0000-0002-9097-5666'
  - given-names: Emanuele
    family-names: Almerighi
    email: emanuele.almerighi@edu.unito.it
    affiliation: Università degli Studi di Torino
  - given-names: Alberto
    family-names: Mecca
    email: alberto.mecca@cern.ch
    affiliation: Università degli Studi di Torino
    orcid: 'https://orcid.org/0000-0003-2209-2527'
repository-code: 'https://github.com/eleoracca/IAQ-AirCareUniTo'
abstract: >-
  Repository to download and analyse the indoor environmental quality data of
  indoor air quality sensors installed in the University of Turin.
url: "https://github.com/eleoracca/IAQ-AirCareUniTo"

GitHub Events

Total
  • Delete event: 1
  • Push event: 13
  • Create event: 3
Last Year
  • Delete event: 1
  • Push event: 13
  • Create event: 3

Dependencies

requirements.txt pypi
  • beautifulsoup4 ==4.13.3
  • python_dateutil ==2.9.0.post0
  • selenium ==4.30.0