Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (7.6%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Basic Info
  • Host: GitHub
  • Owner: Carlos1971Salud
  • Language: Jupyter Notebook
  • Default Branch: main
  • Size: 25.7 MB
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created 7 months ago · Last pushed 7 months ago
Metadata Files
Readme Citation

README.md

A reproducible ensemble machine learning approach to forecast dengue outbreaks

model

Dengue fever, a prevalent and rapidly spreading arboviral disease, poses substantial public health and economic challenges in tropical and sub-tropical regions worldwide. Predicting infectious disease outbreaks on a countrywide scale is complex due to spatiotemporal variations in dengue incidence across administrative areas. To address this, we propose a machine learning (ML) ensemble model for forecasting the dengue incidence rate (DIR) in Brazil, with a focus on the population under 19 years old. The model integrates spatial and temporal information, providing one-month-ahead DIR estimates at the state level. Comparative analyses with a dummy model and ablation studies demonstrate the ensemble model's qualitative and quantitative efficacy across the 27 Brazilian Federal Units. Furthermore, we showcase the transferability of this approach to Peru, another Latin American country with differing epidemiological characteristics. This timely forecast system can aid local governments in implementing targeted control measures. The study advances climate services for health by identifying factors triggering dengue outbreaks in Brazil and Peru, emphasizing collaborative efforts with intergovernmental organizations and public health institutions. The innovation lies not only in the algorithms themselves but in their application to a domain marked by data scarcity and operational scalability challenges. We bridge the gap by integrating well-curated ground data with advanced analytical methods, addressing a significant deficiency in current practices. The successful transfer of the model to Peru and its consistent performance during the 2019 outbreak in Brazil showcase its scalability and practical application. While acknowledging limitations in handling extreme values, especially in regions with low DIR, our approach excels where accurate predictions are critical. The study not only contributes to advancing DIR forecasting but also represents a paradigm shift in integrating advanced analytics into public health operational frameworks. This work, driven by a collaborative spirit involving intergovernmental organizations and public health institutions, sets a precedent for interdisciplinary collaboration in addressing global health challenges. It not only enhances our understanding of factors triggering dengue outbreaks but also serves as a template for the effective implementation of advanced analytical methods in public health.

Guide

  1. Installation guide
  2. Configuration
  3. Dataset
  4. Run our code

Cite our papers

  1. Schneider, R., Sebastianelli, A., Spiller, D., Wheeler, J., Carmo, R., Nowakowski, A., ... & Lowe, R. (2021). Climate-based ensemble machine learning model to forecast Dengue epidemics (papers track). In Thirty-eighth International Conference on Machine Learning (ICML) 2021.

    @inproceedings{schneider2021climate,
        title={Climate-based ensemble machine learning model to forecast Dengue epidemics (papers track)},
        author={Schneider, Rochelle and Sebastianelli, Alessandro and Spiller, Dario and Wheeler, James and Carmo, Raquel and Nowakowski, Artur and Garcia-Herranz, Manuel and Kim, Dohyung and Barlevi, Hanoch and Cordero, Zoraya El Raiss and others},
        booktitle={Thirty-eighth International Conference on Machine Learning (ICML) 2021},
        year={2021}
    }
    
  2. Sebastianelli, A., Spiller, D., Carmo, R., Wheeler, J., Nowakowski, A., ... & Schneider, R. (2024). A reproducible ensemble machine learning approach to forecast dengue outbreaks. In Nature Scientific Reports.

    @inproceedings{sebsatianelli2024reproducible,
        title={A reproducible ensemble machine learning approach to forecast dengue outbreaks},
        author={Alessandro Sebastianelli, Dario Spiller, Raquel Carmo, James Wheeler, Artur Nowakowski, Ludmilla Viana Jacobson, Dohyung Kim, Hanoch Barlevi, Zoraya El Raiss Cordero, Felipe J Colon-Gonzalez, Rachel Lowe, Silvia Liberata Ullo, and Rochelle Schneider}, 
        journal={Nature Scientific Reports},
        year={2024},
    }
    

Owner

  • Login: Carlos1971Salud
  • Kind: user

Citation (CITATION.cff)

# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: ESA-UNICEF Dengue Forecast Project
message: 'If you use this software, please cite it as below.'
type: software
authors:
  - given-names: Alessandro
    family-names: Sebastianelli
    email: alessandro.sebastianelli1995@gmail.com
    orcid: 'https://orcid.org/0000-0002-9252-907X'
  - given-names: Raquel
    orcid: 'https://orcid.org/0000-0003-0808-3480'
    family-names: Carmo
    email: raquelarscarmo@gmail.com
  - given-names: Dario
    family-names: Spiller
    email: dario.spiller@uniroma1.it
    orcid: 'https://orcid.org/0000-0002-6877-3187'
  - given-names: Rochelle
    family-names: Schneider
    orcid: 'https://orcid.org/0000-0002-2905-0154'

GitHub Events

Total
  • Create event: 1
Last Year
  • Create event: 1

Dependencies

code/requirements.txt pypi
  • catboost ==1.0.0
  • contextily ==1.2.0
  • geopandas ==0.10.2
  • jupyterlab ==3.2.0
  • matplotlib ==3.4.3
  • numpy ==1.19.5
  • pandas ==1.3.4
  • scikit-learn ==1.0
  • tensorflow ==2.6.0