https://github.com/carlos1971salud/dengai-predicting-disease-spread
https://github.com/carlos1971salud/dengai-predicting-disease-spread
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.7%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: Carlos1971Salud
- License: mit
- Language: Jupyter Notebook
- Default Branch: main
- Size: 16.1 MB
Statistics
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
DengAI: Predicting Disease Spread
Overview
This project predicts local epidemics of dengue fever to help fight life-threatening pandemics. This project was created as an entry to the DengAI: Predicting Disease Spread competition hosted by Driven Data.
The project predicts the number of Dengue Fever cases reported each week in the following locations:
- San Juan (Puerto Rico)
- Iquitos (Peru)
Data
The predictor variables include environmental variables describing changes in temperature, precipitation, vegitation, and more.

Usage
To allow interactive visualisation of the total cases over time by city, we built a simple Streamlit app. Click here to discover patterns in the data!
Predictive Models
This is a time series project using Random Forest and Negative Binomial regression models to predict the total cases of Dengue fever over time in the different cities.
We used the Mean Squared Error (MSE) metric for evaluating the model.
Repository Structure
data/- Contains the training and test datasets.notebooks/- Jupyter notebooks with exploratory data analysis and model development.submissions/- Prediction files ready for submission to the competition.images/- Visualizations generated during analysis, including the Scatterplot of total cases by city.
Development
Clone the repository, install the required packages listed in requirements.txt, and run the Jupyter notebooks to replicate the analysis and predictions.
Contributions
Contributions are welcome. Please open an issue or pull request if you would like to contribute to the project.
License
This project is open-source and available under the MIT license.
Owner
- Login: Carlos1971Salud
- Kind: user
- Repositories: 1
- Profile: https://github.com/Carlos1971Salud
GitHub Events
Total
Last Year
Dependencies
- pandas ==2.2.1
- plotly ==5.19.0
- streamlit ==1.32.0