usa_births_deaths_stats
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.9%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: neiladhikari
- Language: Jupyter Notebook
- Default Branch: main
- Size: 626 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
USA Birth and Death statistics.
This data is complete as of: 2022-07-29. The objective is to use pandas for web scraping, to get national vital statistics data from the CDC website, specifically births and deaths by US states and territories. After obtaining the required data, we will use pandas for getting some basic stats, and matplotlib for data visualization. Another objective is to merge this dataset with the total population dataset, trying out left join, right join, join on, and other techniques. More analyses will be conducted after this data join/ merge. Currently available data summary is for 2020 only, and that is what this url (used in the jupyter notebook) will pull. This is the most complete data, not most current. The same url will probably report the most complete numbers in the future as well. If you want, the most current vital statistics data can be requested from each state. The process for each state varies, and is a different topic for discussion for some other time. In this case, the CDC data is tabular, so we will try to use pandas to directly read in the table, instead of going through beautifulsoup.
Formulas used in this analysis:
1) Fertility Rate = Number of births per 1000 women between 15-44 years of age. 2) Crude Birth Rate = Number of births per 1000 individuals of a population. i.e. (Total Births * 1000)/ population. 3) Death Rate = Number of deaths per 100,000 individuals. i.e. (Total Deaths * 100000)/ population. 4) Percent Births = (Total Births * 100) / population. 5) Percent Deaths = (Total Deaths * 100) / population.
Useful links: 1) https://wonder.cdc.gov/controller/datarequest/D76;jsessionid=9CC7C52E44642DAFCE5BCA2CBAC9 2) https://www.cdc.gov/publichealthgateway/healthdirectories/healthdepartments.html
Citation: Centers for Disease Control and Prevention, National Center for Health Statistics. National Vital Statistics System, Mortality 1999-2020 on CDC WONDER Online Database, released in 2021. Data are from the Multiple Cause of Death Files, 1999-2020, as compiled from data provided by the 57 vital statistics jurisdictions through the Vital Statistics Cooperative Program. Accessed at http://wonder.cdc.gov/ucd-icd10.html on Jul 29, 2022 8:19:19 PM
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this code, please cite it as below." authors: - family-names: "Adhikari" given-names: "Neil" orcid: "https://orcid.org/0000-0002-2935-1870" title: "USA Birth and Death Statistics" version: 1.0 date-released: 2022-08-04 url: "https://github.com/neiladhikari/usa_births_deaths_stats"