https://github.com/cdcgov/prime-public-health-data-infrastructure
Repository for the joint CDC+USDS PRIME project's Data Ingestion prototype project. The goal is to work with raw vaccine, case, and lab report data to arrive at an analysis of breakthrough cases in the state of Virginia that uses a data lake as underlying storage.
https://github.com/cdcgov/prime-public-health-data-infrastructure
Science Score: 13.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (12.6%) to scientific vocabulary
Keywords
Repository
Repository for the joint CDC+USDS PRIME project's Data Ingestion prototype project. The goal is to work with raw vaccine, case, and lab report data to arrive at an analysis of breakthrough cases in the state of Virginia that uses a data lake as underlying storage.
Basic Info
Statistics
- Stars: 8
- Watchers: 6
- Forks: 5
- Open Issues: 2
- Releases: 0
Topics
Metadata Files
docs/README.md
PRIME Public Health Data Infrastructure
General disclaimer This repository was created for use by CDC programs to collaborate on public health related projects in support of the CDC mission. GitHub is not hosted by the CDC, but is a third party website used by CDC and its partners to share information and collaborate on software. CDC use of GitHub does not imply an endorsement of any one particular service, product, or enterprise.
Overview
The PRIME Public Health Data Infrastructure projects are part of the Pandemic-Ready Interoperability Modernization Effort, a multi-year collaboration between CDC and the U.S. Digital Service (USDS) to strengthen data quality and information technology systems in state and local health departments.
This repository represents the source code for three related workstreams in this area:
- Data Storage, Tooling, and Preparation (DSTP) - Assist STLTs in implementing modern infrastructure that is flexible enough to allow jurisdictions to prepare for as-of-yet unknown use cases, while providing a long-term storage mechanism and query solution for effective daily use and timely response.
- Common Data Model - Define USCDI+ for Public Health and implementation guidance that specifies common data elements, value sets, and APIs to enable interoperability, data cleaning, and data linkage across the ecosystem of public health data senders and receivers
- Workbench - Develop a platform for high quality tools, services, analytical products, and reporting that addresses identified pain points for data receivers/public health departments, and incentivizes adoption of the Common Data Model
All three projects will begin with a time-limited prototype, focused on working with raw ELR, eCR, and VXU data from the commonwealth of Virginia Department of Health, for the purpose of experimenting with a unified data lake paradigm. Over time they will evolve to tackle new STLTs and new challenges around data quality and standardization.
The PRIME Public Health Data Infrastructure prototype a sibling project to PRIME ReportStream, focusing on delivering COVID-19 test data to public health departments, and PRIME SimpleReport, working on a better way to report COVID-19 rapid tests.
Problem Scope
Long-term Vision for all three workstreams: Public health systems to digest, analyze, and respond to data are siloed. Lacking access to actionable data, our national, as well as state, local, and territorial infrastructure, isn’t pandemic-ready. Our objective was to learn how the CDC can best support STLTs in moving towards a modern public health data infrastructure.
Vision for short-term prototype: The Commonwealth of Virginia Department of Public Health would like to move towards a more ‘holistic’ solution for electronic data processing and integration, so that currently siloed datasets can be processed and ultimately used in a more efficient and effective manner. Specifically, in order to support the vision of ‘holistic processing’, this prototype is focusing the on seeing what can be done with raw data, focused on COVID-19 breakthrough cases.
Target Users
Eventual target users of this system include:
- Public Health Departments
- Epidemiologists who rely on health data to take regular actions
- Senior stakeholders who make executive decisions using aggregate health data
- IT teams who have to support epidemiologists and external stakeholders integrating with the PHD
- PHDs may include state, county, city, and tribal organizations
Technical
Getting Started
To start development against this project, please consult the Getting Started doc for more details on setting up local development environments, deployments, and more.
Decision Records
See A record of all decisions that have been made on the project and propose your own by going to the decisions directory and following instructions there.
Standard Notices
Public Domain Standard Notice
This repository constitutes a work of the United States Government and is not subject to domestic copyright protection under 17 USC § 105. This repository is in the public domain within the United States, and copyright and related rights in the work worldwide are waived through the CC0 1.0 Universal public domain dedication. All contributions to this repository will be released under the CC0 dedication. By submitting a pull request you are agreeing to comply with this waiver of copyright interest.
License Standard Notice
This project is in the public domain within the United States, and copyright and related rights in the work worldwide are waived through the CC0 1.0 Universal public domain dedication. All contributions to this project will be released under the CC0 dedication. By submitting a pull request or issue, you are agreeing to comply with this waiver of copyright interest and acknowledge that you have no expectation of payment, unless pursuant to an existing contract or agreement.
Privacy Standard Notice
This repository contains only non-sensitive, publicly available data and information. All material and community participation is covered by the Disclaimer and Code of Conduct. For more information about CDC's privacy policy, please visit http://www.cdc.gov/other/privacy.html.
Contributing Standard Notice
Anyone is encouraged to contribute to the repository by forking and submitting a pull request. (If you are new to GitHub, you might start with a basic tutorial.) By contributing to this project, you grant a world-wide, royalty-free, perpetual, irrevocable, non-exclusive, transferable license to all users under the terms of the Apache Software License v2 or later.
All comments, messages, pull requests, and other submissions received through CDC including this GitHub page may be subject to applicable federal law, including but not limited to the Federal Records Act, and may be archived. Learn more at http://www.cdc.gov/other/privacy.html.
See CONTRIBUTING.md for more information.
Records Management Standard Notice
This repository is not a source of government records, but is a copy to increase collaboration and collaborative potential. All government records will be published through the CDC web site.
Related documents
- Open Practices
- Rules of Behavior
- Thanks and Acknowledgements
- Disclaimer
- Contribution Notice
- Code of Conduct
Additional Standard Notices
Please refer to CDC's Template Repository for more information about contributing to this repository, public domain notices and disclaimers, and code of conduct.
Owner
- Name: Centers for Disease Control and Prevention
- Login: CDCgov
- Kind: organization
- Email: data@cdc.gov
- Location: Atlanta, GA
- Website: http://open.cdc.gov/
- Twitter: CDCgov
- Repositories: 114
- Profile: https://github.com/CDCgov
CDC's collaborative software projects to protect America from health, safety, and security threats, both foreign and in the U.S.
GitHub Events
Total
- Fork event: 2
Last Year
- Fork event: 2
Issues and Pull Requests
Last synced: over 1 year ago
All Time
- Total issues: 7
- Total pull requests: 159
- Average time to close issues: about 1 month
- Average time to close pull requests: 6 days
- Total issue authors: 3
- Total pull request authors: 16
- Average comments per issue: 1.0
- Average comments per pull request: 0.94
- Merged pull requests: 139
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- JosiahSiegel (3)
- bryan-skylight (2)
- jeremy-page (2)
Pull Request Authors
- ghost (23)
- spencer-kathol (22)
- bryan-skylight (19)
- DanPaseltiner (18)
- JosiahSiegel (16)
- NatHillardUSDS (16)
- bamader (13)
- nickclyde (9)
- KennethSkylight (6)
- bmaden-gov (5)
- VincentLaUSDS (5)
- JSkeets (2)
- snesm (1)
- gskathol (1)
- VincentLa (1)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- azure/login 1f63701bf3e6892515f1b7ce2d2bf1708b46beaf composite
- Azure/functions-action v1 composite
- actions/setup-python v1 composite
- ./.github/actions/az-tf * composite
- actions/checkout dcd71f646680f2efd8db4afa5ad64fdcba30e748 composite
- josiahsiegel/terraform-stats 0ea7b86f08699ad7b106e6c18195a9921c8b3214 composite
- slackapi/slack-github-action 34c3fd73326693ef04728f8611669d918a2d781d composite
- ./.github/actions/az-tf * composite
- ./.github/actions/function-app-deploy * composite
- ./.github/actions/terraform-deploy * composite
- actions/checkout v2 composite
- ./.github/actions/az-tf * composite
- ./.github/actions/function-app-deploy * composite
- ./.github/actions/terraform-deploy * composite
- actions/checkout v2 composite
- ./.github/actions/az-tf * composite
- ./.github/actions/function-app-deploy * composite
- ./.github/actions/terraform-deploy * composite
- actions/checkout v2 composite
- JosiahSiegel/AzViz-action v1.0.4 composite
- actions/checkout v2 composite
- actions/upload-artifact v2 composite
- azure/login v1 composite
- ./.github/actions/create-tags * composite
- actions/checkout v2 composite
- azure/login v1 composite
- ResearchSoftwareActions/EnsureCleanNotebooksAction 1.1 composite
- actions/checkout v2 composite
- actions/setup-python v2 composite
- hashicorp/setup-terraform v1 composite
- azure-functions *
- azure-identity *
- azure-keyvault-secrets *
- azure-storage-blob *
- requests *
- azure-functions *
- azure-identity *
- azure-storage-blob *
- azure.keyvault.secrets *
- ping3 *
- psycopg2-binary *
- black * development
- flake8 * development
- mypy * development
- pytest * development
- azure-functions *
- azure-identity *
- azure-storage-blob *
- hl7 *
- requests *
- smartystreets_python_sdk *
- typer *
- urllib3 *
- pytest * development