chartextractor
ChartExtractor uses computer vision to convert images of paper charts to digital data.
https://github.com/paper-chart-extraction-project/chartextractor
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 1 DOI reference(s) in README -
✓Academic publication links
Links to: ncbi.nlm.nih.gov, ieee.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.1%) to scientific vocabulary
Keywords
Repository
ChartExtractor uses computer vision to convert images of paper charts to digital data.
Basic Info
- Host: GitHub
- Owner: Paper-Chart-Extraction-Project
- License: gpl-3.0
- Language: Python
- Default Branch: main
- Homepage: https://paper-chart-extraction-project.github.io/ChartExtractor/
- Size: 14.4 MB
Statistics
- Stars: 3
- Watchers: 1
- Forks: 2
- Open Issues: 5
- Releases: 1
Topics
Metadata Files
README.md
ChartExtractor
ChartExtractor is a computer vision program that extracts data from smartphone images of paper medical charts.
Why Does This Exist?
The availability of readily accessible, digital medical data offers two major benefits to institutions: it provides real-time access for informed medical decisions and retrospective data for research. This data allows researchers to:
- Study statistical associations to uncover socioeconomic, genetic, and environmental factors of health.
- Quantify how medicine is being used and how practice varies geographically.
- Create retrospective studies that can then be used for prospective randomized control trials to study causality.
EMRs are now widespread in the west, with 96% adoption in American hospitals and 78% in office-based clinics National Trends in Hospital and Physician Adoption of Electronic Health Records. Their adoption has allowed groups like MPOG and NACOR to form, which amalgamate massive datasets of medical records. These datasets enable researchers to perform large meta-analyses on millions of data points, including studies of uncommon and poorly understood conditions by providing enough data for statistically significant analysis.
Additionally, a physician caring for a patient recovering in an ICU can easily access a hyperlinked version of their surgical case and history, speeding up the application of interventions or tests.
Although these benefits are exceptionally valuable, adopting electronic medical records (EMRs) has four distinct challenges Challenges of Implementing Electronic Health Records in Resource-Limited Settings:
- EMRs are costly.
- EMRs require training and human capital to implement and maintain.
- EMRs require sufficient electrical and internet infrastructure to function effectively.
- EMRs necessitate significant workflow changes.
Due to these challenges, developing nations have been slow to adopt EMRs.
History
ChartExtractor, a project launched in 2019, offers a non-EMR solution to bridge the gap between having no EMR system, and having a full EMR system. This free, easily accessible smartphone application digitizes paper charts, requiring minimal changes to clinical operations.
This project began in 2019 as a capstone project as a data science master's capstone at the University of Virginia. The goal of this project was to read the data from an paper anesthesia record from the University Teaching Hospital in Kigali, Rwanda using only a smartphone image of it. Successive years brought new groups who each assessed the feasibility of a new part of the chart, and improved on the work of the previous groups [1], [2], [3], [4].
All this work, however, was for one side of a single chart, at a single site. So, in 2023, we decided to prospectively create our own chart that implemented a workaround the most difficult part of the traditional handwritten anesthesia records: handwritten words.
This repository houses the computer vision code designed to work on the chart that we created. When this repository is finished, a site needs only to adopt the chart in order to have a machine readable anesthesia chart.
The Paper Anesthesia Record
While we hope to expand to charts from the ICU and beyond, the first chart this program will operate on is the anesthesia record. The anesthesia record has two sides, one to record preoperative and postoperative information, and the other dedicated to only intraoperative data.
Front

Back

Owner
- Name: Paper Chart Extraction Project
- Login: Paper-Chart-Extraction-Project
- Kind: organization
- Location: United States of America
- Repositories: 1
- Profile: https://github.com/Paper-Chart-Extraction-Project
Repositories for training and using computer vision to extract data from paper medical charts.
Citation (CITATION.cff)
# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!
cff-version: 1.2.0
title: ChartExtractor
message: 'To cite this work, please use the metadata in this file.'
type: software
authors:
- given-names: Ryan
email: vcz2aj@uvahealth.org
affiliation: University of Virginia School of Medicine
orcid: 'https://orcid.org/0009-0005-2993-6499'
family-names: Folks
- given-names: Charbel
family-names: Marche
email: qdy4zt@virginia.edu
affiliation: University of Virginia School of Data Science
- given-names: Hannah
family-names: Valenty
email: unc6kr@virginia.edu
affiliation: University of Virginia School of Data Science
- given-names: Matthew
family-names: Beck
affiliation: University of Virginia School of Data Science
email: mtb2tgk@virginia.edu
repository-code: >-
https://github.com/Paper-Chart-Extraction-Project/ChartExtractor
url: 'https://github.com/Paper-Chart-Extraction-Project'
repository: >-
https://github.com/Paper-Chart-Extraction-Project/ChartExtractorSupplements
abstract: >-
ChartExtractor is a piece of software used to digitize
smartphone photographs of paper anesthesia records.
keywords:
- Computer Vision
- Artificial Intelligence
- Global Health
license: GPL-3.0
GitHub Events
Total
- Create event: 32
- Release event: 1
- Issues event: 9
- Delete event: 23
- Issue comment event: 14
- Push event: 119
- Pull request review comment event: 2
- Pull request review event: 4
- Pull request event: 57
- Fork event: 1
Last Year
- Create event: 32
- Release event: 1
- Issues event: 9
- Delete event: 23
- Issue comment event: 14
- Push event: 119
- Pull request review comment event: 2
- Pull request review event: 4
- Pull request event: 57
- Fork event: 1
Issues and Pull Requests
Last synced: 7 months ago
All Time
- Total issues: 5
- Total pull requests: 18
- Average time to close issues: 7 days
- Average time to close pull requests: about 2 hours
- Total issue authors: 2
- Total pull request authors: 2
- Average comments per issue: 0.0
- Average comments per pull request: 0.06
- Merged pull requests: 16
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 5
- Pull requests: 18
- Average time to close issues: 7 days
- Average time to close pull requests: about 2 hours
- Issue authors: 2
- Pull request authors: 2
- Average comments per issue: 0.0
- Average comments per pull request: 0.06
- Merged pull requests: 16
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- RyanDoesMath (10)
- sahilkaushal006 (1)
Pull Request Authors
- RyanDoesMath (45)
- charbelmarche33 (6)
- hvalenty (1)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- actions/checkout v4 composite
- actions/deploy-pages v4 composite
- actions/setup-python v5 composite
- actions/upload-pages-artifact v3 composite
- furo *
- sphinx_rtd_theme *
- pillow *