growthcleanr

R package for cleaning anthropometric (height and weight) data from electronic health record systems.

https://github.com/carriedaymont/growthcleanr

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.8%) to scientific vocabulary

Keywords

ehr ehr-data r
Last synced: 6 months ago · JSON representation

Repository

R package for cleaning anthropometric (height and weight) data from electronic health record systems.

Basic Info
  • Host: GitHub
  • Owner: carriedaymont
  • License: other
  • Language: R
  • Default Branch: main
  • Homepage:
  • Size: 14.9 MB
Statistics
  • Stars: 14
  • Watchers: 6
  • Forks: 8
  • Open Issues: 25
  • Releases: 9
Topics
ehr ehr-data r
Created over 7 years ago · Last pushed 7 months ago
Metadata Files
Readme Changelog License

README.Rmd

---
output: github_document
---



```{r, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "man/figures/README-",
  out.width = "100%"
)
```

# growthcleanr



[![CRAN status](https://www.r-pkg.org/badges/version/growthcleanr) ](https://cran.r-project.org/package=growthcleanr)
[![R build status](https://github.com/carriedaymont/growthcleanr/workflows/R-CMD-check/badge.svg) ](https://github.com/carriedaymont/growthcleanr/actions)
[![Docker](https://github.com/mitre/growthcleanr/actions/workflows/build-and-publish-image-tag.yml/badge.svg)](https://github.com/mitre/growthcleanr/actions/workflows/build-and-publish-image-tag.yml)



R package for cleaning data from Electronic Health Record systems, focused on
cleaning height and weight measurements.

 This package implements the
[Daymont et al. algorithm](https://academic.oup.com/jamia/article/24/6/1080/3767271),
as specified in Supplemental File 3 within the
[Supplementary Material](https://academic.oup.com/jamia/article/24/6/1080/3767271#97610899)
published with that paper.

> Carrie Daymont, Michelle E Ross, A Russell Localio, Alexander G Fiks, Richard
> C Wasserman, Robert W Grundmeier, Automated identification of implausible
> values in growth data from pediatric electronic health records, Journal of the
> American Medical Informatics Association, Volume 24, Issue 6, November 2017,
> Pages 1080–1087, https://doi.org/10.1093/jamia/ocx037

This package also includes an R version of the
[SAS macro published by the CDC](https://www.cdc.gov/nccdphp/dnpao/growthcharts/resources/sas.htm)
for calculating percentiles and Z-scores of pediatric growth observations and
utilities for working with both functions. As of summer 2021, it also supports
cleaning anthropometric measurements for adults up to age 65. The adult
algorithm has not yet been published in a peer-reviewed publication, but is
described in detail at
[Adult algorithm](https://carriedaymont.github.io/growthcleanr/articles/adult-algorithm.html).

## Installation

To install the stable version from CRAN:

```{r, eval = FALSE}
install.packages("growthcleanr")
```

## Summary

The `growthcleanr` package processes data prepared in a specific format to
identify biologically implausible height and weight measurements. It bases these
evaluations on techniques which use patient-specific longitudinal analysis and
variations from published growth trajectory charts for pediatric subjects. These
techniques are performed in a specific order which refines and improves results
throughout the process.

Results from `growthcleanr` include a flag for each measurement indicating
whether it is to be included or excluded based on plausibility, with a variety
of specific types of exclusions identified distinctly. These flags can be
analyzed further by researchers studying anthropometric EHR data to determine
which measurements to include or exclude in their own studies. No values are
deleted or otherwise removed; each is only flagged in a new column.

To start running `growthcleanr`, an R installation with a variety of additional
packages is required, as is a growth measurement dataset prepared for use in
`growthcleanr`.

The rest of this documentation includes:

### Getting started:

- [Quickstart](https://carriedaymont.github.io/growthcleanr/articles/quickstart.html),
  a brief tour of using growthcleanr, including data preparation
- [Installation](https://carriedaymont.github.io/growthcleanr/articles/installation.html),
  options for installing growthcleanr, with notes on specific platforms and
  source-level installation for developers
- [Usage](https://carriedaymont.github.io/growthcleanr/articles/usage.html),
  examples of cleaning data, multiple options, example data

### Advanced topics:

- [Configuration options](https://carriedaymont.github.io/growthcleanr/articles/configuration.html),
  changing growthcleanr operational settings
- [Understanding growthcleanr output](https://carriedaymont.github.io/growthcleanr/articles/output.html),
  the exclusion types growthcleanr identifies
- [Adult algorithm](https://carriedaymont.github.io/growthcleanr/articles/adult-algorithm.html),
  a detailed description of how growthcleanr assesses observations from adult
  subjects
- [Computing BMI percentiles and Z-scores](https://carriedaymont.github.io/growthcleanr/articles/utilities.html),
  additional functions for common data transforms and determining percentiles
  and Z-scores using the CDC method
- [Working with large datasets](https://carriedaymont.github.io/growthcleanr/articles/large-data-sets.html),
  notes and suggestions for running `growthcleanr` with large data sources
- [Next steps](https://carriedaymont.github.io/growthcleanr/articles/next-steps.html),
  notes on potential enhancements to the pediatric and adult algorithms
- [Developer guidelines](https://carriedaymont.github.io/growthcleanr/articles/developer-guidelines.html), advice for contributors to this package, including a CRAN release checklist

## Changes

For a detailed history of released versions, see the
[Changelog](https://carriedaymont.github.io/growthcleanr/news/index.html)
or`NEWS.md`. Tagged releases, starting with 1.2.3 in January 2021, are listed
[at GitHub](https://github.com/carriedaymont/growthcleanr/releases).

Owner

  • Name: Carrie Daymont
  • Login: carriedaymont
  • Kind: user

GitHub Events

Total
  • Watch event: 2
  • Delete event: 5
  • Member event: 1
  • Issue comment event: 4
  • Push event: 8
  • Pull request event: 11
  • Fork event: 1
  • Create event: 5
Last Year
  • Watch event: 2
  • Delete event: 5
  • Member event: 1
  • Issue comment event: 4
  • Push event: 8
  • Pull request event: 11
  • Fork event: 1
  • Create event: 5

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 57
  • Total pull requests: 66
  • Average time to close issues: 3 months
  • Average time to close pull requests: 21 days
  • Total issue authors: 5
  • Total pull request authors: 6
  • Average comments per issue: 0.93
  • Average comments per pull request: 0.67
  • Merged pull requests: 56
  • Bot issues: 0
  • Bot pull requests: 19
Past Year
  • Issues: 0
  • Pull requests: 10
  • Average time to close issues: N/A
  • Average time to close pull requests: 17 days
  • Issue authors: 0
  • Pull request authors: 2
  • Average comments per issue: 0
  • Average comments per pull request: 0.7
  • Merged pull requests: 6
  • Bot issues: 0
  • Bot pull requests: 8
Top Authors
Issue Authors
  • dchud (42)
  • delosh653 (12)
  • kradimer (1)
  • SamuelMcEwan (1)
  • jaskdb (1)
  • kliegl (1)
Pull Request Authors
  • dependabot[bot] (26)
  • delosh653 (24)
  • dchud (20)
  • grundmeier (2)
  • mcanouil (1)
  • molivier-314 (1)
Top Labels
Issue Labels
easy (8) bug (1)
Pull Request Labels
dependencies (26) github_actions (1)

Packages

  • Total packages: 1
  • Total downloads:
    • cran 461 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 1
  • Total versions: 6
  • Total maintainers: 1
cran.r-project.org: growthcleanr

Data Cleaner for Anthropometric Measurements

  • Versions: 6
  • Dependent Packages: 0
  • Dependent Repositories: 1
  • Downloads: 461 Last month
Rankings
Forks count: 8.7%
Stargazers count: 15.8%
Average: 22.5%
Dependent repos count: 23.8%
Dependent packages count: 28.7%
Downloads: 35.5%
Last synced: 6 months ago

Dependencies

.github/workflows/R-CMD-check.yaml actions
  • actions/checkout v3 composite
  • r-lib/actions/check-r-package v2 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
.github/workflows/build-and-publish-image-latest.yml actions
  • VaultVulp/gp-docker-action 1.6.0 composite
  • actions/checkout v2.5.0 composite
.github/workflows/build-and-publish-image-tag.yml actions
  • VaultVulp/gp-docker-action 1.6.0 composite
  • actions/checkout v2.5.0 composite
.github/workflows/pkgdown.yaml actions
  • JamesIves/github-pages-deploy-action v4.4.1 composite
  • actions/checkout v3 composite
  • r-lib/actions/setup-pandoc v2 composite
  • r-lib/actions/setup-r v2 composite
  • r-lib/actions/setup-r-dependencies v2 composite
DESCRIPTION cran
  • R >= 2.10 depends
  • R.utils >= 2.11.0 imports
  • data.table >= 1.13.0 imports
  • doParallel >= 1.0.15 imports
  • dplyr >= 1.0.1 imports
  • foreach >= 1.5.0 imports
  • labelled >= 2.5.0 imports
  • magrittr >= 1.5 imports
  • plyr >= 1.8.6 imports
  • tidyr >= 1.1.0 imports
  • argparser >= 0.6 suggests
  • bit64 >= 4.0.2 suggests
  • knitr >= 1.29 suggests
  • rmarkdown >= 2.3 suggests
  • testthat >= 2.3.2 suggests
Dockerfile docker
  • ghcr.io/rocker-org/tidyverse latest build