Plotrr

Plotrr: Functions for making visual exploratory data analysis with nested data easier. - Published in JOSS (2017)

https://github.com/cdcrabtree/plotrr

Science Score: 95.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in JOSS metadata
  • Academic publication links
    Links to: joss.theoj.org, zenodo.org
  • Committers with academic emails
    1 of 4 committers (25.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
    Published in Journal of Open Source Software

Keywords

exploratory-data-analysis helpers r visualization

Scientific Fields

Engineering Computer Science - 40% confidence
Last synced: 6 months ago · JSON representation

Repository

Functions for making visual exploratory data analysis with nested data easier.

Basic Info
  • Host: GitHub
  • Owner: cdcrabtree
  • License: other
  • Language: R
  • Default Branch: master
  • Homepage:
  • Size: 76.2 KB
Statistics
  • Stars: 3
  • Watchers: 3
  • Forks: 2
  • Open Issues: 0
  • Releases: 2
Topics
exploratory-data-analysis helpers r visualization
Created about 9 years ago · Last pushed about 8 years ago
Metadata Files
Readme Changelog License

README.md

Build Status Build status DOI status Rdoc

plotrr: Functions for Making Visual Exploratory Data Analysis with Nested Data Easier.

Social scientists can improve their research by conducting exploratory data analysis (EDA) (Tukey 1977). The benefits of EDA include: ''maximiz[ing] insight into a data set; uncover[ing] underlying structure; extract[ing] important variables; detect[ing] outliers and anomalies; test[ing] underlying assumptions; develop[ing] parsimonious models; and determin[ing] optimal factor settings'' (NIST/SEMATECH 2012). Despite these benefits, scholars infrequently conduct EDA. One possible explanation for this is because it takes additional time to do so; it is often easier to move straight to confirmatory analysis.

The time concern is particularly an issue for researchers who use nested data. The issue here is that most existing EDA software routines visualize relationships based on the pooled data. Few existing functions help scholars easily visualize relationships within groups/units.

plotrr helps address this issue by providing several functions that make visual EDA easier to conduct. The focus of many of the package's functions is to create plots that can help researchers explore relationships within nested data. Among other things, these functions can help scholars assess the extent to which expected relationships between variables occur in specific cases. bivarplots creates a bivariate plot for every group/unit in the data, dotplots creates a dot plot for every group/unit, and violinplots creates a violin plot for every group/unit.

As demonstrated in Crabtree and Nelson (2017), creating and interpreting plots like this this can help scholars find initial support for their theoretical expectations prior to conducting analysis with pooled data. The intuition here is that researchers can check their initial priors about relationships within cases. When the data support those priors, scholars have some additional evidence that the processes they theorize actually occur in the real world.

In addition to these functions, the package also includes histplots, which creates histograms of a measure for each group/unit, and bivarrugplot, which returns a plot of the bivariate relationship between two measures alongside a rugplot of each measure.

Finally, the package also contains several "helper," or convenience, functions. clear effectively clears the R terminal. lengthunique calculates the number of uniques values in a vector. makefacnum converts factor vectors numeric vectors.

Package Installation

The latest development version (1.0.0) is on GitHub can be installed using devtools.

if(!require("ghit")){ install.packages("ghit") } ghit::install_github("cdcrabtree/nomine")

Support or Contact

Please use the issue tracker for problems, questions, or feature requests. If you would rather email with questions or comments, you can contact Charles Crabtree and he will address the issue.

If you would like to contribute to the package, that is great! We welcome pull requests and new developers.

Tests

To test the software, users and potential contributors can use the example code provided in the documentation for each function.

Thanks

Thanks to Karl Broman and Hadley Wickham for providing excellent free guies to building R packages.

References

  • Trochim, William M. K., and James P. Donnelly. 2008. Research Methods Knowledge Base. New York, NY: Cengage Learning.

Owner

  • Name: Charles Crabtree
  • Login: cdcrabtree
  • Kind: user
  • Location: Tokyo | Hanover | EU
  • Company: Dartmouth College

Assistant Professor of Government @dartmouth. #discrimination, #inequality, #experiments, #measurement | #FirstGen

JOSS Publication

Plotrr: Functions for making visual exploratory data analysis with nested data easier.
Published
March 05, 2017
Volume 2, Issue 11, Page 190
Authors
Charles Crabtree ORCID
University of Michigan
Michael J. Nelson ORCID
Pennsylvania State University
Editor
Pjotr Prins ORCID
Tags
exploratory data analysis nested data visualization

GitHub Events

Total
Last Year

Committers

Last synced: 7 months ago

All Time
  • Total Commits: 94
  • Total Committers: 4
  • Avg Commits per committer: 23.5
  • Development Distribution Score (DDS): 0.117
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Charles Crabtree c****s@g****m 83
Karl Broman k****n@g****m 8
Charles Crabtree c****r@u****u 2
cdcrabtree c****e@c****t 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 3
  • Total pull requests: 2
  • Average time to close issues: about 1 hour
  • Average time to close pull requests: 37 minutes
  • Total issue authors: 1
  • Total pull request authors: 1
  • Average comments per issue: 1.0
  • Average comments per pull request: 0.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • kbroman (3)
Pull Request Authors
  • kbroman (2)
Top Labels
Issue Labels
Pull Request Labels