oir

Internal package to standardize preliminary data analysis, including common initial exploratory feedback

https://github.com/nmedina17/oir

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.8%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Internal package to standardize preliminary data analysis, including common initial exploratory feedback

Basic Info
  • Host: GitHub
  • Owner: nmedina17
  • License: mit
  • Language: R
  • Default Branch: main
  • Homepage:
  • Size: 260 KB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 7
  • Releases: 2
Created over 4 years ago · Last pushed over 2 years ago
Metadata Files
Readme License Citation

README.md

oir

Internal package to standardize preliminary data analysis and integrate analytical feedback

Encodes common statistical decision trees, for a list of response variables for a project. Cite via Zenodo DOI -- DOI

Requires input data tbl to be "nested" into 2 columns -- 1. "variable", a vector of response variable names, and 2. "varData", a list of

This can be done with the 2 code lines, acting on typically cleaned "wide" data (i.e. response variables as columns), with dataset's independent variable keys also as columns (e.g. "plot", treatment", etc.) --

e.g.

cleanData <- dplyr::tibble( "plot" = c(1, 2, 3), "treat" = c(1, 2, 3), "var1" = c(1, 2, 3), "var2" = c(1, 2, 3) #, [...] )

varTbl <- cleanData %>% tidyr::pivotlonger(namesto = "variable", values_to = "value") %>% purr::nest(cols = !c(-variable)).

then processed using --

statFormula <- value ~ [independent variable of interest] resultsTbl <- varTbl %>% oir::getStatsTbl(formula = statFormula)

Notes --

  • best used with simple model formula, i.e. 1 independent variable in "statFormula" (at a time)
  • the main oir::getStatsTbl() function has versions "1", "2", and "12" to run --
    1. linear regressions at plot-/group-level median centers ("1")
    2. non-linear version of regression (not nlme::) modifying independent variable with stats::poly(x, degree = 2, at finest row level ("2")
    3. the same non-linear version of regression, but using plot-/group-level centers ("12")
  • more complex formulas (e.g. lmer) can be processed, but with limited insight into final appropriate p-values
  • non-parametric option included as back-up test for usable p-value, but may offer slightly different inference/interpretation

Owner

  • Name: Nicholas Medina, Ph.D.
  • Login: nmedina17
  • Kind: user
  • Location: Chicago, IL, USA

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Medina"
  given-names: "Nicholas"
  orcid: "https://orcid.org/0000-0001-5465-3988"
title: "Code for: Automating statistical decision trees for data exploration and analysis revision workflow"
version: 0.9.1
doi: 10.5281/zenodo.6800299
date-released: 2022-07-05
url: "https://github.com/nmedina17/oir"

GitHub Events

Total
Last Year

Dependencies

DESCRIPTION cran
  • tidymodels * depends
  • tidyverse * depends
  • class * imports
  • devtools * imports
  • doParallel * imports
  • dplyr * imports
  • furrr * imports
  • future * imports
  • ggbeeswarm * imports
  • gginnards * imports
  • ggplot2 * imports
  • ggpmisc * imports
  • ggpubr * imports
  • glue * imports
  • here * imports
  • lmerTest * imports
  • magrittr * imports
  • poweRlaw * imports
  • rpart * imports
  • rstatix * imports
  • survival * imports
  • tictoc * imports
  • vegan * imports