oir

Internal package to standardize preliminary data analysis, including common initial exploratory feedback

https://github.com/nmedina17/oir

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
✓
Academic publication links
Links to: zenodo.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (9.8%) to scientific vocabulary

Last synced: 10 months ago · JSON representation ·

Repository

Internal package to standardize preliminary data analysis, including common initial exploratory feedback

Basic Info

Host: GitHub
Owner: nmedina17
License: mit
Language: R
Default Branch: main
Homepage:
Size: 260 KB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 7
Releases: 2

Created over 4 years ago · Last pushed almost 3 years ago

Metadata Files

Readme License Citation

oir

Internal package to standardize preliminary data analysis and integrate analytical feedback

Encodes common statistical decision trees, for a list of response variables for a project. Cite via Zenodo DOI --

Requires input data tbl to be "nested" into 2 columns -- 1. "variable", a vector of response variable names, and 2. "varData", a list of

This can be done with the 2 code lines, acting on typically cleaned "wide" data (i.e. response variables as columns), with dataset's independent variable keys also as columns (e.g. "plot", treatment", etc.) --

e.g.

cleanData <- dplyr::tibble( "plot" = c(1, 2, 3), "treat" = c(1, 2, 3), "var1" = c(1, 2, 3), "var2" = c(1, 2, 3) #, [...] )

varTbl <- cleanData %>% tidyr::pivotlonger(namesto = "variable", values_to = "value") %>% purr::nest(cols = !c(-variable)).

then processed using --

statFormula <- value ~ [independent variable of interest] resultsTbl <- varTbl %>% oir::getStatsTbl(formula = statFormula)

Notes --

best used with simple model formula, i.e. 1 independent variable in "statFormula" (at a time)
the main oir::getStatsTbl() function has versions "1", "2", and "12" to run --
1. linear regressions at plot-/group-level median centers ("1")
2. non-linear version of regression (not nlme::) modifying independent variable with stats::poly(x, degree = 2, at finest row level ("2")
3. the same non-linear version of regression, but using plot-/group-level centers ("12")
more complex formulas (e.g. lmer) can be processed, but with limited insight into final appropriate p-values
non-parametric option included as back-up test for usable p-value, but may offer slightly different inference/interpretation

Owner

Name: Nicholas Medina, Ph.D.
Login: nmedina17
Kind: user
Location: Chicago, IL, USA

Repositories: 2
Profile: https://github.com/nmedina17

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Medina"
  given-names: "Nicholas"
  orcid: "https://orcid.org/0000-0001-5465-3988"
title: "Code for: Automating statistical decision trees for data exploration and analysis revision workflow"
version: 0.9.1
doi: 10.5281/zenodo.6800299
date-released: 2022-07-05
url: "https://github.com/nmedina17/oir"

GitHub Events

Total

Last Year

Dependencies

DESCRIPTION cran

tidymodels * depends
tidyverse * depends
class * imports
devtools * imports
doParallel * imports
dplyr * imports
furrr * imports
future * imports
ggbeeswarm * imports
gginnards * imports
ggplot2 * imports
ggpmisc * imports
ggpubr * imports
glue * imports
here * imports
lmerTest * imports
magrittr * imports
poweRlaw * imports
rpart * imports
rstatix * imports
survival * imports
tictoc * imports
vegan * imports

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

oir