simstudy

simstudy: Illuminating research methods through data generation - Published in JOSS (2020)

https://github.com/kgoldfeld/simstudy

Science Score: 100.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README and JOSS metadata
  • Academic publication links
    Links to: joss.theoj.org
  • Committers with academic emails
    1 of 14 committers (7.1%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
    Published in Journal of Open Source Software

Keywords

data-generation data-simulation r simulation statistical-models
Last synced: 4 months ago · JSON representation ·

Repository

simstudy: Illuminating research methods through data generation

Basic Info
Statistics
  • Stars: 85
  • Watchers: 5
  • Forks: 8
  • Open Issues: 20
  • Releases: 16
Topics
data-generation data-simulation r simulation statistical-models
Created over 9 years ago · Last pushed 4 months ago
Metadata Files
Readme Changelog Contributing License Code of conduct Citation Support Codemeta

README.Rmd

---
title: "simstudy"
output: github_document
---

```{r, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "man/figures/README-",
  out.width = "100%"
)
```



[![R build status](https://github.com/kgoldfeld/simstudy/workflows/R-CMD-check/badge.svg?branch=main)](https://github.com/kgoldfeld/simstudy/actions){target="_blank"}
[![CRAN status](https://www.r-pkg.org/badges/version/simstudy)](https://CRAN.R-project.org/package=simstudy){target="_blank"}
[![status](https://joss.theoj.org/papers/10.21105/joss.02763/status.svg)](https://joss.theoj.org/papers/10.21105/joss.02763){target="_blank"}
[![CRAN downloads](https://cranlogs.r-pkg.org/badges/grand-total/simstudy)](https://CRAN.R-project.org/package=simstudy){target="_blank"}
[![codecov](https://app.codecov.io/gh/kgoldfeld/simstudy/branch/main/graph/badge.svg)](https://app.codecov.io/gh/kgoldfeld/simstudy){target="_blank"}
[![Lifecycle: stable](https://img.shields.io/badge/lifecycle-stable-brightgreen.svg)](https://lifecycle.r-lib.org/articles/stages.html){target="_blank"}


The `simstudy` package is a collection of functions that allow users to generate simulated data sets in order to explore modeling techniques or better understand data generating processes. The user defines the distributions of individual variables, specifies relationships between covariates and outcomes, and generates data based on these specifications. The final data sets can represent randomized control trials, repeated measure designs, cluster randomized trials, or naturally observed data processes. Other complexities that can be added include survival data, correlated data, factorial study designs, step wedge designs, and missing data processes.

Simulation using `simstudy` has two fundamental steps. The user (1) **defines** the data elements of a data set and (2) **generates** the data based on these definitions. Additional functionality exists to simulate observed or randomized **treatment assignment/exposures**, to create **longitudinal/panel** data, to create **multi-level/hierarchical** data, to create datasets with **correlated variables** based on a specified covariance structure, to **merge** datasets, to create data sets with **missing** data, and to create non-linear relationships with underlying **spline** curves.

The overarching philosophy of `simstudy` is to create data generating processes that mimic the typical models used to fit those types of data. So, the parameterization of some of the data generating processes may not follow the standard parameterizations for the specific distributions. For example, in `simstudy` *gamma*-distributed data are generated based on the specification of a mean μ (or log(μ)) and a dispersion $d$, rather than shape α and rate β parameters that more typically characterize the *gamma* distribution. When we estimate the parameters, we are modeling μ (or some function of μ), so we should explicitly recover the `simstudy` parameters used to generate the model, thus illuminating the relationship between the underlying data generating processes and the models. For more details on the
package, use cases, examples, and function reference see the [documentation page](https://kgoldfeld.github.io/simstudy/articles/simstudy.html).


## Installation

You can install the released version of simstudy from [CRAN](https://CRAN.R-project.org){target="_blank"} with:

``` r
install.packages("simstudy")
```

And the development version from [GitHub](https://github.com/){target="_blank"} with:

``` r
# install.packages("devtools")
devtools::install_github("kgoldfeld/simstudy")
```
## Example

Here is some simple sample code, much more in the vignettes:

```{r, echo = TRUE}
library(simstudy)
set.seed(1965)

def <- defData(varname="x", formula = 10, variance = 2, dist = "normal")
def <- defData(def, varname="y", formula = "3 + 0.5 * x", variance = 1, dist = "normal")
dd <- genData(250, def)

dd <- trtAssign(dd, nTrt = 4, grpName = "grp", balanced = TRUE)

dd
```

## Contributing & Support

If you find a bug or need help, please file an issue with a [reprex](https://www.tidyverse.org/help/){target="_blank"} on [Github](https://github.com/kgoldfeld/simstudy/issues){target="_blank"}. We are happy to accept contributions to simstudy. More information on how to propose changes or fix bugs can be found [here](https://kgoldfeld.github.io/simstudy/CONTRIBUTING.html){target="_blank"}.

## Code of Conduct

Please note that the simstudy project is released with a [Contributor Code of Conduct](https://kgoldfeld.github.io/simstudy/CODE_OF_CONDUCT.html){target="_blank"}. By contributing to this project, you agree to abide by its terms.

Owner

  • Login: kgoldfeld
  • Kind: user

JOSS Publication

simstudy: Illuminating research methods through data generation
Published
October 27, 2020
Volume 5, Issue 54, Page 2763
Authors
Keith Goldfeld ORCID
NYU Grossman School of Medicine.
Jacob Wujciak-Jens ORCID
Independent Researcher
Editor
Mikkel Meyer Andersen ORCID
Tags
statistics data-simulation statistical-models data-generation

Citation (CITATION.cff)

# --------------------------------------------
# CITATION file created with {cffr} R package
# See also: https://docs.ropensci.org/cffr/
# --------------------------------------------
 
cff-version: 1.2.0
message: 'To cite package "simstudy" in publications use:'
type: software
license: GPL-3.0-only
title: 'simstudy: Simulation of Study Data'
version: 0.8.1.9000
doi: 10.21105/joss.02763
identifiers:
- type: doi
  value: 10.32614/CRAN.package.simstudy
- type: url
  value: https://kgoldfeld.github.io/simstudy/dev/
abstract: Simulates data sets in order to explore modeling techniques or better understand
  data generating processes. The user specifies a set of relationships between covariates,
  and generates data based on these specifications. The final data sets can represent
  data from randomized control trials, repeated measure (longitudinal) designs, and
  cluster randomized trials. Missingness can be generated using various mechanisms
  (MCAR, MAR, NMAR).
authors:
- family-names: Goldfeld
  given-names: Keith
  email: keith.goldfeld@nyulangone.org
  orcid: https://orcid.org/0000-0002-0292-8780
- family-names: Wujciak-Jens
  given-names: Jacob
  email: jacob@wujciak.de
  orcid: https://orcid.org/0000-0002-7281-3989
preferred-citation:
  type: article
  title: 'simstudy: Illuminating research methods through data generation'
  authors:
  - family-names: Goldfeld
    given-names: Keith
    email: keith.goldfeld@nyulangone.org
    orcid: https://orcid.org/0000-0002-0292-8780
  - family-names: Wujciak-Jens
    given-names: Jacob
    email: jacob@wujciak.de
    orcid: https://orcid.org/0000-0002-7281-3989
  publisher:
    name: The Open Journal
  journal: Journal of Open Source Software
  year: '2020'
  volume: '5'
  issue: '54'
  url: https://doi.org/10.21105/joss.02763
  doi: 10.21105/joss.02763
  start: '2763'
repository: https://CRAN.R-project.org/package=simstudy
repository-code: https://github.com/kgoldfeld/simstudy
url: https://kgoldfeld.github.io/simstudy/
date-released: '2024-07-29'
contact:
- family-names: Goldfeld
  given-names: Keith
  email: keith.goldfeld@nyulangone.org
  orcid: https://orcid.org/0000-0002-0292-8780
keywords:
- data-generation
- data-simulation
- r
- simulation
- statistical-models
references:
- type: software
  title: 'R: A Language and Environment for Statistical Computing'
  notes: Depends
  url: https://www.R-project.org/
  authors:
  - name: R Core Team
  institution:
    name: R Foundation for Statistical Computing
    address: Vienna, Austria
  year: '2025'
  version: '>= 4.1.0'
- type: software
  title: data.table
  abstract: 'data.table: Extension of `data.frame`'
  notes: Imports
  url: https://r-datatable.com
  repository: https://CRAN.R-project.org/package=data.table
  authors:
  - family-names: Barrett
    given-names: Tyson
    email: t.barrett88@gmail.com
    orcid: https://orcid.org/0000-0002-2137-1391
  - family-names: Dowle
    given-names: Matt
    email: mattjdowle@gmail.com
  - family-names: Srinivasan
    given-names: Arun
    email: asrini@pm.me
  - family-names: Gorecki
    given-names: Jan
  - family-names: Chirico
    given-names: Michael
    orcid: https://orcid.org/0000-0003-0787-087X
  - family-names: Hocking
    given-names: Toby
    orcid: https://orcid.org/0000-0002-3146-0865
  - family-names: Schwendinger
    given-names: Benjamin
    orcid: https://orcid.org/0000-0003-3315-8114
  - family-names: Krylov
    given-names: Ivan
    email: ikrylov@disroot.org
    orcid: https://orcid.org/0000-0002-0172-3812
  year: '2025'
  doi: 10.32614/CRAN.package.data.table
- type: software
  title: glue
  abstract: 'glue: Interpreted String Literals'
  notes: Imports
  url: https://glue.tidyverse.org/
  repository: https://CRAN.R-project.org/package=glue
  authors:
  - family-names: Hester
    given-names: Jim
    orcid: https://orcid.org/0000-0002-2739-7082
  - family-names: Bryan
    given-names: Jennifer
    email: jenny@posit.co
    orcid: https://orcid.org/0000-0002-6983-2759
  year: '2025'
  doi: 10.32614/CRAN.package.glue
- type: software
  title: methods
  abstract: 'R: A Language and Environment for Statistical Computing'
  notes: Imports
  authors:
  - name: R Core Team
  institution:
    name: R Foundation for Statistical Computing
    address: Vienna, Austria
  year: '2025'
- type: software
  title: mvnfast
  abstract: 'mvnfast: Fast Multivariate Normal and Student''s t Methods'
  notes: Imports
  url: https://github.com/mfasiolo/mvnfast/
  repository: https://CRAN.R-project.org/package=mvnfast
  authors:
  - family-names: Fasiolo
    given-names: Matteo
    email: matteo.fasiolo@gmail.com
  year: '2025'
  doi: 10.32614/CRAN.package.mvnfast
- type: software
  title: Rcpp
  abstract: 'Rcpp: Seamless R and C++ Integration'
  notes: Imports
  url: https://www.rcpp.org
  repository: https://CRAN.R-project.org/package=Rcpp
  authors:
  - family-names: Eddelbuettel
    given-names: Dirk
    email: edd@debian.org
    orcid: https://orcid.org/0000-0001-6419-907X
  - family-names: Francois
    given-names: Romain
    orcid: https://orcid.org/0000-0002-2444-4226
  - family-names: Allaire
    given-names: JJ
    orcid: https://orcid.org/0000-0003-0174-9868
  - family-names: Ushey
    given-names: Kevin
    orcid: https://orcid.org/0000-0003-2880-7407
  - family-names: Kou
    given-names: Qiang
    orcid: https://orcid.org/0000-0001-6786-5453
  - family-names: Russell
    given-names: Nathan
  - family-names: Ucar
    given-names: Iñaki
    orcid: https://orcid.org/0000-0001-6403-5550
  - family-names: Bates
    given-names: Doug
    orcid: https://orcid.org/0000-0001-8316-9503
  - family-names: Chambers
    given-names: John
  year: '2025'
  doi: 10.32614/CRAN.package.Rcpp
- type: software
  title: backports
  abstract: 'backports: Reimplementations of Functions Introduced Since R-3.0.0'
  notes: Imports
  url: https://github.com/r-lib/backports
  repository: https://CRAN.R-project.org/package=backports
  authors:
  - family-names: Lang
    given-names: Michel
    email: michellang@gmail.com
    orcid: https://orcid.org/0000-0001-9754-0393
  - family-names: Murdoch
    given-names: Duncan
    email: murdoch.duncan@gmail.com
  - name: R Core Team
  year: '2025'
  doi: 10.32614/CRAN.package.backports
- type: software
  title: fastglm
  abstract: 'fastglm: Fast and Stable Fitting of Generalized Linear Models using ''RcppEigen'''
  notes: Imports
  url: https://github.com/jaredhuling/fastglm
  repository: https://CRAN.R-project.org/package=fastglm
  authors:
  - family-names: Huling
    given-names: Jared
    email: jaredhuling@gmail.com
  year: '2025'
  doi: 10.32614/CRAN.package.fastglm
- type: software
  title: pbv
  abstract: 'pbv: Probabilities for Bivariate Normal Distribution'
  notes: Imports
  url: https://sites.google.com/view/alexander-robitzsch/software
  repository: https://CRAN.R-project.org/package=pbv
  authors:
  - family-names: Robitzsch
    given-names: Alexander
    orcid: https://orcid.org/0000-0002-8226-3132
  year: '2025'
  doi: 10.32614/CRAN.package.pbv
  version: '>= 0.5.47'
- type: software
  title: covr
  abstract: 'covr: Test Coverage for Packages'
  notes: Suggests
  url: https://covr.r-lib.org
  repository: https://CRAN.R-project.org/package=covr
  authors:
  - family-names: Hester
    given-names: Jim
    email: james.f.hester@gmail.com
  year: '2025'
  doi: 10.32614/CRAN.package.covr
- type: software
  title: dplyr
  abstract: 'dplyr: A Grammar of Data Manipulation'
  notes: Suggests
  url: https://dplyr.tidyverse.org
  repository: https://CRAN.R-project.org/package=dplyr
  authors:
  - family-names: Wickham
    given-names: Hadley
    email: hadley@posit.co
    orcid: https://orcid.org/0000-0003-4757-117X
  - family-names: François
    given-names: Romain
    orcid: https://orcid.org/0000-0002-2444-4226
  - family-names: Henry
    given-names: Lionel
  - family-names: Müller
    given-names: Kirill
    orcid: https://orcid.org/0000-0002-1416-3412
  - family-names: Vaughan
    given-names: Davis
    email: davis@posit.co
    orcid: https://orcid.org/0000-0003-4777-038X
  year: '2025'
  doi: 10.32614/CRAN.package.dplyr
- type: software
  title: formatR
  abstract: 'formatR: Format R Code Automatically'
  notes: Suggests
  url: https://github.com/yihui/formatR
  repository: https://CRAN.R-project.org/package=formatR
  authors:
  - family-names: Xie
    given-names: Yihui
    email: xie@yihui.name
    orcid: https://orcid.org/0000-0003-0645-5666
  year: '2025'
  doi: 10.32614/CRAN.package.formatR
- type: software
  title: gee
  abstract: 'gee: Generalized Estimation Equation Solver'
  notes: Suggests
  repository: https://CRAN.R-project.org/package=gee
  authors:
  - family-names: Carey
    given-names: Vincent J
  year: '2025'
  doi: 10.32614/CRAN.package.gee
- type: software
  title: ggplot2
  abstract: 'ggplot2: Create Elegant Data Visualisations Using the Grammar of Graphics'
  notes: Suggests
  url: https://ggplot2.tidyverse.org
  repository: https://CRAN.R-project.org/package=ggplot2
  authors:
  - family-names: Wickham
    given-names: Hadley
    email: hadley@posit.co
    orcid: https://orcid.org/0000-0003-4757-117X
  - family-names: Chang
    given-names: Winston
    orcid: https://orcid.org/0000-0002-1576-2126
  - family-names: Henry
    given-names: Lionel
  - family-names: Pedersen
    given-names: Thomas Lin
    email: thomas.pedersen@posit.co
    orcid: https://orcid.org/0000-0002-5147-4711
  - family-names: Takahashi
    given-names: Kohske
  - family-names: Wilke
    given-names: Claus
    orcid: https://orcid.org/0000-0002-7470-9261
  - family-names: Woo
    given-names: Kara
    orcid: https://orcid.org/0000-0002-5125-4188
  - family-names: Yutani
    given-names: Hiroaki
    orcid: https://orcid.org/0000-0002-3385-7233
  - family-names: Dunnington
    given-names: Dewey
    orcid: https://orcid.org/0000-0002-9415-4582
  - family-names: Brand
    given-names: Teun
    name-particle: van den
    orcid: https://orcid.org/0000-0002-9335-7468
  year: '2025'
  doi: 10.32614/CRAN.package.ggplot2
- type: software
  title: grid
  abstract: 'R: A Language and Environment for Statistical Computing'
  notes: Suggests
  authors:
  - name: R Core Team
  institution:
    name: R Foundation for Statistical Computing
    address: Vienna, Austria
  year: '2025'
- type: software
  title: gridExtra
  abstract: 'gridExtra: Miscellaneous Functions for "Grid" Graphics'
  notes: Suggests
  repository: https://CRAN.R-project.org/package=gridExtra
  authors:
  - family-names: Auguie
    given-names: Baptiste
    email: baptiste.auguie@gmail.com
  year: '2025'
  doi: 10.32614/CRAN.package.gridExtra
- type: software
  title: hedgehog
  abstract: 'hedgehog: Property-Based Testing'
  notes: Suggests
  url: https://hedgehog.qa
  repository: https://CRAN.R-project.org/package=hedgehog
  authors:
  - family-names: Campbell
    given-names: Huw
    email: huw.campbell@gmail.com
  year: '2025'
  doi: 10.32614/CRAN.package.hedgehog
- type: software
  title: knitr
  abstract: 'knitr: A General-Purpose Package for Dynamic Report Generation in R'
  notes: Suggests
  url: https://yihui.org/knitr/
  repository: https://CRAN.R-project.org/package=knitr
  authors:
  - family-names: Xie
    given-names: Yihui
    email: xie@yihui.name
    orcid: https://orcid.org/0000-0003-0645-5666
  year: '2025'
  doi: 10.32614/CRAN.package.knitr
- type: software
  title: magrittr
  abstract: 'magrittr: A Forward-Pipe Operator for R'
  notes: Suggests
  url: https://magrittr.tidyverse.org
  repository: https://CRAN.R-project.org/package=magrittr
  authors:
  - family-names: Bache
    given-names: Stefan Milton
    email: stefan@stefanbache.dk
  - family-names: Wickham
    given-names: Hadley
    email: hadley@rstudio.com
  year: '2025'
  doi: 10.32614/CRAN.package.magrittr
- type: software
  title: Matrix
  abstract: 'Matrix: Sparse and Dense Matrix Classes and Methods'
  notes: Suggests
  url: https://Matrix.R-forge.R-project.org
  repository: https://CRAN.R-project.org/package=Matrix
  authors:
  - family-names: Bates
    given-names: Douglas
    orcid: https://orcid.org/0000-0001-8316-9503
  - family-names: Maechler
    given-names: Martin
    email: mmaechler+Matrix@gmail.com
    orcid: https://orcid.org/0000-0002-8685-9910
  - family-names: Jagan
    given-names: Mikael
    orcid: https://orcid.org/0000-0002-3542-2938
  year: '2025'
  doi: 10.32614/CRAN.package.Matrix
- type: software
  title: mgcv
  abstract: 'mgcv: Mixed GAM Computation Vehicle with Automatic Smoothness Estimation'
  notes: Suggests
  repository: https://CRAN.R-project.org/package=mgcv
  authors:
  - family-names: Wood
    given-names: Simon
    email: simon.wood@r-project.org
  year: '2025'
  doi: 10.32614/CRAN.package.mgcv
- type: software
  title: ordinal
  abstract: 'ordinal: Regression Models for Ordinal Data'
  notes: Suggests
  url: https://github.com/runehaubo/ordinal
  repository: https://CRAN.R-project.org/package=ordinal
  authors:
  - family-names: Christensen
    given-names: Rune Haubo Bojesen
    email: rune.haubo@gmail.com
  year: '2025'
  doi: 10.32614/CRAN.package.ordinal
- type: software
  title: pracma
  abstract: 'pracma: Practical Numerical Math Functions'
  notes: Suggests
  repository: https://CRAN.R-project.org/package=pracma
  authors:
  - family-names: Borchers
    given-names: Hans W.
    email: hwborchers@googlemail.com
  year: '2025'
  doi: 10.32614/CRAN.package.pracma
- type: software
  title: rmarkdown
  abstract: 'rmarkdown: Dynamic Documents for R'
  notes: Suggests
  url: https://pkgs.rstudio.com/rmarkdown/
  repository: https://CRAN.R-project.org/package=rmarkdown
  authors:
  - family-names: Allaire
    given-names: JJ
    email: jj@posit.co
  - family-names: Xie
    given-names: Yihui
    email: xie@yihui.name
    orcid: https://orcid.org/0000-0003-0645-5666
  - family-names: Dervieux
    given-names: Christophe
    email: cderv@posit.co
    orcid: https://orcid.org/0000-0003-4474-2498
  - family-names: McPherson
    given-names: Jonathan
    email: jonathan@posit.co
  - family-names: Luraschi
    given-names: Javier
  - family-names: Ushey
    given-names: Kevin
    email: kevin@posit.co
  - family-names: Atkins
    given-names: Aron
    email: aron@posit.co
  - family-names: Wickham
    given-names: Hadley
    email: hadley@posit.co
  - family-names: Cheng
    given-names: Joe
    email: joe@posit.co
  - family-names: Chang
    given-names: Winston
    email: winston@posit.co
  - family-names: Iannone
    given-names: Richard
    email: rich@posit.co
    orcid: https://orcid.org/0000-0003-3925-190X
  year: '2025'
  doi: 10.32614/CRAN.package.rmarkdown
- type: software
  title: scales
  abstract: 'scales: Scale Functions for Visualization'
  notes: Suggests
  url: https://scales.r-lib.org
  repository: https://CRAN.R-project.org/package=scales
  authors:
  - family-names: Wickham
    given-names: Hadley
    email: hadley@posit.co
  - family-names: Pedersen
    given-names: Thomas Lin
    email: thomas.pedersen@posit.co
    orcid: https://orcid.org/0000-0002-5147-4711
  - family-names: Seidel
    given-names: Dana
  year: '2025'
  doi: 10.32614/CRAN.package.scales
- type: software
  title: splines
  abstract: 'R: A Language and Environment for Statistical Computing'
  notes: Suggests
  authors:
  - name: R Core Team
  institution:
    name: R Foundation for Statistical Computing
    address: Vienna, Austria
  year: '2025'
- type: software
  title: survival
  abstract: 'survival: Survival Analysis'
  notes: Suggests
  url: https://github.com/therneau/survival
  repository: https://CRAN.R-project.org/package=survival
  authors:
  - family-names: Therneau
    given-names: Terry M
    email: therneau.terry@mayo.edu
  year: '2025'
  doi: 10.32614/CRAN.package.survival
- type: software
  title: testthat
  abstract: 'testthat: Unit Testing for R'
  notes: Suggests
  url: https://testthat.r-lib.org
  repository: https://CRAN.R-project.org/package=testthat
  authors:
  - family-names: Wickham
    given-names: Hadley
    email: hadley@posit.co
  year: '2025'
  doi: 10.32614/CRAN.package.testthat
- type: software
  title: gtsummary
  abstract: 'gtsummary: Presentation-Ready Data Summary and Analytic Result Tables'
  notes: Suggests
  url: https://www.danieldsjoberg.com/gtsummary/
  repository: https://CRAN.R-project.org/package=gtsummary
  authors:
  - family-names: Sjoberg
    given-names: Daniel D.
    email: danield.sjoberg@gmail.com
    orcid: https://orcid.org/0000-0003-0862-2018
  - family-names: Larmarange
    given-names: Joseph
    orcid: https://orcid.org/0000-0001-7097-700X
  - family-names: Curry
    given-names: Michael
    orcid: https://orcid.org/0000-0002-0261-4044
  - family-names: Rua
    given-names: Emily
    name-particle: de la
    orcid: https://orcid.org/0009-0000-8738-5561
  - family-names: Lavery
    given-names: Jessica
    orcid: https://orcid.org/0000-0002-2746-5647
  - family-names: Whiting
    given-names: Karissa
    orcid: https://orcid.org/0000-0002-4683-1868
  - family-names: Zabor
    given-names: Emily C.
    orcid: https://orcid.org/0000-0002-1402-4498
  year: '2025'
  doi: 10.32614/CRAN.package.gtsummary
- type: software
  title: broom.helpers
  abstract: 'broom.helpers: Helpers for Model Coefficients Tibbles'
  notes: Suggests
  url: https://larmarange.github.io/broom.helpers/
  repository: https://CRAN.R-project.org/package=broom.helpers
  authors:
  - family-names: Larmarange
    given-names: Joseph
    email: joseph@larmarange.net
    orcid: https://orcid.org/0000-0001-7097-700X
  - family-names: Sjoberg
    given-names: Daniel D.
    email: danield.sjoberg@gmail.com
    orcid: https://orcid.org/0000-0003-0862-2018
  year: '2025'
  doi: 10.32614/CRAN.package.broom.helpers
- type: software
  title: survminer
  abstract: 'survminer: Drawing Survival Curves using ''ggplot2'''
  notes: Suggests
  url: https://rpkgs.datanovia.com/survminer/index.html
  repository: https://CRAN.R-project.org/package=survminer
  authors:
  - family-names: Kassambara
    given-names: Alboukadel
    email: alboukadel.kassambara@gmail.com
  - family-names: Kosinski
    given-names: Marcin
    email: m.p.kosinski@gmail.com
  - family-names: Biecek
    given-names: Przemyslaw
    email: przemyslaw.biecek@gmail.com
  year: '2025'
  doi: 10.32614/CRAN.package.survminer
- type: software
  title: katex
  abstract: 'katex: Rendering Math to HTML, ''MathML'', or R-Documentation Format'
  notes: Suggests
  url: https://docs.ropensci.org/katex/
  repository: https://CRAN.R-project.org/package=katex
  authors:
  - family-names: Ooms
    given-names: Jeroen
    email: jeroenooms@gmail.com
    orcid: https://orcid.org/0000-0002-4035-0289
  year: '2025'
  doi: 10.32614/CRAN.package.katex
- type: software
  title: dirmult
  abstract: 'dirmult: Estimation in Dirichlet-Multinomial Distribution'
  notes: Suggests
  repository: https://CRAN.R-project.org/package=dirmult
  authors:
  - family-names: Tvedebrink
    given-names: Torben
    email: tvede@math.aau.dk
  year: '2025'
  doi: 10.32614/CRAN.package.dirmult
- type: software
  title: rms
  abstract: 'rms: Regression Modeling Strategies'
  notes: Suggests
  url: https://hbiostat.org/R/rms/
  repository: https://CRAN.R-project.org/package=rms
  authors:
  - family-names: Harrell Jr
    given-names: Frank E
    email: fh@fharrell.com
  year: '2025'
  doi: 10.32614/CRAN.package.rms

CodeMeta (codemeta.json)

{
  "@context": [
    "https://doi.org/10.5063/schema/codemeta-2.0",
    "http://schema.org"
  ],
  "@type": "SoftwareSourceCode",
  "identifier": "simstudy",
  "description": "Simulates data sets in order to explore modeling techniques or better understand data generating processes. The user specifies a set of relationships between covariates, and generates data based on these specifications. The final data sets can represent data from randomized control trials, repeated measure (longitudinal) designs, and cluster randomized trials. Missingness can be generated using various mechanisms (MCAR, MAR, NMAR).",
  "name": "simstudy: Simulation of Study Data",
  "codeRepository": "https://github.com/kgoldfeld/simstudy",
  "relatedLink": [
    "https://kgoldfeld.github.io/simstudy/",
    "https://kgoldfeld.github.io/simstudy/dev/",
    "https://CRAN.R-project.org/package=simstudy"
  ],
  "issueTracker": "https://github.com/kgoldfeld/simstudy/issues",
  "license": "https://spdx.org/licenses/GPL-3.0",
  "version": "0.3.0.9000",
  "programmingLanguage": {
    "@type": "ComputerLanguage",
    "name": "R",
    "url": "https://r-project.org"
  },
  "runtimePlatform": "R version 4.1.2 (2021-11-01)",
  "provider": {
    "@id": "https://cran.r-project.org",
    "@type": "Organization",
    "name": "Comprehensive R Archive Network (CRAN)",
    "url": "https://cran.r-project.org"
  },
  "author": [
    {
      "@type": "Person",
      "givenName": "Keith",
      "familyName": "Goldfeld",
      "email": "keith.goldfeld@nyulangone.org",
      "@id": "https://orcid.org/0000-0002-0292-8780"
    },
    {
      "@type": "Person",
      "givenName": "Jacob",
      "familyName": "Wujciak-Jens",
      "email": "jacob@wujciak.de",
      "@id": "https://orcid.org/0000-0002-7281-3989"
    }
  ],
  "contributor": {},
  "copyrightHolder": {},
  "funder": {},
  "maintainer": [
    {
      "@type": "Person",
      "givenName": "Keith",
      "familyName": "Goldfeld",
      "email": "keith.goldfeld@nyulangone.org",
      "@id": "https://orcid.org/0000-0002-0292-8780"
    }
  ],
  "softwareSuggestions": [
    {
      "@type": "SoftwareApplication",
      "identifier": "covr",
      "name": "covr",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=covr"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "dplyr",
      "name": "dplyr",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=dplyr"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "formatR",
      "name": "formatR",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=formatR"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "gee",
      "name": "gee",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=gee"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "ggplot2",
      "name": "ggplot2",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=ggplot2"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "grid",
      "name": "grid"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "gridExtra",
      "name": "gridExtra",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=gridExtra"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "hedgehog",
      "name": "hedgehog",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=hedgehog"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "knitr",
      "name": "knitr",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=knitr"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "magrittr",
      "name": "magrittr",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=magrittr"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "Matrix",
      "name": "Matrix",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=Matrix"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "mgcv",
      "name": "mgcv",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=mgcv"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "ordinal",
      "name": "ordinal",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=ordinal"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "pracma",
      "name": "pracma",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=pracma"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "rmarkdown",
      "name": "rmarkdown",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=rmarkdown"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "scales",
      "name": "scales",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=scales"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "splines",
      "name": "splines"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "survival",
      "name": "survival",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=survival"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "testthat",
      "name": "testthat",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=testthat"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "gtsummary",
      "name": "gtsummary",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=gtsummary"
    }
  ],
  "softwareRequirements": [
    {
      "@type": "SoftwareApplication",
      "identifier": "R",
      "name": "R",
      "version": ">= 3.3.0"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "data.table",
      "name": "data.table",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=data.table"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "glue",
      "name": "glue",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=glue"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "methods",
      "name": "methods"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "mvnfast",
      "name": "mvnfast",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=mvnfast"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "mvtnorm",
      "name": "mvtnorm",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=mvtnorm"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "Rcpp",
      "name": "Rcpp",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=Rcpp"
    },
    {
      "@type": "SoftwareApplication",
      "identifier": "backports",
      "name": "backports",
      "provider": {
        "@id": "https://cran.r-project.org",
        "@type": "Organization",
        "name": "Comprehensive R Archive Network (CRAN)",
        "url": "https://cran.r-project.org"
      },
      "sameAs": "https://CRAN.R-project.org/package=backports"
    }
  ],
  "releaseNotes": "https://github.com/kgoldfeld/simstudy/blob/master/NEWS.md",
  "readme": "https://github.com/kgoldfeld/simstudy/blob/main/README.md",
  "fileSize": "6381.75KB",
  "contIntegration": [
    "https://github.com/kgoldfeld/simstudy/actions",
    "https://app.codecov.io/gh/kgoldfeld/simstudy"
  ],
  "developmentStatus": "https://lifecycle.r-lib.org/articles/stages.html",
  "keywords": [
    "r",
    "data-simulation",
    "data-generation",
    "simulation",
    "statistical-models"
  ],
  "citation": [
    {
      "@type": "ScholarlyArticle",
      "datePublished": "2020",
      "author": [
        {
          "@type": "Person",
          "givenName": "Keith",
          "familyName": "Goldfeld"
        },
        {
          "@type": "Person",
          "givenName": "Jacob",
          "familyName": "Wujciak-Jens"
        }
      ],
      "name": "simstudy: Illuminating research methods through data generation",
      "identifier": "10.21105/joss.02763",
      "url": "https://doi.org/10.21105/joss.02763",
      "pagination": "2763",
      "@id": "https://doi.org/10.21105/joss.02763",
      "sameAs": "https://doi.org/10.21105/joss.02763",
      "isPartOf": {
        "@type": "PublicationIssue",
        "issueNumber": "54",
        "datePublished": "2020",
        "isPartOf": {
          "@type": [
            "PublicationVolume",
            "Periodical"
          ],
          "volumeNumber": "5",
          "name": "Journal of Open Source Software"
        }
      }
    }
  ]
}

Papers & Mentions

Total mentions: 1

Estimands in epigenome-wide association studies
Last synced: 2 months ago

GitHub Events

Total
  • Issues event: 3
  • Watch event: 4
  • Delete event: 1
  • Issue comment event: 8
  • Push event: 49
  • Pull request review event: 5
  • Pull request review comment event: 7
  • Pull request event: 4
  • Create event: 2
Last Year
  • Issues event: 3
  • Watch event: 4
  • Delete event: 1
  • Issue comment event: 8
  • Push event: 49
  • Pull request review event: 5
  • Pull request review comment event: 7
  • Pull request event: 4
  • Create event: 2

Committers

Last synced: 5 months ago

All Time
  • Total Commits: 758
  • Total Committers: 14
  • Avg Commits per committer: 54.143
  • Development Distribution Score (DDS): 0.728
Past Year
  • Commits: 10
  • Committers: 1
  • Avg Commits per committer: 10.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Keith Goldfeld k****d@g****m 206
assignUser g****b@k****e 194
Keith Goldfeld k****d@n****g 190
elinsooon m****n@g****m 54
Keith Goldfeld g****1@G****l 30
Keith Goldfeld g****1@g****g 27
GitHub Actions a****s@g****m 22
Keith Goldfeld K****G@1****g 10
James Joseph Balamuta b****2@i****u 6
GertjanV g****n 6
Keith Goldfeld g****1@1****l 5
Keith Goldfeld k****g@K****l 5
Michael Bradley, Jr m****r@g****m 2
Keith Goldfeld K****G@K****d 1

Issues and Pull Requests

Last synced: 4 months ago

All Time
  • Total issues: 74
  • Total pull requests: 70
  • Average time to close issues: 5 months
  • Average time to close pull requests: 2 days
  • Total issue authors: 16
  • Total pull request authors: 3
  • Average comments per issue: 3.18
  • Average comments per pull request: 3.6
  • Merged pull requests: 61
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 2
  • Pull requests: 2
  • Average time to close issues: 10 months
  • Average time to close pull requests: 4 days
  • Issue authors: 2
  • Pull request authors: 1
  • Average comments per issue: 1.5
  • Average comments per pull request: 3.0
  • Merged pull requests: 2
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • kgoldfeld (44)
  • assignUser (9)
  • elinsooon (7)
  • gabgilling (2)
  • simonelgato (1)
  • NielsKrarup (1)
  • zackarno (1)
  • Alik-V (1)
  • moosterwegel (1)
  • momozinho (1)
  • ddsjoberg (1)
  • ecohen13 (1)
  • DAVIDCRUZ0202 (1)
  • lorenzoFabbri (1)
  • hebaalhosainyTU (1)
Pull Request Authors
  • kgoldfeld (51)
  • assignUser (12)
  • elinsooon (7)
Top Labels
Issue Labels
feature (23) bug (13) question (2) breaking change (1)
Pull Request Labels
docs (2) feature (2)

Packages

  • Total packages: 1
  • Total downloads:
    • cran 949 last-month
  • Total dependent packages: 3
  • Total dependent repositories: 6
  • Total versions: 27
  • Total maintainers: 1
cran.r-project.org: simstudy

Simulation of Study Data

  • Versions: 27
  • Dependent Packages: 3
  • Dependent Repositories: 6
  • Downloads: 949 Last month
Rankings
Stargazers count: 5.2%
Forks count: 7.9%
Average: 9.4%
Dependent packages count: 10.9%
Downloads: 11.2%
Dependent repos count: 11.9%
Last synced: 4 months ago