Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
✓Committers with academic emails
2 of 2 committers (100.0%) from academic institutions -
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (7.0%) to scientific vocabulary
Repository
ODS
Basic Info
- Host: GitHub
- Owner: Yinghao-Pan
- Language: R
- Default Branch: master
- Size: 138 KB
Statistics
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
ODS
Outcome-dependent sampling (ODS) schemes are cost-effective ways to enhance study efficiency. In ODS designs, one observes the exposure/covariates with a probability that depends on the outcome variable. Popular ODS designs include case-control for binary outcome, case-cohort for time-to-event outcome, and continuous outcome ODS design (Zhou et al. 2002). Because ODS data has biased sampling nature, standard statistical analysis such as linear regression will lead to biases estimates of the population parameters. This package implements four statistical methods related to ODS designs: (1) An empirical likelihood method analyzing the primary continuous outcome with respect to exposure variables in continuous ODS design (Zhou et al., 2002). (2) A partial linear model analyzing the primary outcome in continuous ODS design (Zhou, Qin and Longnecker, 2011). (3) Analyze a secondary outcome in continuous ODS design (Pan et al. 2018). (4) An estimated likelihood method analyzing a secondary outcome in case-cohort data (Pan et al. 2017).
The references are the following:
Zhou H, Weaver M, Qin J, Longnecker M, Wang M. (2002). A semiparametric empirical likelihood method for data from an outcome‐dependent sampling scheme with a continuous outcome. Biometrics, 58(2):413-421.
Zhou H, Qin G, Longnecker M. (2011). A partial linear model in the outcome‐dependent sampling setting to evaluate the effect of prenatal PCB exposure on cognitive function in children. Biometrics, 67(3):876-885.
Pan Y, Cai J, Kim S, Zhou H. (2017). Regression analysis for secondary response variable in a case‐cohort study. Biometrics.
Pan Y, Cai J, Longnecker M, Zhou H. (2018). Secondary outcome analysis for data from an outcome‐dependent sampling design. Statistics in medicine, 37(15):2321-2337.
Linear model in ODS
We assume that in the population, the primary outcome variable $Y$ follows the linear model: $$ Y = \beta{0} + \beta{1}X + \epsilon $$ where $X$ are the covariates, and $\epsilon\sim N(0, \sigma^2)$. In continuous ODS design, a simple random sample is taken from the full cohort, then two supplemental samples are taken from tails of the $Y$ distribution, i.e. $(-\infty, \mu{Y} - a*\sigma{Y})$ and $(\mu{Y} + a*\sigma{Y}, +\infty)$. As ODS data is not a simple random sample of the overall population, naive regression analysis will yield to invalid estimates of the population parameters. Zhou et al. (2002) develops a semiparametric empirical likelihood estimator (MSELE) for conducting inference on the parameters in the linear model.
Function odsmle provides the parameter estimates, and function se.spmle calculates the standard error for MSELE estimator.
Partial linear model in ODS
We assume that in the population, the primary outcome variable $Y$ follows the partial linear model: $$ E(Y|X,Z)=g(X)+Z^{T}\gamma $$ where $X$ is the expensive exposure, $Z$ are other covariates. $g(\cdot)$ is an unknown smooth function. Zhou, Qin and Longnecker (2011) considers a penalized spline method to estimate the nonparamatric function $g(\cdot)$ and other regression coefficients $\gamma$ under the ODS sampling scheme.
Function Estimate_PLMODS computes the parameter estimates and standard error in the partial linear model. Function gcv_ODS calculates the generalized cross-validation (GCV) for selecting the smoothing parameter. The details can be seen in Zhou, Qin and Longnecker (2011).
Secondary analysis in ODS design
We assume that in the population, the primary outcome $Y1$ and the secondary outcome $Y2$ satisfy the following conditional mean model: $$ E(Y1|X,Z)=\beta0+\beta1X+\beta2Z $$ $$ E(Y2|X,Z)=\gamma0+\gamma1X+\gamma2Z $$ Pan et al. (2018) proposed an augmented inverse probability weighted estimating equation to analyze the secondary outcome (parameters: $\gamma0, \gamma1, \gamma2$) for data obtained from the continuous ODS design. Function **secondaryODS** computes the parameter estimates and standard error for $(\beta, \gamma)$.
Secondary analysis of case-cohort data
When the primary outcome is survival time, case-cohort design is commonly used to enhance study efficiency. We assume that the primary outcome (survival time) follows the Cox model: $$ \lambda(t|X,Y2,Z)=\lambda0(t)\exp(\gamma1X+\gamma2Y2+\gamma3Z) $$ $Y2$ is a secondary outcome that satisfy the following linear model: $$ Y2 = \beta{0} + \beta{1}X + \beta2Z + \epsilon $$ where $\epsilon\sim N(0, \sigma^2)$. Pan et al. (2017) proposed a nonparametric estimated likelihood approach for analyzing the secondary outcome $Y2$ when the data is obtained from a case-cohort study. Function secondary_casecohort computes the parameter estimates and standard error for $(\beta, \gamma)$.
Package installation
~~~ install.packages("devtools") devtools::install_github("Yinghao-Pan/ODS") ~~~
GitHub Events
Total
Last Year
Committers
Last synced: over 2 years ago
Top Committers
| Name | Commits | |
|---|---|---|
| Yinghao Pan | y****n@e****u | 8 |
| Yinghao Pan | y****8@u****u | 7 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 11 months ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Packages
- Total packages: 1
-
Total downloads:
- cran 219 last-month
- Total dependent packages: 0
- Total dependent repositories: 0
- Total versions: 1
- Total maintainers: 1
cran.r-project.org: ODS
Statistical Methods for Outcome-Dependent Sampling Designs
- Homepage: https://github.com/Yinghao-Pan/ODS
- Documentation: http://cran.r-project.org/web/packages/ODS/ODS.pdf
- License: GPL-2 | GPL-3 [expanded from: GPL (≥ 2)]
-
Latest release: 0.2.0
published over 7 years ago
Rankings
Maintainers (1)
Dependencies
- R >= 3.5.0 depends
- cubature >= 1.4 imports
- stats * imports
- survival >= 2.42 imports
- utils * imports