https://github.com/alexmclain/cdscr

Programs for length-biased cross-sectional survival data.

Keywords

infertility length-biased-distribution survival-analysis

Last synced: 7 months ago · JSON representation

Repository

Programs for length-biased cross-sectional survival data.

Basic Info

Host: GitHub
Owner: alexmclain
Language: R
Default Branch: master
Homepage:
Size: 73.2 KB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Topics

infertility length-biased-distribution survival-analysis

Created over 5 years ago · Last pushed over 4 years ago

Metadata Files

Readme

Length-biased semicompeting risks models for cross-sectional data

These programs will analysis of current duration data that is subjected to semi-competing risks. The details of this model are discussed in McLain et al. (2021) referenced below. To run the programs download the files to a folder, and open the example script document (make sure the folder containing all the files is the working directory). In McLain et al. (2021) data from the NSFG is analyzed. The NSFG data is available at the following website

In the paper, cycle 6 through the 2015-2017 data is analyzed (we cannot post NSFG data on this website, but it is all publicly available).

McLain, A.C., S. Guo, M.E. Thoma, and J. Zhang (2021). Length-biased semicompeting risks models for cross-sectional data: An application to current duration of pregnancy attempt data. Annals of Applied Statistics 15:2 1054–1067.

Examples with Simulated Data

Example with continuous survival data

First, load the programs, read in the data and extract the key elements:

```r source("CDSCRprograms.R") exdata <- read.csv("CDSCR_data.csv")

Total current duration time

x.data <- ex_data$T

Time to intermittent event

y.data <- ex_data$TTFT

Indicator that intermittent event occured.

delta.data <- exdata$failind ```

Setting the number of peicewise parameters, locations will be choosen within function (knots can be given) r K1 <- 5 #For T K2 <- 5 #For Y K3 <- 5 #For Z

Run Maximum Likelihood Optimization

```r CDSCR_opt <- CDSCR(x.data,y.data,delta.data,K1=K1,K2=K2,K3=K3,B=20)

Extract knots

knots.t <- CDSCRopt$knots.t knots.y <- CDSCRopt$knots.y knots.z <- CDSCR_opt$knots.z

Likelihood value and optimization details.

CDSCRopt$likeopt$value CDSCRopt$likeopt$counts CDSCRopt$likeopt$convergence

Extract parameter estimates

parest <- CDSCRopt$like_opt$par ```

Transform the parameter estimate to their corresponding values.

r alpha_T_est <- exp(par_est[1:(length(knots.t)+1)]) alpha_Y_est <- exp(par_est[(length(knots.t)+2):(length(c(knots.t,knots.y))+2)]) alpha_Z_est <- exp(par_est[(length(c(knots.t,knots.y))+3):(length(c(knots.t,knots.y,knots.z))+3)]) corrxy_est <- -(exp(par_est[length(par_est)-1])/(1+exp(par_est[length(par_est)-1]))) corrxz_est <- (exp(par_est[length(par_est)])/(1+exp(par_est[length(par_est)])))

Calculating the survival functions of T, Y, Z and min(T,Y)

r t_vec <- seq(0,36) Surv_T_est <- PC_surv(t_vec,knots.t,alpha_T_est) Surv_Y_est <- PC_surv(t_vec,knots.y,alpha_Y_est) Surv_Z_est <- PC_surv(t_vec,knots.z,alpha_Z_est) Min_TY_est <- surv.min.T.Y(t_vec,corrxy_est,knots.t,alpha_T_est,knots.y,alpha_Y_est)

Now, we'll estimate the survival using the standard (McLain et al. 2014) approach

r Surv_T_naive <- CD_surv(x.data,method = 'Piecewise', knots = knots.t)$Surv_est

Plotting the results ####

r plot(t_vec,Surv_T_est,lwd=3,type = "l",xlim=c(0,max(t_vec)),ylim=c(0,1),ylab="Survival Function",xlab="Time",las=1) lines(t_vec,Surv_Y_est,col=2,lwd=3) lines(t_vec,Surv_Z_est,col=3,lwd=3) lines(t_vec,Min_TY_est,col=4,lwd=3) lines(t_vec,Surv_T_naive(t_vec),col=1,lwd=3,lty=2) legend("topright",legend = c("Survival of T","Survival of Y","Survival of Z","Survival of min(T,Y)","Survival of T (naive)"),lwd=3,col=c(1:4,1),lty=c(rep(1,4),2))

Produce a table of selected values

r t_want <- c(3,6,12,24,36) surv_res <- data.frame(time=t_want,Est_surv_T=Surv_T_est[t_vec %in% t_want],Naive_surv_T=Surv_T_naive(t_want),Est_surv_Y=Surv_Y_est[t_vec %in% t_want],Est_surv_Z=Surv_Z_est[t_vec %in% t_want],Est_surv_minTY=Min_TY_est[t_vec %in% t_want]) surv_res

Example with grouped survival data

```r exdata <- read.csv("CDSCRdata_grp.csv")

Total current duration time

x.data <- ex_data$T

Time to intermittent event

y.data <- ex_data$TTFT

Indicator that intermittent event occured.

delta.data <- exdata$failind ```

Setting the number of peicewise parameters, locations will be chosen within function (knots can be given)

r K1 <- 5 #For T K2 <- 5 #For Y K3 <- 5 #For Z

Run Maximum Likelihood Optimization and extract the results

```r CDSCR_opt <- CDSCR(x.data,y.data,delta.data,K1=K1,K2=K2,K3=K3,B=20)

Extract knots

knots.t <- CDSCRopt$knots.t knots.y <- CDSCRopt$knots.y knots.z <- CDSCR_opt$knots.z

Likelihood value and optimization details.

CDSCRopt$likeopt$value CDSCRopt$likeopt$counts CDSCRopt$likeopt$convergence

Extract parameter estimates

parest <- CDSCRopt$like_opt$par

Transform to corresponding values

alphaTest <- exp(parest[1:(length(knots.t)+1)]) alphaYest <- exp(parest[(length(knots.t)+2):(length(c(knots.t,knots.y))+2)]) alphaZest <- exp(parest[(length(c(knots.t,knots.y))+3):(length(c(knots.t,knots.y,knots.z))+3)]) corrxyest <- -(exp(parest[length(parest)-1])/(1+exp(parest[length(parest)-1]))) corrxzest <- (exp(parest[length(parest)])/(1+exp(parest[length(par_est)]))) ```

Calculating the survival functions of T, Y, Z and min(T,Y)

r t_vec <- seq(0,36) Surv_T_est <- PC_surv(t_vec,knots.t,alpha_T_est) Surv_Y_est <- PC_surv(t_vec,knots.y,alpha_Y_est) Surv_Z_est <- PC_surv(t_vec,knots.z,alpha_Z_est) Min_TY_est <- surv.min.T.Y(t_vec,corrxy_est,knots.t,alpha_T_est,knots.y,alpha_Y_est)

Estimating the survival using the standard (McLain et al. 2014) approach

r Surv_T_naive <- CD_surv(x.data,method = 'Piecewise', knots = knots.t)$Surv_est

Plotting the results

r plot(t_vec,Surv_T_est,lwd=3,type = "l",xlim=c(0,max(t_vec)),ylim=c(0,1),ylab="Survival Function",xlab="Time",las=1) lines(t_vec,Surv_Y_est,col=2,lwd=3) lines(t_vec,Surv_Z_est,col=3,lwd=3) lines(t_vec,Min_TY_est,col=4,lwd=3) lines(t_vec,Surv_T_naive(t_vec),col=1,lwd=3,lty=2) legend("topright",legend = c("Survival of T","Survival of Y","Survival of Z","Survival of min(T,Y)","Survival of T (naive)"),lwd=3,col=c(1:4,1),lty=c(rep(1,4),2))

Table of selected values

r t_want <- c(3,6,12,24,36) surv_res <- data.frame(time=t_want,Est_surv_T=Surv_T_est[t_vec %in% t_want],Naive_surv_T=Surv_T_naive(t_want),Est_surv_Y=Surv_Y_est[t_vec %in% t_want],Est_surv_Z=Surv_Z_est[t_vec %in% t_want],Est_surv_minTY=Min_TY_est[t_vec %in% t_want]) surv_res

McLain A. C., R. Sundaram, M. E. Thoma, and G. M. Buck Louis (2014). Semiparametric modeling of grouped current duration data with preferential reporting. Statistics in Medicine, 33 (23), 3961--3972

Owner

Name: Alexander McLain
Login: alexmclain
Kind: user
Location: Columbia, SC
Company: University of South Carolina

Website: https://sites.google.com/site/alexmclain/home
Twitter: AlexanderMcLai2
Repositories: 2
Profile: https://github.com/alexmclain

I have a broad range of interests that focus on estimation, prediction and inference in complex models, commonly with clustered or high-dimensional data.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

https://github.com/alexmclain/cdscr

Science Score: 23.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

Length-biased semicompeting risks models for cross-sectional data

Examples with Simulated Data

Example with continuous survival data

Total current duration time

Time to intermittent event

Indicator that intermittent event occured.

Extract knots

Likelihood value and optimization details.

Extract parameter estimates

Example with grouped survival data

Total current duration time

Time to intermittent event

Indicator that intermittent event occured.

Extract knots

Likelihood value and optimization details.

Extract parameter estimates

Transform to corresponding values

Owner

GitHub Events

Total

Last Year