multisiteOTR

https://github.com/youmisuk/multisiteOTR


Repository

Basic Info

Host: GitHub
Owner: youmisuk
Language: R
Default Branch: main
Size: 18.6 KB

Statistics

Stars: 2
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created over 3 years ago · Last pushed almost 3 years ago

Designing Optimal, Data-Driven Policies from Multisite Randomized Trials

Youmi Suk¹ and Chan Park²

¹ Department of Human Development, Teachers College, Columbia University
² Department of Statistics and Data Science, The Wharton School, University of Pennsylvania

Overview

Optimal treatment regimes (OTRs) have been widely employed in computer science and personalized medicine to provide data-driven, optimal recommendations to individuals. However, previous research on OTRs has primarily focused on settings that are independent and identically distributed, with little attention given to the unique characteristics of educational settings, where students are nested within schools and there are hierarchical dependencies. The goal of this study is to design OTRs from multisite randomized trials, a commonly used experimental design in education and psychology to evaluate educational programs. We investigate modifications to popular OTR methods, specifically Q-learning and weighting methods, in order to improve their performance in multisite randomized trials. A total of 12 modifications, 6 for Q-learning and 6 for weighting, are proposed by utilizing different multilevel models, moderators, and augmentations. Simulation studies reveal that all Q-learning modifications improve performance in multisite randomized trials and the modifications that incorporate random treatment effects show the most promise in handling cluster-level moderators. Among weighting methods, the modification that incorporates cluster dummies into moderator variables and augmentation terms performs best across simulation conditions. The proposed modifications are demonstrated through an application to estimate an OTR of conditional cash transfer programs using a multisite randomized trial in Colombia to maximize educational attainment.

For more details on our proposed methods, see our paper. Here, we provide R code to reproduce our simulation study and to replicate our data analysis using data on conditional cash transfer (CCT) programs.

Simulation Study

DataGeneratingModels.R

This R file includes code for generating data from a multisite randomized trial with a cluster-level unmeasured covariate.
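As a rough illustration of this kind of data structure, the sketch below generates a small multisite randomized trial in base R. It is not the code in DataGeneratingModels.R: the function name `simulate_multisite`, the sample sizes, and all coefficient values are assumptions chosen for illustration. Only the general setup (individuals nested within sites, treatment randomized within each site, and a cluster-level covariate that is not released to the analyst) follows the description above.

```r
# Hypothetical generator: names and parameter values are illustrative
# assumptions, not those used in DataGeneratingModels.R.
simulate_multisite <- function(n_sites = 50, n_per_site = 30,
                               beta1 = 0.5, seed = 1) {
  set.seed(seed)
  site <- rep(seq_len(n_sites), each = n_per_site)  # site index per individual
  n <- length(site)

  U <- rnorm(n_sites)  # cluster-level covariate, unmeasured by the analyst
  X <- rnorm(n)        # individual-level moderator

  # Randomize treatment within each site: half treated, half control.
  A <- ave(numeric(n), site,
           FUN = function(z) sample(rep(0:1, length.out = length(z))))

  # Individual treatment effect with a cross-level interaction: beta1
  # scales how strongly the effect depends on the site-level covariate U.
  tau <- 0.2 + 0.5 * X + beta1 * U[site]
  Y <- 1 + 0.5 * X + 0.7 * U[site] + A * tau + rnorm(n)

  # U is deliberately left out of the returned data; tau is kept only as
  # an oracle column for evaluating estimated treatment rules.
  data.frame(site = factor(site), X = X, A = A, Y = Y, tau = tau)
}

dat <- simulate_multisite()
```

Setting `beta1 = 0` in this sketch removes the dependence of the treatment effect on the unmeasured site-level covariate, which gives a convenient reference case.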

Qlearn.R

This R file includes a function named Qlearn that implements our proposed modifications for Q-learning as well as the baseline Q-learning method.
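To make the idea concrete, here is a minimal single-stage Q-learning sketch in base R. It does not reproduce the Qlearn function or its arguments. It compares a baseline outcome regression that ignores sites with one that interacts treatment with site dummies, a fixed-effects stand-in (an assumption made here to avoid extra packages) for the multilevel models with random treatment effects discussed in the paper. The simulated data and all coefficient values are likewise illustrative assumptions.

```r
# Illustrative data: individuals nested in sites, with a site-level
# covariate U that moderates the treatment effect but is not observed.
set.seed(2023)
n_sites <- 40
n_per_site <- 50
site_id <- rep(seq_len(n_sites), each = n_per_site)
n <- length(site_id)
U <- rnorm(n_sites)[site_id]
X <- rnorm(n)
A <- rbinom(n, 1, 0.5)                       # randomized treatment
tau <- 0.5 * X + 1.0 * U                     # true individual treatment effect
Y <- 1 + 0.5 * X + 0.7 * U + A * tau + rnorm(n)
dat <- data.frame(site = factor(site_id), X = X, A = A, Y = Y)

# Q-learning step: fit an outcome model, predict the outcome under each
# treatment, and recommend the treatment with the larger prediction.
recommend <- function(fit, data) {
  q1 <- predict(fit, newdata = transform(data, A = 1))
  q0 <- predict(fit, newdata = transform(data, A = 0))
  as.integer(q1 > q0)
}

# Baseline: treatment interacts with the individual-level moderator only.
q_base <- lm(Y ~ A * X, data = dat)

# Site-aware variant: treatment also interacts with site dummies, so each
# site gets its own treatment-effect shift.
q_site <- lm(Y ~ A * (X + site), data = dat)

rule_base <- recommend(q_base, dat)
rule_site <- recommend(q_site, dat)

# Share of individuals assigned their truly better treatment (oracle check,
# possible only because the data are simulated).
optimal <- as.integer(tau > 0)
acc_base <- mean(rule_base == optimal)
acc_site <- mean(rule_site == optimal)
```

In this setup the site-aware rule can pick up the site-level variation in treatment effects that the baseline rule cannot see, so `acc_site` should exceed `acc_base`. With many small sites, site dummies become noisy, which is one reason a multilevel model (for example via the lme4 package) may be preferable in practice.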

SimulationCodes.R

This R file includes simulation code for our proposed modifications to Q-learning and weighting methods. The parameter beta1 represents the coefficient of a cross-level interaction effect between treatment status and a cluster-level unmeasured covariate. For more information on the simulation conditions, please refer to our paper.

CCT Data Study

Data on conditional cash transfer (CCT) programs

For our empirical analysis, we used data collected by Barrera-Osorio et al. (2011). The data can be downloaded from openICPSR. For more information on the data, please refer to the detailed report by Barrera-Osorio et al. (2011) and the accompanying codebook.

CCTdataAnalysisCodes.R

This R file can be used to replicate our data analysis.

Please note that these supplemental materials are provided for the purpose of reproducibility and should be used in accordance with academic ethical guidelines. Any reference to these materials in your work should properly cite the original sources.

Owner

Name: Youmi Suk
Login: youmisuk
Kind: user
Location: New York, NY
Company: Teachers College, Columbia University

Website: https://youmilab.github.io/
Repositories: 2
Profile: https://github.com/youmisuk

Assistant Professor of Applied Statistics

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Suk"
  given-names: "Youmi"
  orcid: "http://orcid.org/0000-0003-0316-6201"
- family-names: "Park"
  given-names: "Chan"
  orcid: "https://orcid.org/0000-0001-8315-1522"
title: "R Codes for Designing Optimal, Data-Driven Policies from Multisite Randomized Trials"
version: 
doi: 
date-released: 2023-01-20
url: "https://github.com/youmisuk/multisiteOTR"
