Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 12 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.8%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Basic Info
  • Host: GitHub
  • Owner: youmisuk
  • Language: R
  • Default Branch: main
  • Size: 18.6 KB
Statistics
  • Stars: 2
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created about 3 years ago · Last pushed over 2 years ago
Metadata Files
Readme Citation

README.md

Designing Optimal, Data-Driven Policies from Multisite Randomized Trials

Youmi Suk1 and Chan Park2

1 Department of Human Development, Teachers College Columbia University
2 Department of Statistics and Data Science, The Wharton School, University of Pennsylvania

Overview

Optimal treatment regimes (OTRs) have been widely employed in computer science and personalized medicine to provide data-driven, optimal recommendations to individuals. However, previous research on OTRs has primarily focused on settings that are independent and identically distributed, with little attention given to the unique characteristics of educational settings, where students are nested within schools and there are hierarchical dependencies. The goal of this study is to design OTRs from multisite randomized trials, a commonly used experimental design in education and psychology to evaluate educational programs. We investigate modifications to popular OTR methods, specifically Q-learning and weighting methods, in order to improve their performance in multisite randomized trials. A total of 12 modifications, 6 for Q-learning and 6 for weighting, are proposed by utilizing different multilevel models, moderators, and augmentations. Simulation studies reveal that all Q-learning modifications improve performance in multisite randomized trials and the modifications that incorporate random treatment effects show the most promise in handling cluster-level moderators. Among weighting methods, the modification that incorporates cluster dummies into moderator variables and augmentation terms performs best across simulation conditions. The proposed modifications are demonstrated through an application to estimate an OTR of conditional cash transfer programs using a multisite randomized trial in Colombia to maximize educational attainment.

For more details of our proposed methods, see our paper. Here, we provide R codes to reproduce our simulation study and replicate our data analysis using data about conditional cash transfer (CCT) programs.

Simulation Study

  • DataGeneratingModels.R

This R file includes data generating codes for data from a multisite randomized trial with a cluster-level unmeasured covariate.

  • Qlearn.R

This R file includes a function named Qlearn to implement our proposed modifcations for Q-learning as well as the baseline Q-learning method.

  • SimulationCodes.R

This R file includes simulation codes with our proposed modifcations for Q-learning and weighting methods where the parameter beta1 represents the cofficient of a cross-level interaction effect between treatment status and a cluster-level unmeasured covariate. For more information on simulation condtions, please refer to our paper.

CCT Data Study

  • Data on conditional cash transfer (CCT) programs

For our empirical analysis, we used data collected by Barrera-Osorio et al. (2011). The data can be downloaded from openICPSR by clicking here. For more information on the data, please refer to the detiled report by Barrera-Osorio et al. (2011) and the codebook provided here.

  • CCTdataAnalysisCodes.R

This R file can be used to replicate our data analysis.

Please note that these supplemental materials are provided for the purpose of reproducibility and should be used in accordance with academic ethical guidelines. Any reference to these materials in your work should properly cite the original sources.

Owner

  • Name: Youmi Suk
  • Login: youmisuk
  • Kind: user
  • Location: New York, NY
  • Company: Teachers College, Columbia University

Assistant Professor of Applied Statistics

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Suk"
  given-names: "Youmi"
  orcid: "http://orcid.org/0000-0003-0316-6201"
- family-names: "Park"
  given-names: "Chan"
  orcid: "https://orcid.org/0000-0001-8315-1522"
title: "R Codes for Designing Optimal, Data-Driven Policies from Multisite Randomized Trials"
version: 
doi: 
date-released: 2023-01-20
url: "https://github.com/youmisuk/multisiteOTR"

GitHub Events

Total
Last Year