illinois-wellness-data
Public use data for the Illinois Workplace Wellness Study
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 4 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.0%) to scientific vocabulary
Keywords
Repository
Public use data for the Illinois Workplace Wellness Study
Basic Info
- Host: GitHub
- Owner: reifjulian
- License: cc0-1.0
- Language: Stata
- Default Branch: master
- Homepage: https://www.nber.org/workplacewellness/
- Size: 11 MB
Statistics
- Stars: 18
- Watchers: 4
- Forks: 18
- Open Issues: 0
- Releases: 1
Topics
Metadata Files
README.md
Illinois Workplace Wellness Study: Public Use Data Repository
Overview Examples Restricted-use data Terms of use References Update history
Overview
This repository contains public use data for the Illinois Workplace Wellness Study, a randomized controlled trial run by Damon Jones, David Molitor, and Julian Reif. These data can be used for research, teaching, and replicating published results. For information about the study, publications, the research team, pre-analysis plans, original survey instruments, and more, please visit the study website.
The following diagram summarizes the organization of the repository.
illinois-wellness-data # Public-use project folder
data # Public-use data
| codebooks # Codebooks
| csv # Datasets (CSV format)
| stata # Datasets (Stata format)
documentation # Public-use data documentation
replications # Replication folders
jamaim_2020 # Reif et al. (2020)
qje_2019 # Jones, Molitor, and Reif (2019)
The documentation folder describes the datasets. The replications folder provides Stata code that uses the public use data to replicate results for the following publications: - Jones, Molitor, and Reif (2019, Quarterly Journal of Economics) - Article | Replication code - Reif et al. (2020, JAMA Internal Medicine) - Article | Replication code
Examples
- Estimate the one-year causal effect of the Illinois workplace wellness program on medical spending (see Table 3 of Jones, Molitor, and Reif 2019)
stata
* Stata code
use "https://reifjulian.github.io/illinois-wellness-data/data/stata/claims.dta", clear
reg spend_0816_0717 treat [aw=covg_0816_0717], robust
```R
R code
library(haven) library(estimatr) mydata <- readdta("https://reifjulian.github.io/illinois-wellness-data/data/stata/claims.dta") lmrobust(spend08160717 ~ treat, data = mydata, weights = covg08160717, se_type = "HC1") ```
- Create a frequency histogram of average monthly hospital spending for the August 2016 - July 2017 time period
stata
* Stata code
use "https://reifjulian.github.io/illinois-wellness-data/data/stata/claims.dta", clear
histogram spendHosp_0816_0717, graphregion(fcolor(white)) freq
```R
R code
library(haven) mydata <- readdta("https://reifjulian.github.io/illinois-wellness-data/data/stata/claims.dta") hist(mydata$spendHosp0816_0717) ```
- Estimate the one-year causal effect of the Illinois workplace wellness program on beliefs about chances of high cholesterol (see Table 2 of Reif et al. (2020))
stata
* Stata code
use "https://reifjulian.github.io/illinois-wellness-data/data/stata/biometrics.dta", clear
reg self_cholesterol_2017 treat, absorb(Strata_biometrics) robust
```R
R code
library(haven) library(estimatr) mydata <- readdta("https://reifjulian.github.io/illinois-wellness-data/data/stata/biometrics.dta") lmrobust(selfcholesterol2017 ~ treat, fixedeffects = ~ Stratabiometrics, data = mydata, se_type = "HC1") ```
- Create a density plot of pre-period glucose levels, by sex
stata
* Stata code
use "https://reifjulian.github.io/illinois-wellness-data/data/stata/biometrics.dta", clear
kdensity glucose_2016, gen(x fx) nograph
kdensity glucose_2016 if male==0, gen(fxf) at(x) nograph
kdensity glucose_2016 if male==1, gen(fxm) at(x) nograph
label var fxf "Female"
label var fxm "Male"
twoway line fxf fxm x if x<200, graphregion(fcolor(white)) lcolor(red blue)
```R
R code
library(tidyverse) library(haven) mydata <- readdta("https://reifjulian.github.io/illinois-wellness-data/data/stata/biometrics.dta") %>% mutate(Group = factor(male, labels = c("Female", "Male"))) ggplot(mydata) + geomdensity(aes(glucose2016, color=Group, fill=Group), alpha = 0.1) + xlim(50, 200) + xlab("Glucose (mg/dL) (2016)") + thememinimal() ```
Restricted-use data
The restricted-use data include: - Anonymized identifiers that allow researchers to link individuals across different datasets - Uncensored variables such as salary - Raw responses to the online and on-site surveys
These data are currently hosted on a non-networked computer located at:
National Bureau of Economic Research
1050 Massachusetts Ave.
Cambridge, MA
02138
The non-networked computer includes Stata code that provides a full replication of the tables and figues from Jones, Molitor, and Reif (2019) and Reif et al. (2020).
Researchers interested in using the restricted-use data must: - Obtain consent from the study's principal investigators (Jones, Molitor, and Reif); - Obtain approval from the NBER Institutional Review Board; and - Sign a non-disclosure agreement with NBER
Inquiries regarding the restricted-use data can be directed to David Molitor.
Terms of use
These study data contain information collected on 4,834 RESEARCH SUBJECTS. The PROMISE OF CONFIDENTIALITY promises to these individuals that the information they provided will not be disseminated without their permission; that the fact that they participated in the study will not be disclosed; and that disseminated information will include no linkages to their identities. Names and other identifying information regarding these individuals are presumed to be confidential.
Any intentional identification of a RESEARCH SUBJECT or unauthorized disclosure of his or her confidential information violates the PROMISE OF CONFIDENTIALITY given to the providers of the information. Therefore, by downloading these data, you hereby agree:
- To not use these datasets for investigation of specific RESEARCH SUBJECTS
- To make no use of the identity of any RESEARCH SUBJECT discovered inadvertently, and to advise us of any such discovery (jreif@illinois.edu)
Any materials (books, articles, conference papers, theses, dissertations, reports, and other such publications) created that employ, reference, or otherwise use these data (in whole or in part) should credit this source. Please cite it as:
Jones, D., D. Molitor, and J. Reif. "What Do Workplace Wellness Programs Do? Evidence from the Illinois Workplace Wellness Study." Quarterly Journal of Economics, November 2019, 134(4): 1747-1791.
References
Jones, D., D. Molitor, and J. Reif. "What Do Workplace Wellness Programs Do? Evidence from the Illinois Workplace Wellness Study." Quarterly Journal of Economics, November 2019, 134(4): 1747-1791.
Reif, J., Chan, D., Jones, D., Payne, L., and Molitor, D. "Effects of a Workplace Wellness Program on Employee Health, Health Beliefs, and Medical Use: A Randomized Clinical Trial." JAMA Internal Medicine, May 2020, 180(7): 952-960.
Update history
February 1, 2022
- Added replication code for Reif et al. (2020)
- Added new dataset with biometrics variables
- Added utilization and diagnosis variables to claims dataset
October 1, 2020
- Initial release
Owner
- Name: Julian Reif
- Login: reifjulian
- Kind: user
- Location: Champaign, Illinois
- Company: University of Illinois
- Website: www.julianreif.com
- Twitter: julianreif
- Repositories: 7
- Profile: https://github.com/reifjulian
Associate Professor of Finance and Economics, Gies College of Business, University of Illinois
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this repository, please cite it as below."
authors:
- family-names: "Jones"
given-names: "Damon"
- family-names: "Molitor"
given-names: "David"
- family-names: "Reif"
given-names: "Julian"
title: "Illinois Workplace Wellness Study: Public Use Data Repository"
version: 1.0
doi: 10.5281/zenodo.4064859
date-released: 2020-10-01
url: "https://github.com/reifjulian/illinois-wellness-data"
preferred-citation:
type: article
authors:
- family-names: "Jones"
given-names: "Damon"
- family-names: "Molitor"
given-names: "David"
- family-names: "Reif"
given-names: "Julian"
journal: "Quarterly Journal of Economics"
month: 11
start: 1747 # First page number
end: 1791 # Last page number
title: "What Do Workplace Wellness Programs Do? Evidence from the Illinois Workplace Wellness Study"
issue: 4
volume: 134
year: 2019