Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.9%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Basic Info
  • Host: GitHub
  • Owner: I4Replication
  • Language: R
  • Default Branch: main
  • Size: 19 MB
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created 9 months ago · Last pushed 6 months ago
Metadata Files
Readme Citation

README.md

Reproducibility, Coding‑Error Detection & Robustness

Comparing Human‑Only, AI‑Assisted, and AI‑Led Teams on Assessing Research Reproducibility in Quantitative Social Science

Short‑version This repo contains 17 R scripts and 10 Stata scripts that jointly reproduce every table and figure in Brodeur et al. (2025).
Stata’s master.do installs its own dependencies and calls the “R‑only” modules via {rsource}, so you can stay inside Stata while still getting the full analysis.

MIT Licence  CC‑BY 4.0  Last Updated


Getting started

This project relies on the here package for resolving file paths. To ensure that here::here() points to the correct locations, open the R project file AI paper.Rproj before running any code. Avoid manually setting the working directory or sourcing scripts from outside the project folder—doing so can cause file loading to fail.

1  |  Project layout

├── data/ # Raw + processed datasets │ ├── AI games.xlsx │ ├── AI Games - Prompts Information.xlsx │ ├── AI games.dta │ └── AI games.rds ├── code/ │ ├── R code/ # 17 scripts (analysis & figures) │ └── Stata code/ # 10 scripts + master.do └── output/ ├── tables/ ├── figures/ └── logs/


2  |  Quick start

2.1 R workflow

Open AI paper.Rproj to launch the project. From the project root, run

r source("code/R code/master.R")

2.2 Stata workflow

You have two options:

| Option | Command | When to use | |--------|---------|-------------| | A. Launch from repo root |
stata cd "/path/to/AI paper" global path "`c(pwd)'" // project root do "code/Stata code/master.do" | Clone the repo and run directly. | | B. Keep original globals | Edit the global path line in master.do to your local clone location. | If you prefer hard‑coded paths. |

The Stata pipeline:

  1. Checks & installs required packages (reghdfe, ftools, …).
  2. Runs core Stata analysis (cleaning.do, main.do, etc.).
  3. Calls the remaining R‑only scripts with
    stata rsource … , rpath("/usr/local/bin/R")
    so results stay consistent across languages.

3  |  Dependencies

R ≥ 4.4.0

Auto‑installed via pacman::p_load(): haven, rmarkdown, readxl, dplyr, stringr, tidyr, forcats, janitor, lubridate, fixest, purrr, broom, tibble, car, margins, sandwich, lmtest, multcomp, kableExtra, ggplot2, patchwork, modelsummary, ggsurvfit, survRM2, xtable

Stata ≥ 17/MP

reghdfe, ftools, estout, ppmlhdfe, rsource


4  |  Script map (Stata ↔︎ R)

| Block | Stata script | R script (if any) | Purpose | |-------|--------------|-------------------|---------| | Cleaning | cleaning.do | cleaning.R | Raw → tidy | | Main OLS | main.do | main.R | Core regressions | | Logit/Poisson | logit poisson.do | logit poisson.R | Alt link functions | | Full controls | full controls.do | full controls.R | Max covariate set | | Software heterogeneity | softwares.do | softwares.R | Split by software | | Error shares | error shares.do | error shares.R | Outcome decomposition | | Study‑2 interaction | study 2.do | study 2.R | Wave‑specific effects | | Power analysis | — | power.R | Ex‑post power (R‑only) | | Branch differences | — | branches.R | Human vs AI | | Balance tables | — | balance.R | Covariate balance | | GPT skill | — | gpt skill.R | Skill heterogeneity | | Prompt usage | — | prompts.R | Prompt heterogeneity | | RMST | — | rmst.R | Restricted‑mean survival | | Time‑to‑event figs | time to first.do | time to first.R | Kaplan‑Meier curves | | KM figs | reproduction rates.do | reproduction rates.R | Rates across events | | Prompt distribution | prompt distribution.do | prompt distribution.R | Usage distribution |


5  |  Citation

If you build on this code or data, please cite:

Abel Brodeur, David Valenta, Alexandru Marcoci, Juan P. Aparicio, Derek Mikola, Bruno Barbarioli, Rohan Alexander, Lachlan Deer, Tom Stafford, Lars Vilhuber, Gunther Bensch et al. (2025).
“Comparing Human‑Only, AI‑Assisted, and AI‑Led Teams on Assessing Research Reproducibility in Quantitative Social Science.”
Working paper, under revision at Nature.

A machine‑readable CITATION.cff is included for convenience.


6  |  Licence

  • Code – © Abel Brodeur, 2025 • MIT Licence
  • Data & generated figures – CC‑BY 4.0

See the LICENSE and LICENSE-data files for full terms.


Last updated: 03 Jun 2025

Owner

  • Name: I4R
  • Login: I4Replication
  • Kind: organization
  • Email: instituteforreplication@gmail.com
  • Location: Canada

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this repository, please cite it as below."
title: "Comparing Human-Only, AI-Assisted, and AI-Led Teams on Assessing Research Reproducibility in Quantitative Social Science"
version: "0.9.0"
authors:
  - family-names: Brodeur
    given-names: Abel
    affiliation: "University of Ottawa"
    orcid: "0000-0002-XXXX-XXXX"
  - family-names: Valenta
    given-names: David
  - family-names: Marcoci
    given-names: Alexandru
  - family-names: Aparicio
    given-names: Juan P.
  - family-names: Mikola
    given-names: Derek
  - family-names: Barbarioli
    given-names: Bruno
  - family-names: Alexander
    given-names: Rohan
  - family-names: Deer
    given-names: Lachlan
  - family-names: Stafford
    given-names: Tom
  - family-names: Vilhuber
    given-names: Lars
  - family-names: Bensch
    given-names: Gunther
date-released: "2025-06-03"
license: "MIT"
repository-code: "https://github.com/<username>/AI-paper"

GitHub Events

Total
  • Delete event: 3
  • Push event: 17
  • Pull request event: 8
  • Create event: 4
Last Year
  • Delete event: 3
  • Push event: 17
  • Pull request event: 8
  • Create event: 4