paracetamol-for-aki-parist-trial-work-flow
Comprehensive, end-to-end R and R Markdown pipeline for the PARIST trial, covering raw data cleaning, RMST-based time-to-recovery analyses, biomarker trajectories, hepatotoxicity safety checks, and fully reproducible manuscript generation with CI automation.
https://github.com/gpaasi/paracetamol-for-aki-parist-trial-work-flow
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 3 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (14.0%) to scientific vocabulary
Repository
Comprehensive, end-to-end R and R Markdown pipeline for the PARIST trial, covering raw data cleaning, RMST-based time-to-recovery analyses, biomarker trajectories, hepatotoxicity safety checks, and fully reproducible manuscript generation with CI automation.
Basic Info
- Host: GitHub
- Owner: gpaasi
- License: other
- Language: R
- Default Branch: main
- Size: 3.59 MB
Statistics
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
PARIST-Trial Workflow • Severe-Malaria AKI Project
End-to-end, reproducible code + text for the manuscript
“Paracetamol for Renal Recovery in Severe Malaria (PARIST Trial)”
📑 Table of contents
- Project scope
- Folder structure
- Quick-start
- Re-running the full pipeline
- Continuous integration
- Authorship & contributions
- Data governance
- License & citation
- Contact
Project scope
This repository is the single source of truth for:
- De-identified analysis datasets
- R scripts and R Markdown notebooks
- Auto-generated figures & tables
- The manuscript in markdown/LaTeX (rendered to PDF & HTML)
- GitHub Actions CI to guarantee that analyses and rendering are repeatable
Why a public workflow?
Releasing code and de-identified data (+ exact package versions viarenv) fulfils FAIR-data principles, improves transparency, and accelerates follow-up research.
Folder structure
```
parist-trial-workflow/ ├── analysis/ # R scripts & notebooks │ ├── 01_data-cleaning.R │ ├── 02_descriptive.Rmd │ ├── 03_RMST_analysis.R <- primary analysis │ └── utils/plot_theme.R ├── data/ │ ├── raw/ # 🔒 not in git (see .gitignore) │ └── derived/ # de-identified CSVs shared openly ├── manuscript/ │ ├── PARIST_manuscript.Rmd │ ├── references.bib │ └── journal-template.tex ├── figures/ # auto-generated PNG/PDF/SVG ├── docs/ # rendered manuscript (CI artifact / GitHub Pages) ├── archive/ # original Word files & approvals ├── .github/workflows/ci.yml ├── renv.lock # pinned R package versions ├── LICENSE | CITATION.cff └── README.md # you are here
````
Quick-start
Clone and recreate the R environment
```bash
git clone https://github.com/
one-liner to restore exact packages
R -q -e "install.packages('renv'); renv::restore()" ````
Run the primary analysis
bash
Rscript analysis/03_RMST_analysis.R
Render the manuscript locally
bash
R -q -e "rmarkdown::render('manuscript/PARIST_manuscript.Rmd',
output_dir = 'docs')"
open docs/PARIST_manuscript.pdf
Re-running the full pipeline
| Stage | Script / tool | Output |
| --------------------------------- | --------------------- | ------------------------------------------------------------------ |
| 1 Import & clean | 01_data-cleaning.R | data/derived/parist_analysis_set.csv |
| 2 Descriptive stats | 02_descriptive.Rmd | figures/Table1.png |
| 3 Primary RMST + adjusted models | 03_RMST_analysis.R | figures/Fig2_RMST_curve.pdf, analysis/results/rmst_summary.csv |
The CI workflow executes exactly these steps on every push / PR to guarantee reproducibility.
Continuous integration
.github/workflows/ci.yml:
- Checks out the repo.
- Restores the
renvcache. - Runs analysis scripts; fails if any exit non-zero.
- Renders the manuscript to PDF.
- Uploads artifacts & (optionally) deploys
docs/to GitHub Pages.
Status badge at the top of this README shows the latest build.
Authorship & contributions
We adopt the CRediT taxonomy. Example (edit as needed):
| Author | Contribution(s) | | ------ | -------------------------------------------------------- | | A.B. | Conceptualization, Methodology, Writing – original draft | | C.D. | Formal analysis, Software, Visualization | | E.F. | Investigation, Data curation | | G.H. | Supervision, Funding acquisition |
➡️ How to contribute:
- Create a feature branch:
git checkout -b feat/<slug> - Commit with Conventional Commit prefix (
feat:,fix:,docs:…) - Open a Pull Request; CI must be green & 2 reviews obtained.
- Squash-merge to
main.
Data governance
| Data tier | Location | Policy |
| --------------------- | --------------------------- | --------------------------------- |
| Raw identifiable | Institution file-share only | Never pushed to GitHub |
| Derived de-identified | data/derived/ | Commit OK; shared under CC-BY 4.0 |
| Post-acceptance | Zenodo / Mendeley Data | DOI back-linked here |
Corresponding author stores the encryption key for raw ↔︎ derived mapping.
License & citation
- Code: MIT License – see
LICENSE. - Text & figures: Creative Commons CC-BY 4.0.
A ready-made CITATION.cff lets GitHub generate a citation snippet.
When citing, please use the DOI badge at the top of this README once minted.
Contact
Corresponding author: Dr Paasi George – [georgepaasi8@gmail.com)
For issues or feature requests open a GitHub Issue or start a Discussion.
Happy analysing – and thanks for advancing open, reproducible clinical science!
```
Owner
- Login: gpaasi
- Kind: user
- Repositories: 1
- Profile: https://github.com/gpaasi
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this repository, please cite it as below."
title: "PARIST-Pipeline: Reproducible workflow and data for the Paracetamol in Severe Malaria Renal Injury Study"
version: "1.0.0"
doi: ""
date-released: 2025-06-18
authors:
- family-names: Paasi
given-names: George
affiliation: "clinical trials unit, Mbale clinical research institute"
abstract: |
This repository contains all code, data descriptors, and documentation necessary to reproduce the analyses reported in the manuscript
"Paracetamol use and renal recovery in paediatric haemoglobinuric severe malaria: an RMST-based evaluation".
It includes data-wrangling notebooks, statistical scripts for RMST, biomarker and hepatotoxicity analyses, and figure-generation pipelines,
together with the cleaned, de‑identified dataset and supplementary materials.
license: "MIT"
repository-code: "https://github.com/USERNAME/parist-pipeline"
keywords:
- malaria
- acute kidney injury
- paediatrics
- paracetamol
- RMST
- reproducible research
- clinical trial
GitHub Events
Total
- Release event: 5
- Push event: 10
- Create event: 6
Last Year
- Release event: 5
- Push event: 10
- Create event: 6