https://github.com/clinical-infection-research-uosheffield/development_natural_protective_immunity_streptococcus_pyogenes
Science Score: 39.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 12 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.7%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: Clinical-Infection-Research-UoSheffield
- Language: R
- Default Branch: main
- Size: 74.5 MB
Statistics
- Stars: 1
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Early Life Serological Profiles and the Development of Natural Protective Humoral Immunity to Streptococcus pyogenes in a High Burden Setting
Code to reproduce analyses from the SpyCATS longitudinal study of immunity to Streptococcus pyogenes in The Gambia.
Description
This repository contains the code to reproduce analyses from the manuscript entitled:
"Early life serological profiles and the development of natural protective humoral immunity to Streptococcus pyogenes in a high burden setting."
Data Anonymization:
Data including antibody measurements, clinical events, and demographics have been anonymized to protect participant confidentiality.Analytical Methods:
Analyses include mixed-effects logistic regression to model event probabilities and piecewise regression. Cox proportional hazards models are used to estimate risk across different outcomes, with results formatted into comprehensive tables and plots.
Data Anonymisation
- IDs are anonymized by mapping them to randomly generated codes.
- All dates are uniformly offset by a constant to preserve time intervals while concealing actual dates.
- Exact ages are replaced with pseudo-ages randomly generated within defined age groups.
Upon publication, data to reproduce analyses will be made publicly available, hosted on Zenodo at https://doi.org/10.5281/zenodo.14887949
Repository Structure
.
├── README.md # Project documentation
├── data # Contains processed data files (empty initially)
├── R_output # Output from R scripts (figures, summaries)
└── scripts # R scripts for data analysis and visualization
- data: Contains processed data files.
- R_output: Contains output from R scripts, such as figures and summaries.
- scripts: Contains R scripts used for data analysis and visualization.
Prerequisites
Ensure you have R installed on your system. You will also need the librarian package version (1.8.1) to manage dependencies.
- R Version: Requires R version 4.4.0 or later.
- RStudio: Version 2024.04.2+764 (2024.04.2+764) of later
- Operating System: The code has been tested on:
- macOS 14.5
- macOS 14.5
Typical installation and run time is several minutes.
Installing Dependencies
The following R packages are required for the analyses. The librarian package will ensure all dependencies are installed and loaded without needing to install each package individually:
broom (1.0.6)broom.mixed (0.2.9.5)CorrMixed (1.1)corrplot (0.92)cowplot (1.1.3)devtools (2.4.5)dplyr (1.1.4)dunn.test (1.3.6)flextable (0.9.6)forcats (1.0.0)FSA (0.9.5)ggdist (3.3.2)ggplot2 (3.5.1)ggpubr (0.6.0)grid (4.4.0)gridExtra (2.3)gtsummary (2.0.0)inborutils (0.4.0)librarian (1.8.1)lme4 (1.1-35.5)lmerTest (3.1-3)mfp (1.5.4.1)officer (0.6.6)patchwork (1.2.0)pheatmap (1.0.12)psych (2.4.6.26)splines (4.4.0)survival (3.7-0)tidyr (1.3.1)tidyverse (2.0.0)UpSetR (1.4.0)wesanderson (0.3.7)
Instructions for Use
Import Data:
There are two ways to import the data into this repository:
🔁 Option 1: Run the automated script
Run the load_data.R script to automatically download and extract the anonymized dataset from the Zenodo depository.
source("load_data.R")
The data has been made publicly available upon publication via DOI: https://doi.org/10.5281/zenodo.14887949
🛠 Option 2: Manual download (fewer dependencies required)
- Go to the dataset's Zenodo page: https://doi.org/10.5281/zenodo.14887949
- Click the “Download all” button to download the full archive (e.g.,
14887949.zip). - Unzip the contents.
- Move the extracted folder or files into a new directory in the project directory called
data/.
Your directory structure should now look like:
repo/
├── data/
│ ├── file1.RDS
│ ├── file2.RDS
│ └── ...
├── scripts/
└── ...etc
Reproduce Analyses:
Work through the scripts sequentially to reproduce the analyses presented in the manuscript. The recommended order is as follows:
``
-01motherchildpairs.R
-02bloodIgGbaseline.R
-03bloodtitresaroundevents.R
-04protection.R
-04.1protectionresponsetoreviews.R
-04.2responsetoreviewers10foldcrossvalidation.R
-04.3responsetoreviewersmultiplethresholds.R
-05Mtypespecificanalysis.R
-05.1responsetoreviewersMtypespecificanalysis.R
-05.2responsetoreviewerMprotection.R
-06functionalassays.R`
```
Note that source data for 06functionalassays.R, the data used in this analysis is owned by GSK Vaccines and is not publicly available, however the code is visible to understand the analyses performed
Analyses should be viewed directly within RStudio. The output will be displayed through plots, tables, and console outputs within the RStudio environment, or outputted to a directory called "R_output"
Review Results:
To review results, load the scripts in RStudio and run them interactively. The figures, tables, and statistical summaries will be displayed directly in the Plots and Viewer panels or within the Console.
License
This repository is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license.
You are free to:
- Share: Copy and redistribute the material in any medium or format.
Under the following terms:
- Attribution: You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- NonCommercial: You may not use the material for commercial purposes.
- NoDerivatives: If you remix, transform, or build upon the material, you may not distribute the modified version.
For full license details, see:
Creative Commons License
Owner
- Name: Clinical Infection Research UoSheffield
- Login: Clinical-Infection-Research-UoSheffield
- Kind: organization
- Email: ....@sheffield.ac.uk
- Location: United Kingdom
- Website: https://www.sheffield.ac.uk/smph/research/themes/infection-and-immunity
- Repositories: 1
- Profile: https://github.com/Clinical-Infection-Research-UoSheffield
Clinical-oriented research into Microorganisms, the Human body and Immunity internationally and in the UK
GitHub Events
Total
- Watch event: 1
- Push event: 31
- Pull request event: 2
- Create event: 4
Last Year
- Watch event: 1
- Push event: 31
- Pull request event: 2
- Create event: 4