https://github.com/clinical-infection-research-uosheffield/development_natural_protective_immunity_streptococcus_pyogenes

https://github.com/clinical-infection-research-uosheffield/development_natural_protective_immunity_streptococcus_pyogenes

Science Score: 39.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 12 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.7%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: Clinical-Infection-Research-UoSheffield
  • Language: R
  • Default Branch: main
  • Size: 74.5 MB
Statistics
  • Stars: 1
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed about 1 year ago
Metadata Files
Readme

README.md

Early Life Serological Profiles and the Development of Natural Protective Humoral Immunity to Streptococcus pyogenes in a High Burden Setting

Code to reproduce analyses from the SpyCATS longitudinal study of immunity to Streptococcus pyogenes in The Gambia.


Description

This repository contains the code to reproduce analyses from the manuscript entitled:
"Early life serological profiles and the development of natural protective humoral immunity to Streptococcus pyogenes in a high burden setting."

  1. Data Anonymization:
    Data including antibody measurements, clinical events, and demographics have been anonymized to protect participant confidentiality.

  2. Analytical Methods:
    Analyses include mixed-effects logistic regression to model event probabilities and piecewise regression. Cox proportional hazards models are used to estimate risk across different outcomes, with results formatted into comprehensive tables and plots.


Data Anonymisation

  • IDs are anonymized by mapping them to randomly generated codes.
  • All dates are uniformly offset by a constant to preserve time intervals while concealing actual dates.
  • Exact ages are replaced with pseudo-ages randomly generated within defined age groups.

Upon publication, data to reproduce analyses will be made publicly available, hosted on Zenodo at https://doi.org/10.5281/zenodo.14887949


Repository Structure

. ├── README.md # Project documentation ├── data # Contains processed data files (empty initially) ├── R_output # Output from R scripts (figures, summaries) └── scripts # R scripts for data analysis and visualization - data: Contains processed data files.
- R_output: Contains output from R scripts, such as figures and summaries.
- scripts: Contains R scripts used for data analysis and visualization.


Prerequisites

Ensure you have R installed on your system. You will also need the librarian package version (1.8.1) to manage dependencies.

  • R Version: Requires R version 4.4.0 or later.
  • RStudio: Version 2024.04.2+764 (2024.04.2+764) of later
  • Operating System: The code has been tested on:
    • macOS 14.5

Typical installation and run time is several minutes.


Installing Dependencies

The following R packages are required for the analyses. The librarian package will ensure all dependencies are installed and loaded without needing to install each package individually:

  • broom (1.0.6)
  • broom.mixed (0.2.9.5)
  • CorrMixed (1.1)
  • corrplot (0.92)
  • cowplot (1.1.3)
  • devtools (2.4.5)
  • dplyr (1.1.4)
  • dunn.test (1.3.6)
  • flextable (0.9.6)
  • forcats (1.0.0)
  • FSA (0.9.5)
  • ggdist (3.3.2)
  • ggplot2 (3.5.1)
  • ggpubr (0.6.0)
  • grid (4.4.0)
  • gridExtra (2.3)
  • gtsummary (2.0.0)
  • inborutils (0.4.0)
  • librarian (1.8.1)
  • lme4 (1.1-35.5)
  • lmerTest (3.1-3)
  • mfp (1.5.4.1)
  • officer (0.6.6)
  • patchwork (1.2.0)
  • pheatmap (1.0.12)
  • psych (2.4.6.26)
  • splines (4.4.0)
  • survival (3.7-0)
  • tidyr (1.3.1)
  • tidyverse (2.0.0)
  • UpSetR (1.4.0)
  • wesanderson (0.3.7)

Instructions for Use

Import Data:

There are two ways to import the data into this repository:

🔁 Option 1: Run the automated script

Run the load_data.R script to automatically download and extract the anonymized dataset from the Zenodo depository.

source("load_data.R")

The data has been made publicly available upon publication via DOI: https://doi.org/10.5281/zenodo.14887949

🛠 Option 2: Manual download (fewer dependencies required)

  1. Go to the dataset's Zenodo page: https://doi.org/10.5281/zenodo.14887949
  2. Click the “Download all” button to download the full archive (e.g., 14887949.zip).
  3. Unzip the contents.
  4. Move the extracted folder or files into a new directory in the project directory called data/.

Your directory structure should now look like:

repo/ ├── data/ │   ├── file1.RDS │   ├── file2.RDS │   └── ... ├── scripts/ └── ...etc

Reproduce Analyses:

Work through the scripts sequentially to reproduce the analyses presented in the manuscript. The recommended order is as follows:

`` -01motherchildpairs.R -02bloodIgGbaseline.R -03bloodtitresaroundevents.R -04protection.R -04.1protectionresponsetoreviews.R -04.2responsetoreviewers10foldcrossvalidation.R -04.3responsetoreviewersmultiplethresholds.R -05Mtypespecificanalysis.R -05.1responsetoreviewersMtypespecificanalysis.R -05.2responsetoreviewerMprotection.R -06functionalassays.R`

```

Note that source data for 06functionalassays.R, the data used in this analysis is owned by GSK Vaccines and is not publicly available, however the code is visible to understand the analyses performed

Analyses should be viewed directly within RStudio. The output will be displayed through plots, tables, and console outputs within the RStudio environment, or outputted to a directory called "R_output"


Review Results:

To review results, load the scripts in RStudio and run them interactively. The figures, tables, and statistical summaries will be displayed directly in the Plots and Viewer panels or within the Console.


License

This repository is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license.

You are free to:

  • Share: Copy and redistribute the material in any medium or format.

Under the following terms:

  • Attribution: You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
  • NonCommercial: You may not use the material for commercial purposes.
  • NoDerivatives: If you remix, transform, or build upon the material, you may not distribute the modified version.

For full license details, see:
Creative Commons License

Owner

  • Name: Clinical Infection Research UoSheffield
  • Login: Clinical-Infection-Research-UoSheffield
  • Kind: organization
  • Email: ....@sheffield.ac.uk
  • Location: United Kingdom

Clinical-oriented research into Microorganisms, the Human body and Immunity internationally and in the UK

GitHub Events

Total
  • Watch event: 1
  • Push event: 31
  • Pull request event: 2
  • Create event: 4
Last Year
  • Watch event: 1
  • Push event: 31
  • Pull request event: 2
  • Create event: 4