cox_statistical-scaling
Public repository containing the analysis code and figure for the p-value scaling project, led by Dr. Dezerae Cox
https://github.com/dezeraecox-manuscripts/cox_statistical-scaling
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 8 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (16.1%) to scientific vocabulary
Repository
Public repository containing the analysis code and figure for the p-value scaling project, led by Dr. Dezerae Cox
Basic Info
- Host: GitHub
- Owner: dezeraecox-manuscripts
- License: mit
- Language: Python
- Default Branch: master
- Size: 121 KB
Statistics
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 2
Metadata Files
README.md
COX_Statistical-scaling
This repository contains the analysis code associated with the Statistical Scaling project, led by Dr. Dezerae Cox. This manuscript has been submitted for publication under the title "PERCEPT: Replacing binary p-value thresholding with scaling for more nuanced identification of sample differences "
🚨 Keen to see PERCEPT in action? Want to try it out with your data? Click on the
launch binderbadge above to open an interactive walk through. Alternatively, you can find a ready-to-use spreadsheet template for manual applications in theexamplesfolder.
Prerequisites
This analysis assumes a standard installation of Python 3 (=> 3.6). For specific package requirements, see the environment.yml file, or create a new conda environment containing all packages by running conda create -f environment.yml.
Raw data and databases
In the case of the published dataset comparisons, raw data in the form of supplementary datasets from each publication can be accessed using the raw_data script. Unfortunately, due to journal subscription requirements, in some instances journal access is required. In these cases datasets will need to be manually downloaded for which the URL information is available via the script and summarised in Supplementary Table 1.
Novel simulated and digitised datasets generated for this study have also been uploaded as an open-access Zenodo dataset available here.
Finally, various public databases (e.g. UniProt) were queried as indicated in the accompanying manuscript for which access protocols are also provided in the respective analysis workflow where appropriate.
Workflow
To reproduce analyses presented in the manuscript, where processing order is important for individual analyses scripts have been numbered and should be run in order before unnumbered counterparts. Otherwise, there is no interdependence between the analysis for different data types (simulated, omics, biomarkers).
In addition, each figure can be generated using the scripts provided under the src/figures folder.
A note on randomisation
By it's very nature, randomisation produces different datasets upon each run of the code. In this case, the numpy seed value has been set to ensure reproducibility between code runs in which random numbers are generated. By fixing the seed, the same sequence of random numbers can be generated each time the code is run, enabling consistent results and facilitating debugging, testing, and sharing of code and data across different environments.
Owner
- Name: manuscripts
- Login: dezeraecox-manuscripts
- Kind: organization
- Repositories: 1
- Profile: https://github.com/dezeraecox-manuscripts
Citation (citation.cff)
cff-version: 1.0.0
message: "If you use this respository, please cite the manuscript alongside the following:"
authors:
- family-names: Cox
given-names: Dezerae
orcid: https://orcid.org/0000-0002-5345-8360
- family-names: Hatters
given-names: Danny M
orcid: https://orcid.org/0000-0002-9965-2847
title: "COX_Statistical-scaling"
version: 0.1
doi: https://doi.org/10.5281/zenodo.8128038
date-released: 2023-07-09
url: "https://github.com/dezeraecox-manuscripts/COX_Statistical-scaling"