cox_statistical-scaling

Public repository containing the analysis code and figure for the p-value scaling project, led by Dr. Dezerae Cox

https://github.com/dezeraecox-manuscripts/cox_statistical-scaling

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 8 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (16.1%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Public repository containing the analysis code and figure for the p-value scaling project, led by Dr. Dezerae Cox

Basic Info
  • Host: GitHub
  • Owner: dezeraecox-manuscripts
  • License: mit
  • Language: Python
  • Default Branch: master
  • Size: 121 KB
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 2
Created almost 3 years ago · Last pushed about 2 years ago
Metadata Files
Readme License Citation

README.md

DOI DOI

Binder

COX_Statistical-scaling

This repository contains the analysis code associated with the Statistical Scaling project, led by Dr. Dezerae Cox. This manuscript has been submitted for publication under the title "PERCEPT: Replacing binary p-value thresholding with scaling for more nuanced identification of sample differences "

🚨 Keen to see PERCEPT in action? Want to try it out with your data? Click on the launch binder badge above to open an interactive walk through. Alternatively, you can find a ready-to-use spreadsheet template for manual applications in the examples folder.

Prerequisites

This analysis assumes a standard installation of Python 3 (=> 3.6). For specific package requirements, see the environment.yml file, or create a new conda environment containing all packages by running conda create -f environment.yml.

Raw data and databases

In the case of the published dataset comparisons, raw data in the form of supplementary datasets from each publication can be accessed using the raw_data script. Unfortunately, due to journal subscription requirements, in some instances journal access is required. In these cases datasets will need to be manually downloaded for which the URL information is available via the script and summarised in Supplementary Table 1.

Novel simulated and digitised datasets generated for this study have also been uploaded as an open-access Zenodo dataset available here.

Finally, various public databases (e.g. UniProt) were queried as indicated in the accompanying manuscript for which access protocols are also provided in the respective analysis workflow where appropriate.

Workflow

To reproduce analyses presented in the manuscript, where processing order is important for individual analyses scripts have been numbered and should be run in order before unnumbered counterparts. Otherwise, there is no interdependence between the analysis for different data types (simulated, omics, biomarkers).

In addition, each figure can be generated using the scripts provided under the src/figures folder.

A note on randomisation

By it's very nature, randomisation produces different datasets upon each run of the code. In this case, the numpy seed value has been set to ensure reproducibility between code runs in which random numbers are generated. By fixing the seed, the same sequence of random numbers can be generated each time the code is run, enabling consistent results and facilitating debugging, testing, and sharing of code and data across different environments.

Owner

  • Name: manuscripts
  • Login: dezeraecox-manuscripts
  • Kind: organization

Citation (citation.cff)

cff-version: 1.0.0
message: "If you use this respository, please cite the manuscript alongside the following:"
authors:
  - family-names: Cox
    given-names: Dezerae
    orcid: https://orcid.org/0000-0002-5345-8360
  - family-names: Hatters
    given-names: Danny M
    orcid: https://orcid.org/0000-0002-9965-2847
title: "COX_Statistical-scaling"
version: 0.1
doi: https://doi.org/10.5281/zenodo.8128038
date-released: 2023-07-09
url: "https://github.com/dezeraecox-manuscripts/COX_Statistical-scaling"

GitHub Events

Total
Last Year

Dependencies

environment.yml pypi