https://github.com/bhklab/ovcsubtypes
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
✓Committers with academic emails
1 of 6 committers (16.7%) from academic institutions -
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (12.6%) to scientific vocabulary
Keywords
Repository
Basic Info
- Host: GitHub
- Owner: bhklab
- Language: R
- Default Branch: master
- Size: 53.3 MB
Statistics
- Stars: 1
- Watchers: 7
- Forks: 0
- Open Issues: 1
- Releases: 0
Topics
Metadata Files
README.md
Analysis and report generation for ovarian subtyping project
(this will be updated shortly to reflect new directory structure)
Instructions:
First, install MetaGxOvarian (contact Greg Chen or Deena Gendoo for the latest version). Install MetaGx with the commands
git clone https://github.com/bhklab/MetaGx.git
R CMD BUILD MetaGx
R CMD INSTALL MetaGx_0.9.9.tar.gz
Clone this repository:
git clone https://github.com/bhklab/OvcSubtypes.git
In the "reports" directory, there are three main knitr files: - reproduceResults.Rnw - classificationAcrossDatasets.Rnw - robustness.Rnw Note that classificationAcrossDatasets.Rnw produces the file "esets.not.rescaled.RData" which contains samples of high-grade serious ovarian cancer, with genes not z-score rescaled by gene. In order to ensure consistency in all analyses, this file is used by robustness.Rnw and batch.cluster.all.R.
The three knitr files can be run to produce pdf files within an R session with the command:
library(knitr)
knit("reproduceResults.Rnw")
knit("classificationAcrossDatasets.Rnw")
knit("robustness.Rnw")
(alternatively, they can be compiled in RStudio, making sure Preferences -> Sweave -> Weave Rnw files is set to knitr)
Note that several dependencies may need to be installed. Packages on CRAN can be installed with, for example,
install.packages("xtable")
Packages on Bioconductor can be installed with, for example,
source("https://bioconductor.org/biocLite.R")
biocLite("survcomp")
Within robustness.Rnw, there are two main components: - a reproduction of clustering algorithms and comparison to the cluster labels given in original supplementary texts (producing heatmaps demonstrating concordance between our implementation and the original results) - evaluation of robustness using prediction strength
The prediction strength analysis depends on the output of a fairly computationally intensive run, which produced the output directory jan20clusters. In order to read this output, cd into the "reports" directory and extract the compressed directory:
tar -xvfz jan20clusters.tar.gz
(Addendum) Performing cluster analysis for prediction strength analysis:
In the prediction strength analysis, we re-cluster each dataset with each algorithm 100 times. This takes a substantial amount of computation, so this is parallelized on an SGE cluster with the files "submitbatchclustering.sh" and "batch.cluster.all.R". Ensuring that "esets.not.rescaled.RData" exists, the command
./submit_batch_clustering.sh
submits an array job to the cluster, creating a directory with clustering output. This directory is read in "robustness_validation.Rnw".
This was most recently performed on January 20, producing the output jan20clusters.tar.gz
Owner
- Name: BHKLAB
- Login: bhklab
- Kind: organization
- Location: Toronto, Ontario, Canada
- Website: http://www.pmgenomics.ca/bhklab/
- Repositories: 168
- Profile: https://github.com/bhklab
The Haibe-Kains Laboratory @ Princess Margaret Cancer Centre
GitHub Events
Total
Last Year
Committers
Last synced: 8 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| gmchen | g****n@g****m | 187 |
| Natchar | N****r@a****a | 61 |
| lavkan | l****n@g****m | 42 |
| Benjamin Haibe-Kains | b****a@i****a | 10 |
| Levi Waldron | l****h@g****m | 2 |
| Deena | D****o | 1 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 8 months ago
All Time
- Total issues: 1
- Total pull requests: 2
- Average time to close issues: 5 days
- Average time to close pull requests: 1 minute
- Total issue authors: 1
- Total pull request authors: 1
- Average comments per issue: 7.0
- Average comments per pull request: 1.0
- Merged pull requests: 1
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- lwaldron (1)
Pull Request Authors
- lavkan (2)