TitanCNA

Analysis of subclonal copy number alterations (CNA) and loss of heterozygosity (LOH) in cancer

https://github.com/gavinha/titancna

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
    2 of 15 committers (13.3%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.2%) to scientific vocabulary

Keywords

10x-genomics copy-number-variation genome-sequencing hmm tumor-heterogeneity

Keywords from Contributors

bioconductor-package genomics gene bioinformatics proteomics metabolomics ncbi-geo mass-spectrometry tracking particles
Last synced: 9 months ago · JSON representation

Repository

Analysis of subclonal copy number alterations (CNA) and loss of heterozygosity (LOH) in cancer

Basic Info
  • Host: GitHub
  • Owner: gavinha
  • License: gpl-3.0
  • Language: R
  • Default Branch: master
  • Homepage:
  • Size: 10.6 MB
Statistics
  • Stars: 97
  • Watchers: 3
  • Forks: 37
  • Open Issues: 27
  • Releases: 0
Topics
10x-genomics copy-number-variation genome-sequencing hmm tumor-heterogeneity
Created over 11 years ago · Last pushed about 5 years ago
Metadata Files
Readme License

README.md

Build Status

TitanCNA

TitanCNA a R/Bioconductor package for analyzing subclonal copy number alterations (CNA) and loss of heterozygosity (LOH) in whole genome and exome sequencing of tumours.

Ha, G., et al. (2014). TITAN: Inference of copy number architectures in clonal cell populations from tumour whole genome sequence data. Genome Research, 24: 1881-1893. (PMID: 25060187)

Contact

Gavin Ha
Fred Hutchinson Cancer Research Center
contact: gavinha@gmail.com or gha@fredhutch.org
Date: May 30, 2019
Website: GavinHaLab.org

Table of Contents

Links

Snakemake Workflow: https://github.com/gavinha/TitanCNA/tree/master/scripts/snakemake
10X Snakemake Workflow: https://github.com/gavinha/TitanCNA10Xsnakemake
Google Groups: https://groups.google.com/forum/#!forum/titancna
Publication in Genome Research: http://genome.cshlp.org/content/24/11/1881

News

(See NEWS for previous version notes)

May 30, 2019 - TitanCNA version 1.23.1

Addressed the issue of RangedData being deprecated by converting code to use GRanges from the GenomicRanges package. New or modified functions:
- wigToGRanges: to load WIG files and store in GRanges object. - correctIntegerCN(): performs allelic copy number (major/minor CN) adjustment.

August 9, 2018

Improved parameter inference by handling errors and allowing EM to continue until convergence. This fixes runs that previously would fail because samples had very low tumor content.

July 26, 2018

Snakemake workflow for 10X Genomics whole genome sequencing data is now included in another Git repo. https://github.com/gavinha/TitanCNA10Xsnakemake

TitanCNA version 1.17.1 changes

1) New functions: - correctIntegerCN(): recomputes high-level copy number that is capped by the maximum CN state. Performs two tasks - (1) correct log ratio based on purity and ploidy, and then convert to decimal CN value; (2) Correct bins and segments in which the original predicted integer copy number was assigned the maximum CN state; bins and segments for all of chromosome X are also corrected, if provided in the input.

2) Modified functions: - plotSegmentMedians() and plotCNlogRByChr(): includes argument to show color-coding for corrected copy number; defaults to TRUE for this argument.

3) Removed functions/manual/dependencies: - extractAlleleReadCounts()
- Rsamtools dependency

TitanCNA version 1.15.0 changes

1) 10X Genomics analysis - Please see scripts for instructions on running the 10X Genomics analysis.

2) New script to help select optimal solutions.
Please see scripts/R_scripts

3) Added snakemake pipeline for entire TITAN workflow
Please see scripts/snakemake.

4) New function: - plotSegmentMedians() - loadHaplotypeAlleleCounts(): loads input allele counts with phasing information - plotHaplotypeFraction(): results from 10X Genomics WGS data with phasing of haplotype blocks

5) Modified features (no changes for user-accessible functions): - updateParameters: coordinate descent estimate of ploidy update uses previously estimated normal parameter from the same corodinate descent iteration ; leads to faster convergence

Installation

Install TitanCNA R package from github

From within R-3.3.2 or higher,
install.packages("devtools") library(devtools) install_github("gavinha/TitanCNA")

Install TitanCNA from Bioconductor

From within R-3.3.2 or higher,
if (!requireNamespace("BiocManager", quietly=TRUE)) install.packages("BiocManager") BiocManager::install("TitanCNA")

Install other dependencies

  1. Install the HMMcopy suite Please follow instructions on the HMMcopy GitHub https://github.com/shahcompbio/hmmcopy_utils.

  2. Install ichorCNA Please follow instructions on the ichorCNA GitHub Wiki https://github.com/broadinstitute/ichorCNA.

Usage

R scripts are provided to run the R component of the TITAN analysis using the TitanCNA R/Bioconductor package.
Please go to the scripts directory and look at the README there for more details.

Snakemake workflow

A snakemake is also provided in this repo.
This workflow will run the TITAN a set of tumour-normal pairs, starting from the BAM files and generating TitanCNA outputs. It will also perform model selection at the end of the workflow to choose the optimal ploidy and clonal cluster solutions.

Vignette in TitanCNA R package

The PDF of the vignette can be accessed from R library(TitanCNA) browseVignettes(package = "TitanCNA") The path of the file can also be located using pathToInstall <- system.file(package = "TitanCNA") pathToPdf <- paste0(pathToInstall, "/int/doc/TitanCNA.pdf) The example provided will reproduce Figure 1 in the manuscript. However, it will be slightly different because the example is only based on the analysis of chr2, not genome-wide.

Acknowledgements

TitanCNA was developed by Gavin Ha while in the laboratories of Sohrab Shah (sshah@bccrc.ca) and Sam Aparicio (saparicio@bccrc.ca) at the Dept of Molecular Oncology, BC Cancer Agency, Vancouver, Canada.
Yikan Wang and Daniel Lai have contributed code and discussions to this project.
The KRONOS TITAN workflow was developed by Diljot Grewal (dgrewal@bccrc.ca) and Jafar Taghiyar (jtaghiyar@bccrc.ca).
HMMcopy was co-developed by Daniel Lai and Gavin Ha.

TitanCNA was inspired by existing methods including OncoSNP and PyClone

Software License

License: GPLv3

TitanCNA R code is open source and R/Bioconductor package is under GPLv3. This applies to the v1.9.0 and all subsequent versions within and obtained from Bioconductor.

Users who are using TitanCNA earlier than v1.9.0 not for the purpose of academic research should contact gha@fredhutch.org, sshah@bccrc.ca, and prebstein@bccancer.bc.ca to inquire about previous licensing.

Owner

  • Name: Gavin Ha
  • Login: gavinha
  • Kind: user
  • Company: Fred Hutchinson Cancer Research Center

GitHub Events

Total
  • Issues event: 1
  • Watch event: 3
  • Fork event: 1
Last Year
  • Issues event: 1
  • Watch event: 3
  • Fork event: 1

Committers

Last synced: about 1 year ago

All Time
  • Total Commits: 265
  • Total Committers: 15
  • Avg Commits per committer: 17.667
  • Development Distribution Score (DDS): 0.343
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Gavin Ha g****a@b****g 174
Gavin Ha g****a@b****a 38
Luca Beltrame l****e@m****t 12
Dan Tenenbaum d****a@f****g 10
annahoge 3****e 6
chapmanb c****b@5****m 5
Bioconductor Git-SVN Bridge b****c@b****g 4
Herve Pages h****s@f****g 4
Dylan Hennessey d****s@u****a 2
Hervé Pagès h****s@f****g 2
Nitesh Turaga n****a@g****m 2
vobencha v****a@g****m 2
vobencha v****n@r****g 2
LiNk-NY m****9@g****m 1
Marc Carlson m****n@f****g 1

Issues and Pull Requests

Last synced: about 1 year ago

All Time
  • Total issues: 82
  • Total pull requests: 17
  • Average time to close issues: 5 months
  • Average time to close pull requests: 3 months
  • Total issue authors: 47
  • Total pull request authors: 10
  • Average comments per issue: 2.32
  • Average comments per pull request: 0.35
  • Merged pull requests: 10
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • lbeltrame (9)
  • fpbarthel (8)
  • aleighbrown (5)
  • lounsburynw (5)
  • ysbioinfo (4)
  • gavinha (3)
  • pchamely (3)
  • lijinsgithub (2)
  • thek71 (2)
  • d-henness (2)
  • elifirem (2)
  • jgockley62 (2)
  • lima1 (1)
  • kieranrcampbell (1)
  • ilykos (1)
Pull Request Authors
  • chapmanb (6)
  • annahoge (2)
  • lbeltrame (2)
  • aleighbrown (1)
  • roryk (1)
  • wayneliuq (1)
  • d-henness (1)
  • setgree (1)
  • gaog94 (1)
  • ZhaoxiangSimonCai (1)
Top Labels
Issue Labels
enhancement (3)
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • bioconductor 27,078 total
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 5
  • Total maintainers: 1
bioconductor.org: TitanCNA

Subclonal copy number and LOH prediction from whole genome sequencing of tumours

  • Versions: 5
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 27,078 Total
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Forks count: 1.1%
Stargazers count: 2.3%
Average: 7.1%
Downloads: 32.1%
Maintainers (1)
Last synced: 10 months ago

Dependencies

DESCRIPTION cran
  • R >= 3.5.1 depends
  • GenomeInfoDb >= 1.8.7 imports
  • GenomicRanges >= 1.24.3 imports
  • IRanges >= 2.6.1 imports
  • VariantAnnotation >= 1.18.7 imports
  • data.table >= 1.10.4 imports
  • dplyr >= 0.5.0 imports
  • foreach >= 1.4.3 imports