VaSP

Quantification and Visualization of Variations of Splicing in Population

https://github.com/yuhuihui2011/vasp

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.9%) to scientific vocabulary

Keywords

3s-scores alternative-splicing ballgown rna-seq splicing sqtl statistics visualization

Keywords from Contributors

bioconductor-package gene proteomics
Last synced: 6 months ago · JSON representation

Repository

Quantification and Visualization of Variations of Splicing in Population

Basic Info
  • Host: GitHub
  • Owner: yuhuihui2011
  • Language: R
  • Default Branch: master
  • Homepage:
  • Size: 8.98 MB
Statistics
  • Stars: 3
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Topics
3s-scores alternative-splicing ballgown rna-seq splicing sqtl statistics visualization
Created almost 6 years ago · Last pushed about 4 years ago
Metadata Files
Readme

README.md

logo

VaSP: Quantification and Visualization of Variations of Splicing in Population

Release in Bioc platform Bioconductor-release Build Status update <!-- badges: end -->


Table of contents

Introduction

VaSP is an R package for discovery of genome-wide variable alternative splicing events from short-read RNA-seq data and visualizations of gene splicing information for publication-quality multi-panel figures. (Warning: The visualizing function is removed due to the dependent package Sushi deprecated. If you want to use it, please change back to an older version.)

Figure 1. Overview of VaSP. (A). The workflow and functions of VaSP. The input is an R data object ballgown (see ?ballgown) produced by a standard RNA-seq data analysis protocol, including mapping with HISAT2, assembling with StringTie, and collecting expression information with R package Ballgown. VaSP calculates the Single Splicing Strength (3S) scores for all splicing junctions in the genome (?spliceGenome) or in a particular gene (?spliceGene), identifies genotype-specific splicing (GSS) events (?BMfinder), and displays differential splicing information (?splicePlot. This function is currently deprecated). The 3S scores can be also used for other analyses, such as differential splicing analysis or splicing QTL identification. (B). VaSP estimates 3S scores based on junction-read counts normalized by gene-level read coverage. In this example, VaSP calculates the splicing scores of four introns in a gene X with two transcript isoforms. Only the fourth intron is a full usage intron excised by both the two isoforms and the other three are alternative donor site (AltD) sites or Intron Retention (IntronR), respectively. (C). Visualization of splicing information in gene MSTRG.183 (LOCOs01g03070), whole gene without splicing scores. (D). Visualization of differential splicing region of the gene MSTRG.183 with splicing score displaying. In C and D, the y-axes are read depths and the arcs (lines between exons) indicate exon-exon junctions (introns). The dotted arcs indicate no junction-reads spanning the intron (3S = 0) and solid arcs indicate 3S > 0. The transcripts labeled beginning with ‘LOCOs’ indicate annotated transcripts by reference genome annotation and the ones beginning with “MSTRG” are transcripts assembled by StringTie. (Yu et al., 2021)

Installation

Start R (>= 4.0) and run:

{r,eval=FALSE} if (!requireNamespace("BiocManager", quietly = TRUE)) install.packages("BiocManager") BiocManager::install("VaSP") vignette('VaSP')

If you use an older version of R (>= 3.5), enter:

{r,eval=FALSE} BiocManager::install("yuhuihui2011/VaSP", build_vignettes=TRUE) vignette('VaSP')

Data input

Users need to follow the manual of R package Ballgown (https://github.com/alyssafrazee/ballgown) to creat a ballgown object as an input for the VaSP package. See ?ballgown for detailed information on creating Ballgown objects. The object can be stored in a .RDate file by save() . Here is an example of constructing rice.bg object from HISAT2+StringTie output

{r,eval=FALSE} library(VaSP) ?ballgown path<-system.file('extdata', package='VaSP') rice.bg<-ballgown(samples = list.dirs(path = path,recursive = F) )

Quick start

Calculate 3S (Single Splicing Strength) scores, find GSS (genotype-specific splicing) events and display the splicing information.

  • Calculating 3S scores:
library(VaSP)
#> Loading required package: ballgown
#> 
#> Attaching package: 'ballgown'
#> The following object is masked from 'package:base':
#> 
#>     structure
data(rice.bg)
?rice.bg
rice.bg
#> ballgown instance with 33 transcripts and 6 samples
score<-spliceGene(rice.bg, gene="MSTRG.183", junc.type = "score")
tail(round(score,2),2)
#>    Sample_027 Sample_042 Sample_102 Sample_137 Sample_237 Sample_272
#> 58          0       0.02       0.22       0.23       0.23       0.23
#> 59          0       0.00       0.00       0.12       0.12       0.12
  • Discovering GSS:
gss <- BMfinder(score, cores = 1) 
#> Warning in BMfinder(score, cores = 1): sample size < 30, the result should be
#> further tested!
#> ---BMfinder: 16 features * 6 samples
#> ---Completed:  a total of 4 bimodal distrubition features found!
gss
#>    Sample_027 Sample_042 Sample_102 Sample_137 Sample_237 Sample_272
#> 55          2          2          1          1          1          1
#> 57          1          1          2          2          2          2
#> 58          1          1          2          2          2          2
#> 59          1          1          1          2          2          2
  • Extracing intron information
gss_intron<-structure(rice.bg)$intron
(gss_intron<-gss_intron[gss_intron$id%in%rownames(gss)])
#> GRanges object with 4 ranges and 2 metadata columns:
#>       seqnames          ranges strand |        id transcripts
#>          <Rle>       <IRanges>  <Rle> | <integer> <character>
#>   [1]     Chr1 1179011-1179226      - |        55       15:16
#>   [2]     Chr1 1179011-1179134      - |        57          17
#>   [3]     Chr1 1179011-1179110      - |        58          18
#>   [4]     Chr1 1179011-1179106      - |        59          19
#>   -------
#>   seqinfo: 1 sequence from an unspecified genome; no seqlengths
range(gss_intron)
#> GRanges object with 1 range and 0 metadata columns:
#>       seqnames          ranges strand
#>          <Rle>       <IRanges>  <Rle>
#>   [1]     Chr1 1179011-1179226      -
#>   -------
#>   seqinfo: 1 sequence from an unspecified genome; no seqlengths
  • Showing the splicing information (this function is currently deprecated)
splicePlot(rice.bg,gene='MSTRG.183',samples = sampleNames(rice.bg)[c(1,3,5)],
           start = 1179000, end = 1179300)
#> [1] "yes"

Features

Detailed usage examples are available in the Vignette.

Contributors

License

The code is freely available under the GPL (>= 2.0) license

Citation

Yu, H., Du, Q., Campbell, M., Yu, B., Walia, H. and Zhang, C. (2021), Genome‐wide discovery of natural variation in pre‐mRNA splicing and prioritising causal alternative splicing to salt stress response in rice. New Phytol. 230: 1273-1287. https://doi.org/10.1111/nph.17189

Owner

  • Name: Huihui Yu
  • Login: yuhuihui2011
  • Kind: user

GitHub Events

Total
Last Year

Committers

Last synced: over 2 years ago

All Time
  • Total Commits: 93
  • Total Committers: 4
  • Avg Commits per committer: 23.25
  • Development Distribution Score (DDS): 0.441
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
yuhuihui2011 6****1 52
yuhuihui2011 y****1@f****m 32
Nitesh Turaga n****a@g****m 8
Hervé Pagès h****b@g****m 1
Committer Domains (Top 20 + Academic)

Packages

  • Total packages: 1
  • Total downloads:
    • bioconductor 7,507 total
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 5
  • Total maintainers: 1
bioconductor.org: VaSP

Quantification and Visualization of Variations of Splicing in Population

  • Versions: 5
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 7,507 Total
Rankings
Dependent repos count: 0.0%
Dependent packages count: 0.0%
Forks count: 19.8%
Average: 26.8%
Stargazers count: 29.0%
Downloads: 85.5%
Maintainers (1)
Last synced: 6 months ago

Dependencies

DESCRIPTION cran
  • R >= 4.0 depends
  • ballgown * depends
  • GenomeInfoDb * imports
  • GenomicAlignments * imports
  • GenomicRanges * imports
  • IRanges * imports
  • Rsamtools * imports
  • S4Vectors * imports
  • cluster * imports
  • graphics * imports
  • matrixStats * imports
  • methods * imports
  • parallel * imports
  • stats * imports
  • knitr * suggests
  • rmarkdown * suggests