sofur

Worflow to detect, annotate, filter and vizualize somatic fusions from stranded paired-end RNA-seq from Illumina instruments.

https://github.com/bialimed/sofur

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.1%) to scientific vocabulary

Keywords

bioinformatics fusion rna-seq somatic-fusion-genes
Last synced: 6 months ago · JSON representation ·

Repository

Worflow to detect, annotate, filter and vizualize somatic fusions from stranded paired-end RNA-seq from Illumina instruments.

Basic Info
  • Host: GitHub
  • Owner: bialimed
  • License: gpl-3.0
  • Language: HTML
  • Default Branch: master
  • Homepage:
  • Size: 10.4 MB
Statistics
  • Stars: 1
  • Watchers: 2
  • Forks: 0
  • Open Issues: 0
  • Releases: 1
Topics
bioinformatics fusion rna-seq somatic-fusion-genes
Created about 4 years ago · Last pushed 10 months ago
Metadata Files
Readme License Citation

README.md

SoFuR: SOmatic FUsions from Rna

license DOI

Table of Contents

Description

This workflow detects, annotates and filters somatic fusions from stranded paired-end RNA-seq from Illumina's instruments.

Workflow steps

workflow

Installation

1. Download code

  git clone [--branch ${VESRSION}] --recurse-submodules git@github.com:bialimed/sofur.git

2. Install dependencies

  • conda (>=4.6.8):

    # Install conda wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x8664.sh && \ sh Miniconda3-latest-Linux-x8664.sh

    # Install mamba conda activate base conda install -c conda-forge mamba

More details on miniconda install here.

  • snakemake (>=5.4.2):

    mamba create -c conda-forge -c bioconda -n sofur snakemake==7.32.4 conda activate sofur pip install drmaa

More details on snakemake install here.

  • Install rules dependencies (cutadapt, bwa, ...):

    conda activate sofur snakemake \ --use-conda \ --conda-prefix ${applicationenvdir} \ --conda-create-envs-only --snakefile ${APPDIR}/Snakefile \ --configfile workflowparameters.yml

4. Download and prepare resources

SoFuR uses genome sequences, genes annotations, known fusions databanks (artifacts and pathogenics) and others standard resources. These files must be provided in your config.yml (see template). An detailed example for the creation process of these resources is available here.

5. Test install

  • From ${APP_DIR}/test/config/wf_config.yml set variables corresponding to databanks (see ## ${BANK}/... ##).

  • Launch test wit following command:

    conda activate sofur ${APPDIR}/test/launchwf.sh \ ${CONDAENVSDIR} \ ${WORKDIR} \ ${DRMAAPARAMS}

Example with scheduler SGE:

  export DRMAA_LIBRARY_PATH=${SGE_ROOT}/lib/linux-rhel7-x64/libdrmaa.so

  conda activate sofur
  ~/soft/sofur/test/launch_wf.sh \
    /work/$USER/conda_envs/envs \
    /work/$USER/test_sofur \
    ' -V -q {cluster.queue} -l pri_{cluster.queue}=1 -l mem={cluster.vmem} -l h_vmem={cluster.vmem} -pe smp {cluster.threads}'

Example with scheduler slurm:

  export DRMAA_LIBRARY_PATH=$SGE_ROOT/lib/linux-rhel7-x64/libdrmaa.so

  conda activate sofur
  ~/soft/sofur/test/launch_wf.sh \
    /work/$USER/conda_envs/envs \
    /work/$USER/test_sofur \
    ' --partition={cluster.queue} --mem={cluster.mem} --cpus-per-task={cluster.threads}'
  • See results in ${WORK_DIR}/report/run.html.

Usage

Copy ${APP_DIR}/config/workflow_parameters.tpl.yml in your current directory and change values before launching the following command:

conda activate sofur
snakemake \
  --use-conda \
  --conda-prefix ${application_env_dir} \
  --jobs ${nb_jobs} \
  --jobname "sofur.{rule}.{jobid}" \
  --latency-wait 100 \
  --snakefile ${application_dir}/Snakefile \
  --cluster-config ${application_dir}/config/cluster.json \
  --configfile workflow_parameters.yml \
  --directory ${out_dir} \
  > ${out_dir}/wf_log.txt \
  2> ${out_dir}/wf_stderr.txt

Outputs directory

The main elements of the outputs directory are the following:

out_dir/
├── ...
├── report/
|   ├── ...
|   ├── run.html
|   └── sample-A.html
├── stats/
|   ├── ...
|   └── multiqc/
|       ├── ...
|       └── multiqc_report.html
└── structural_variants/
    ├── ...
    ├── sample-A_unfiltered.vcf
    └── sample-A_filtered.vcf
  • The reports files containing filtered fusions list, annotations and viewers are in out_dir/report/{sample}.html.
  • The quality reports is in out_dir/stats/multiqc/multiqc_report.html. It resumes qualities of reads, distribution of alignments (between exon, intron, ...) and strandness analysis.
  • The annotated fusions in VCF format are kept in out_dir/structural_variants. *_unfiltered.vcf contain all the fusions, their annotations and their tags. *_filtered.vcf contain all the fusions, their annotations and their tags after filtering by rules provided by file declarated in configfile by filters.rules.

Performances

The performances are evaluated on real, synthetic and simulated datasets. The commands used in evaluation are stored in assessment. The results summarized in assessment/report.html.

Copyright

2019 Laboratoire d'Anatomo-Cytopathologie du CHU de Toulouse

Owner

  • Name: BIAliMed
  • Login: bialimed
  • Kind: organization
  • Location: France

BioInformatics for personALIsed MEDicine

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Escudié"
  given-names: "Frédéric"
  affiliation: "Centre Hospitalier Universitaire de Toulouse"
- family-names: "Ivashchenko"
  given-names: "Véronique"
  affiliation: "Centre Hospitalier Universitaire de Toulouse"
- family-names: "Brousset"
  given-names: "Pierre"
  affiliation: "Centre Hospitalier Universitaire de Toulouse"
title: "SoFuR"
version: 0.8.0
doi: 10.5281/zenodo.15376175
date-released: 2022-03-11
url: "https://github.com/bialimed/SoFuR"

GitHub Events

Total
  • Release event: 1
  • Public event: 1
  • Push event: 7
Last Year
  • Release event: 1
  • Public event: 1
  • Push event: 7