sofur
Worflow to detect, annotate, filter and vizualize somatic fusions from stranded paired-end RNA-seq from Illumina instruments.
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 3 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.1%) to scientific vocabulary
Keywords
Repository
Worflow to detect, annotate, filter and vizualize somatic fusions from stranded paired-end RNA-seq from Illumina instruments.
Basic Info
Statistics
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 1
Topics
Metadata Files
README.md
SoFuR: SOmatic FUsions from Rna
Table of Contents
Description
This workflow detects, annotates and filters somatic fusions from stranded paired-end RNA-seq from Illumina's instruments.
Workflow steps

Installation
1. Download code
git clone [--branch ${VESRSION}] --recurse-submodules git@github.com:bialimed/sofur.git
2. Install dependencies
conda (>=4.6.8):
# Install conda wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x8664.sh && \ sh Miniconda3-latest-Linux-x8664.sh
# Install mamba conda activate base conda install -c conda-forge mamba
More details on miniconda install here.
snakemake (>=5.4.2):
mamba create -c conda-forge -c bioconda -n sofur snakemake==7.32.4 conda activate sofur pip install drmaa
More details on snakemake install here.
Install rules dependencies (cutadapt, bwa, ...):
conda activate sofur snakemake \ --use-conda \ --conda-prefix ${applicationenvdir} \ --conda-create-envs-only --snakefile ${APPDIR}/Snakefile \ --configfile workflowparameters.yml
4. Download and prepare resources
SoFuR uses genome sequences, genes annotations, known fusions databanks (artifacts and pathogenics) and others standard resources. These files must be provided in your config.yml (see template). An detailed example for the creation process of these resources is available here.
5. Test install
From
${APP_DIR}/test/config/wf_config.ymlset variables corresponding to databanks (see## ${BANK}/... ##).Launch test wit following command:
conda activate sofur ${APPDIR}/test/launchwf.sh \ ${CONDAENVSDIR} \ ${WORKDIR} \ ${DRMAAPARAMS}
Example with scheduler SGE:
export DRMAA_LIBRARY_PATH=${SGE_ROOT}/lib/linux-rhel7-x64/libdrmaa.so
conda activate sofur
~/soft/sofur/test/launch_wf.sh \
/work/$USER/conda_envs/envs \
/work/$USER/test_sofur \
' -V -q {cluster.queue} -l pri_{cluster.queue}=1 -l mem={cluster.vmem} -l h_vmem={cluster.vmem} -pe smp {cluster.threads}'
Example with scheduler slurm:
export DRMAA_LIBRARY_PATH=$SGE_ROOT/lib/linux-rhel7-x64/libdrmaa.so
conda activate sofur
~/soft/sofur/test/launch_wf.sh \
/work/$USER/conda_envs/envs \
/work/$USER/test_sofur \
' --partition={cluster.queue} --mem={cluster.mem} --cpus-per-task={cluster.threads}'
- See results in
${WORK_DIR}/report/run.html.
Usage
Copy ${APP_DIR}/config/workflow_parameters.tpl.yml in your current directory
and change values before launching the following command:
conda activate sofur
snakemake \
--use-conda \
--conda-prefix ${application_env_dir} \
--jobs ${nb_jobs} \
--jobname "sofur.{rule}.{jobid}" \
--latency-wait 100 \
--snakefile ${application_dir}/Snakefile \
--cluster-config ${application_dir}/config/cluster.json \
--configfile workflow_parameters.yml \
--directory ${out_dir} \
> ${out_dir}/wf_log.txt \
2> ${out_dir}/wf_stderr.txt
Outputs directory
The main elements of the outputs directory are the following:
out_dir/
├── ...
├── report/
| ├── ...
| ├── run.html
| └── sample-A.html
├── stats/
| ├── ...
| └── multiqc/
| ├── ...
| └── multiqc_report.html
└── structural_variants/
├── ...
├── sample-A_unfiltered.vcf
└── sample-A_filtered.vcf
- The reports files containing filtered fusions
list, annotations and viewers are in
out_dir/report/{sample}.html. - The quality reports is in
out_dir/stats/multiqc/multiqc_report.html. It resumes qualities of reads, distribution of alignments (between exon, intron, ...) and strandness analysis. - The annotated fusions in VCF format are kept in
out_dir/structural_variants.*_unfiltered.vcfcontain all the fusions, their annotations and their tags.*_filtered.vcfcontain all the fusions, their annotations and their tags after filtering by rules provided by file declarated in configfile byfilters.rules.
Performances
The performances are evaluated on real, synthetic and simulated datasets. The
commands used in evaluation are stored in assessment. The results summarized
in assessment/report.html.
Copyright
2019 Laboratoire d'Anatomo-Cytopathologie du CHU de Toulouse
Owner
- Name: BIAliMed
- Login: bialimed
- Kind: organization
- Location: France
- Repositories: 2
- Profile: https://github.com/bialimed
BioInformatics for personALIsed MEDicine
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - family-names: "Escudié" given-names: "Frédéric" affiliation: "Centre Hospitalier Universitaire de Toulouse" - family-names: "Ivashchenko" given-names: "Véronique" affiliation: "Centre Hospitalier Universitaire de Toulouse" - family-names: "Brousset" given-names: "Pierre" affiliation: "Centre Hospitalier Universitaire de Toulouse" title: "SoFuR" version: 0.8.0 doi: 10.5281/zenodo.15376175 date-released: 2022-03-11 url: "https://github.com/bialimed/SoFuR"
GitHub Events
Total
- Release event: 1
- Public event: 1
- Push event: 7
Last Year
- Release event: 1
- Public event: 1
- Push event: 7