https://github.com/ccrgeneticsbranch/ervtelescope

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

○ CITATION.cff file
○ codemeta.json file
○ .zenodo.json file
✓ DOI references: Found 1 DOI reference(s) in README
✓ Academic publication links: Links to plos.org
○ Academic email domains
○ Institutional organization owner
○ JOSS paper metadata
○ Scientific vocabulary similarity: Low similarity (10.1%) to scientific vocabulary

Last synced: 10 months ago

Repository

Basic Info

Host: GitHub
Owner: CCRGeneticsBranch
License: mit
Language: Shell
Default Branch: master
Size: 250 KB

Statistics

Stars: 1
Watchers: 6
Forks: 0
Open Issues: 0
Releases: 0

Created about 5 years ago · Last pushed over 2 years ago

Metadata Files

Readme License

CCRGenetics Branch Snakemake Pipeline Telescope

Transposable element annotations used by Telescope.

Telescope is a computational method that provides accurate estimation of transposable element expression. It directly addresses uncertainty in fragment assignment by reassigning ambiguously mapped fragments to the most probable source transcript as determined within a Bayesian statistical model.

Running the pipeline on Biowulf

Create a folder named fastq and add all compressed RNA-seq FASTQ files to be run, with names ending in _R1.fastq.gz and _R2.fastq.gz. Hardlinks and symlinks will also work.
The wrapper script run_ervTelescope.sh can be used to a. initialize the output folder, b. dryrun the pipeline and c. submit the run to the cluster:

```bash
% ./run_ervTelescope.sh
Pipeline Dir: /gpfs/gsfs10/users/CCBRPipeliner/Pipelines/ervTelescope
Git Commit/Tag: 65f31ef9dad234266f456f37bc984886a7f56216

run_ervTelescope.sh --> run ervTelescope pipeline

USAGE:
  bash ./run_ervTelescope.sh -m/--runmode= -w/--workdir=
Required Arguments:
1.  RUNMODE: [Type: String] Valid options:
    *) init : initialize workdir
    *) run : run with slurm
    *) reset : DELETE workdir dir and re-init it
    *) dryrun : dry run snakemake to generate DAG
    *) unlock : unlock workdir if locked by snakemake
    *) runlocal : run without submitting to sbatch
2.  WORKDIR: [Type: String]: Absolute or relative path to the output folder with write permissions.
```
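As a minimal sketch of the FASTQ staging step, the snippet below symlinks paired files into a fastq folder and reports any _R1 file that lacks an _R2 mate. The source directory, the sample names (sampleA, sampleB) and the temporary working directory are hypothetical placeholders created on the fly so the example runs on its own; they are not part of the pipeline.

```bash
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical source directory with empty placeholder files
# (assumption: replace with the real location of your sequencing data).
src_dir="$(mktemp -d)"
touch "$src_dir/sampleA_R1.fastq.gz" "$src_dir/sampleA_R2.fastq.gz" "$src_dir/sampleB_R1.fastq.gz"

# Hypothetical working location that holds the fastq folder.
work_dir="$(mktemp -d)"
mkdir -p "$work_dir/fastq"

# Symlink every compressed FASTQ file into the fastq folder.
for f in "$src_dir"/*_R[12].fastq.gz; do
  ln -s "$f" "$work_dir/fastq/$(basename "$f")"
done

# Report every R1 file that has no matching R2 file.
missing=0
for r1 in "$work_dir"/fastq/*_R1.fastq.gz; do
  r2="${r1%_R1.fastq.gz}_R2.fastq.gz"
  if [ ! -e "$r2" ]; then
    echo "missing mate for $(basename "$r1")"
    missing=$((missing + 1))
  fi
done
echo "unpaired R1 files: $missing"
```

Here sampleB is deliberately left without an _R2 file, so the check flags it. Running a check like this before initializing the output folder may catch naming problems early.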

a. initialize output folder

Output folder needs to be initialized with the following command:

% bash ./run_ervTelescope.sh -m/--runmode=init -w/--workdir=<PathToOutdir>

b. dryrun

Next, cd to the output folder and edit config.yaml to provide the run-specific, user-defined settings, such as the location of the input FASTQ files, the location of the bowtie2 reference, and the genome annotations in GTF format. Then dryrun the pipeline to verify that the run is ready to be submitted to the cluster:

% bash ./run_ervTelescope.sh -m/--runmode=dryrun -w/--workdir=<PathToOutdir>

c. submit to cluster

The pipeline can be submitted to the cluster using the following command:

% bash ./run_ervTelescope.sh -m/--runmode=run -w/--workdir=<PathToOutdir>

The above command submits the master job to the cluster, which then spawns the individual snakemake rule-jobs. It returns the master job ID, which can be used to track the progress of the pipeline using a command similar to:

% squeue -u $USER -j <JOBID>
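If you would rather wait for the master job than re-run squeue by hand, a polling loop along the following lines may help. This is only a sketch: the function name wait_for_job and the SQUEUE override variable are hypothetical and not part of the pipeline, and it assumes that squeue prints nothing for the job once it has left the queue.

```bash
#!/usr/bin/env bash

# Poll Slurm until the given job ID no longer appears in the queue.
# SQUEUE is a hypothetical override hook (not a Slurm or pipeline feature)
# that lets the loop be tried without access to a cluster.
wait_for_job() {
  local jobid="$1"
  local interval="${2:-60}"   # seconds between polls
  # -h suppresses the header line, so empty output means the job
  # is no longer pending or running.
  while [ -n "$("${SQUEUE:-squeue}" -h -j "$jobid" 2>/dev/null)" ]; do
    sleep "$interval"
  done
  echo "job $jobid is no longer in the queue"
}

# Usage on the cluster, with JOBID set to the master job ID
# printed at submission:
#   wait_for_job "$JOBID" 120
```

The loop only tells you that the job has left the queue, not whether it succeeded, so the Snakemake logs in the output folder still need to be checked afterwards.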

Owner

Name: CCR Genetics Branch
Login: CCRGeneticsBranch
Kind: organization

Website: https://ccr.cancer.gov/Genetics-Branch
Repositories: 13
Profile: https://github.com/CCRGeneticsBranch

Genomics and epigenetics pipelines developed by the OncoGenomics Section of the Genetics Branch, CCR.
