https://github.com/ccrgeneticsbranch/ervtelescope

https://github.com/ccrgeneticsbranch/ervtelescope

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
    Links to: plos.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.1%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: CCRGeneticsBranch
  • License: mit
  • Language: Shell
  • Default Branch: master
  • Size: 250 KB
Statistics
  • Stars: 1
  • Watchers: 6
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created about 5 years ago · Last pushed over 2 years ago
Metadata Files
Readme License

README.md

CCRGenetics Branch Snakemake Pipeline Telescope

Transposable element annotations used by Telescope.

Telescope is a computational method that provides accurate estimation of transposable element expression. It directly addresses uncertainty in fragment assignment by reassigning ambiguously mapped fragments to the most probable source transcript as determined within a Bayesian statistical model.

Running the pipeline on Biowulf

  1. Create a folder fastq and add all compressed RNA-seq fastq files to be run with extensions _R1.fastq.gz and _R2.fastq.gz. Hardlinks and symlinks will also work.

  2. The wrapper script run_ervTelescope.sh can be used to a. initialize output folder, b. dryrun and c. submit run to cluster ```bash % ./runervTelescope.sh Pipeline Dir: /gpfs/gsfs10/users/CCBRPipeliner/Pipelines/ervTelescope Git Commit/Tag: 65f31ef9dad234266f456f37bc984886a7f56216

run_ervTelescope.sh --> run ervTelescope pipeline

USAGE: bash ./run_ervTelescope.sh -m/--runmode= -w/--workdir= Required Arguments: 1. RUNMODE: [Type: String] Valid options: *) init : initialize workdir *) run : run with slurm *) reset : DELETE workdir dir and re-init it *) dryrun : dry run snakemake to generate DAG *) unlock : unlock workdir if locked by snakemake *) runlocal : run without submitting to sbatch 2. WORKDIR: [Type: String]: Absolute or relative path to the output folder with write permissions. ```

a. initialize output folder

Output folder needs to be initialized with the following command:

bash % bash ./run_ervTelescope.sh -m/--runmode=init -w/--workdir=<PathToOutdir>

b. dryrun

Next, we cd to the output folder and edit the config.yaml to provide the run-specific user defined data like location of input fastqs, location of bowtie2 reference, genome annotations in GTF format etc. Then, we can dryrun the pipeline to verify that we are ready to submit the run to the cluster with the following command:

bash % bash ./run_ervTelescope.sh -m/--runmode=dryrun -w/--workdir=<PathToOutdir>

c. submit to cluster

The pipeline can be submitted to cluster using the following command:

bash % bash ./run_ervTelescope.sh -m/--runmode=run -w/--workdir=<PathToOutdir>

The above command will submit the master job to the cluster which will the spawn individual snakemake rule-jobs. It returns the master jobid which can be used to track the progress pipeline using a command similar to :

bash % squeue -u $USER -j <JOBID>

Owner

  • Name: CCR Genetics Branch
  • Login: CCRGeneticsBranch
  • Kind: organization

Genomics, and epigenetic pipelines developed by the OncoGenomics Section of the Genetics Branch, CCR.

GitHub Events

Total
Last Year