https://github.com/ccrgeneticsbranch/ervtelescope
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
✓DOI references
Found 1 DOI reference(s) in README -
✓Academic publication links
Links to: plos.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.1%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: CCRGeneticsBranch
- License: mit
- Language: Shell
- Default Branch: master
- Size: 250 KB
Statistics
- Stars: 1
- Watchers: 6
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
CCRGenetics Branch Snakemake Pipeline Telescope
Transposable element annotations used by Telescope.
Telescope is a computational method that provides accurate estimation of transposable element expression. It directly addresses uncertainty in fragment assignment by reassigning ambiguously mapped fragments to the most probable source transcript as determined within a Bayesian statistical model.
Running the pipeline on Biowulf
Create a folder fastq and add all compressed RNA-seq fastq files to be run with extensions
_R1.fastq.gzand_R2.fastq.gz. Hardlinks and symlinks will also work.The wrapper script
run_ervTelescope.shcan be used to a. initialize output folder, b. dryrun and c. submit run to cluster ```bash % ./runervTelescope.sh Pipeline Dir: /gpfs/gsfs10/users/CCBRPipeliner/Pipelines/ervTelescope Git Commit/Tag: 65f31ef9dad234266f456f37bc984886a7f56216
run_ervTelescope.sh --> run ervTelescope pipeline
USAGE:
bash ./run_ervTelescope.sh -m/--runmode=
a. initialize output folder
Output folder needs to be initialized with the following command:
bash
% bash ./run_ervTelescope.sh -m/--runmode=init -w/--workdir=<PathToOutdir>
b. dryrun
Next, we cd to the output folder and edit the config.yaml to provide the run-specific user defined data like location of input fastqs, location of bowtie2 reference, genome annotations in GTF format etc. Then, we can dryrun the pipeline to verify that we are ready to submit the run to the cluster with the following command:
bash
% bash ./run_ervTelescope.sh -m/--runmode=dryrun -w/--workdir=<PathToOutdir>
c. submit to cluster
The pipeline can be submitted to cluster using the following command:
bash
% bash ./run_ervTelescope.sh -m/--runmode=run -w/--workdir=<PathToOutdir>
The above command will submit the master job to the cluster which will the spawn individual snakemake rule-jobs. It returns the master jobid which can be used to track the progress pipeline using a command similar to :
bash
% squeue -u $USER -j <JOBID>
Owner
- Name: CCR Genetics Branch
- Login: CCRGeneticsBranch
- Kind: organization
- Website: https://ccr.cancer.gov/Genetics-Branch
- Repositories: 13
- Profile: https://github.com/CCRGeneticsBranch
Genomics, and epigenetic pipelines developed by the OncoGenomics Section of the Genetics Branch, CCR.