tandem_rnaseq

Costerton Biofilm Center RNA-sequencing analysis pipeline

https://github.com/bgfritz1/tandem_rnaseq

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.5%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

Costerton Biofilm Center RNA-sequencing analysis pipeline

Basic Info
  • Host: GitHub
  • Owner: bgfritz1
  • License: gpl-3.0
  • Language: Python
  • Default Branch: main
  • Size: 40.8 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 2
  • Releases: 1
Created over 2 years ago · Last pushed over 1 year ago
Metadata Files
Readme License Citation

Readme.md

DOI

README

To run the workflow, do the following:

  1. Make sure that the necessary prerequisite files are downloaded. At least this should be the human genome fastq and the annotation file

    • FASTQ: http://ftp.ensembl.org/pub/release-106/fasta/homosapiens/dna/Homosapiens.GRCh38.dna.primary_assembly.fa.gz
    • Annotation: http://ftp.ensembl.org/pub/release-106/gtf/homosapiens/Homosapiens.GRCh38.106.chr.gtf.gz
  2. Make sure the kraken database is downloaded and accessible. Prebuilt databases can be found here: https://benlangmead.github.io/aws-indexes/k2

  3. Edit the metadata.tsv file with your sample information. The metadata file is tab-delimited matrix and should at least have the following column values:

    • sample: Where every sample should have a unique sample name
    • strategy: either PAIRED or SINGLE referring the whether it was paired-end sequencing or single-end.
    • strandedness: refers to the a stranded or directional library prep. 0 = unstranded, 1 = forward-stranded, 2=reverse-stranded.
    • Additonal columns are also accepted.
    • Edit the units.tsv file. This contains information about the raw sequence data files. Paths can be either relative or absolute. The unitcolumn refers to samples that were sequenced across multiple lanes on the same flow cell. If the reads are single end, just leave the fq2 column blank.
    • Edit the config/congif.yamlfile with the relevant details. Paths can be relative to the repository directory. Note: Some options in the config file have not yet been implemented.
    • Once everythng is set up, make sure the following programs are available:
    • snakemake v6.15.3 I have had problems running the pipeling on snakemake v7
    • mamba 0.24.0
    • python>3.7
      On computerome, this can be done with: module load miniconda3/4.12.0 module load mamba-org/mamba/0.24.0 module load snakemake/6.15.3
    • In config/Computerome2/config.yaml change the cluster parameter to the proper Computerome account string.
    • Now, you should be read to go! Navigate to the repository directory (the one containing the config/, workflow/, etc. files) and run the following command: snakemake --profile config/Computerome2. This will start the analysis. It is advisable to run the analysis on a subset of samples first to check that it works as expected.

Owner

  • Name: Blaine Fritz
  • Login: bgfritz1
  • Kind: user
  • Location: Copenhagen, Denmark
  • Company: University of Copenhagen

Postdoc at Costerton Biofilm Center, University of Copenhagen

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Blaine"
  given-names: "Fritz"
  orcid: "https://orcid.org/0000-0002-2085-894X"
title: "Tandem RNA-sequencing Pipeline"
version: 1.0
url: "https://github.com/bgfritz1/Tandem_RNAseq"

GitHub Events

Total
  • Issues event: 1
  • Push event: 3
Last Year
  • Issues event: 1
  • Push event: 3

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 1
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 1
  • Total pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 1
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 1
  • Pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • bgfritz1 (2)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels