tandem_rnaseq
Costerton Biofilm Center RNA-sequencing analysis pipeline
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ○ DOI references
- ✓ Academic publication links: Links to: zenodo.org
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (12.5%) to scientific vocabulary
Repository
Costerton Biofilm Center RNA-sequencing analysis pipeline
Basic Info
- Host: GitHub
- Owner: bgfritz1
- License: gpl-3.0
- Language: Python
- Default Branch: main
- Size: 40.8 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 2
- Releases: 1
Metadata Files
Readme.md
README
To run the workflow, do the following:
- Make sure that the necessary prerequisite files are downloaded. At a minimum, these are the human genome FASTA file and the annotation (GTF) file:
  - FASTA: http://ftp.ensembl.org/pub/release-106/fasta/homo_sapiens/dna/Homo_sapiens.GRCh38.dna.primary_assembly.fa.gz
  - Annotation: http://ftp.ensembl.org/pub/release-106/gtf/homo_sapiens/Homo_sapiens.GRCh38.106.chr.gtf.gz
- Make sure the Kraken database is downloaded and accessible. Prebuilt databases can be found here: https://benlangmead.github.io/aws-indexes/k2
- Edit the metadata.tsv file with your sample information. The metadata file is a tab-delimited table and should have at least the following columns:
  - sample: a unique name for each sample.
  - strategy: either PAIRED or SINGLE, indicating whether the sample was sequenced paired-end or single-end.
  - strandedness: whether the library prep was stranded (directional). 0 = unstranded, 1 = forward-stranded, 2 = reverse-stranded.
  - Additional columns are also accepted.
- Edit the units.tsv file. This contains information about the raw sequence data files. Paths can be either relative or absolute. The unit column is for samples that were sequenced across multiple lanes on the same flow cell. If the reads are single-end, leave the fq2 column blank.
- Edit the config/config.yaml file with the relevant details. Paths can be relative to the repository directory. Note: Some options in the config file have not yet been implemented.
- Once everything is set up, make sure the following programs are available:
  - snakemake v6.15.3 (I have had problems running the pipeline on snakemake v7)
  - mamba 0.24.0
  - python >3.7

  On Computerome, this can be done with:

      module load miniconda3/4.12.0
      module load mamba-org/mamba/0.24.0
      module load snakemake/6.15.3

- In config/Computerome2/config.yaml, change the cluster parameter to the proper Computerome account string.
- Now you should be ready to go! Navigate to the repository directory (the one containing config/, workflow/, etc.) and run the following command:

      snakemake --profile config/Computerome2

  This will start the analysis. It is advisable to run the analysis on a subset of samples first to check that it works as expected.
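Before launching the workflow, it can help to sanity-check the metadata file against the column rules described in the README. The following is a minimal sketch and is not part of the pipeline itself. The function name check_metadata and the example rows are hypothetical; the only things taken from the README are the three required columns (sample, strategy, strandedness) and their allowed values.

```python
import csv
import io

# Column names and allowed values as described in the README.
REQUIRED_COLUMNS = ("sample", "strategy", "strandedness")
STRATEGIES = {"PAIRED", "SINGLE"}
STRANDEDNESS = {"0", "1", "2"}


def check_metadata(handle):
    """Return a list of problems found in a tab-delimited metadata table.

    `handle` is any open text file object (hypothetical helper, not pipeline code).
    """
    reader = csv.DictReader(handle, delimiter="\t")
    header = reader.fieldnames or []
    missing = [c for c in REQUIRED_COLUMNS if c not in header]
    if missing:
        return ["missing column(s): " + ", ".join(missing)]

    problems = []
    seen = set()
    # Row 1 is the header, so data rows are counted from 2.
    for row_no, row in enumerate(reader, start=2):
        # Short rows yield None for absent fields, hence the `or ""`.
        sample = (row["sample"] or "").strip()
        strategy = (row["strategy"] or "").strip()
        strandedness = (row["strandedness"] or "").strip()

        if not sample:
            problems.append(f"row {row_no}: empty sample name")
        elif sample in seen:
            problems.append(f"row {row_no}: duplicate sample '{sample}'")
        seen.add(sample)

        if strategy not in STRATEGIES:
            problems.append(f"row {row_no}: strategy '{strategy}' is not PAIRED or SINGLE")
        if strandedness not in STRANDEDNESS:
            problems.append(f"row {row_no}: strandedness '{strandedness}' is not 0, 1 or 2")
    return problems


# Made-up example table: the second data row reuses a sample name
# and has an invalid strandedness value. Extra columns are allowed.
example = (
    "sample\tstrategy\tstrandedness\tgroup\n"
    "S1\tPAIRED\t2\tctrl\n"
    "S1\tSINGLE\t3\tctrl\n"
)
problems = check_metadata(io.StringIO(example))
for problem in problems:
    print(problem)
```

To check a real file, the same function can be called with an open file handle, for example `with open("metadata.tsv", newline="") as fh: print(check_metadata(fh))`, assuming metadata.tsv sits in the current directory.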
Owner
- Name: Blaine Fritz
- Login: bgfritz1
- Kind: user
- Location: Copenhagen, Denmark
- Company: University of Copenhagen
- Repositories: 1
- Profile: https://github.com/bgfritz1
Postdoc at Costerton Biofilm Center, University of Copenhagen
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: "Blaine"
    given-names: "Fritz"
    orcid: "https://orcid.org/0000-0002-2085-894X"
title: "Tandem RNA-sequencing Pipeline"
version: 1.0
url: "https://github.com/bgfritz1/Tandem_RNAseq"
GitHub Events
Total
- Issues event: 1
- Push event: 3
Last Year
- Issues event: 1
- Push event: 3
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 1
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 1
- Total pull request authors: 0
- Average comments per issue: 0.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 1
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 1
- Pull request authors: 0
- Average comments per issue: 0.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- bgfritz1 (2)