denovo-rnaseq
De novo assembly, Differential gene expression, and annotation workflow
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (7.5%) to scientific vocabulary
Last synced: 10 months ago
·
JSON representation
·
Repository
De novo assembly, Differential gene expression, and annotation workflow
Basic Info
Statistics
- Stars: 5
- Watchers: 1
- Forks: 0
- Open Issues: 6
- Releases: 2
Created about 4 years ago
· Last pushed over 3 years ago
Metadata Files
Readme
License
Citation
README.md
denovo-rnaseq
Description
- This is a simple Snakemake workflow to perform de novo transcriptome assembly, DGE, and annotation.
- The workfow currently annotates the highly differential expressed genes with PVal of 0.005 and logfoldchange of 1.
Create conda enviornment
conda env create -f environment.yml
conda activate rnaseq
Limitations
- This workflow is meant to be easy to run, fast to implement, so it's currently not supporting flexibile configs.
- If you need to change configuration and parameters you will need to edit the Snakefile.
- It's only supporting RNASeq paired-end gzipped samples (not interleaved).
Main software used
- Error trimming: fastp
- De novo transcriptome assembly: rnaSpades
- Quantification: Salmon
- Differential gene expression: DESEQ2
- Annotation: Trinotate
How to use?
- Create a working directory
- Change the
ROOT_DIRin theSnakefileto match your working directory. - Create a directory with the name
samplesinside the working directory. - Put your samples in the
samplesdirectory with the naming convection:- R1:
_1.fastq.gz - R2:
_2.fastq.gz
- R1:
- Copy paste the tab-delimited file samples.tsv in your workflow directory.
- Modify the
samples.tsvto match your samples. Columns as following (sampletype, samplename, R1path, R2path).
You may use the following bash commands to update the Snakefile and samples.tsv files.
bash
curr=$(pwd)/workflow/
sed -i "s/REPLACE_ABSOLUTE_PATH/${curr//\//\\/}/g" workflow/samples.tsv
sed -i "s/REPLACE_ROOT_DIR/${curr//\//\\/}/g" Snakefile
Workflow rulegraph

Owner
- Name: Mohamed Abuelanin
- Login: mr-eyes
- Kind: user
- Location: Davis, CA
- Company: University of California, Davis
- Website: https://mr-eyes.com
- Twitter: mabuelanin
- Repositories: 18
- Profile: https://github.com/mr-eyes
Computer Science Ph.D. Student
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: Abuelanin
given-names: Mohamed
orcid: https://orcid.org/0000-0002-3419-4785
title: "De novo RNA-Seq Snakemake Workflow"
version: 1.0.1
date-released: 2023-01-05
url: "https://github.com/mr-eyes/denovo-rnaseq"
GitHub Events
Total
Last Year
Dependencies
environment.yml
conda
- _libgcc_mutex 0.1
- _openmp_mutex 4.5
- _r-mutex 1.0.1
- _sysroot_linux-64_curr_repodata_hack 3
- aioeasywebdav 2.4.0
- aiohttp 3.7.4.post0
- alsa-lib 1.1.5
- amply 0.1.5
- appdirs 1.4.4
- async-timeout 3.0.1
- atk-1.0 2.36.0
- attmap 0.13.2
- attrs 21.4.0
- backports 1.0
- backports.functools_lru_cache 1.6.4
- bcrypt 3.2.0
- binutils_impl_linux-64 2.36.1
- binutils_linux-64 2.36
- bioconductor-annotate 1.72.0
- bioconductor-annotationdbi 1.56.1
- bioconductor-biobase 2.54.0
- bioconductor-biocfilecache 2.2.0
- bioconductor-biocgenerics 0.40.0
- bioconductor-biocio 1.4.0
- bioconductor-biocparallel 1.28.3
- bioconductor-biomart 2.50.0
- bioconductor-biostrings 2.62.0
- bioconductor-ctc 1.68.0
- bioconductor-delayedarray 0.20.0
- bioconductor-deseq2 1.34.0
- bioconductor-dexseq 1.40.0
- bioconductor-edger 3.36.0
- bioconductor-genefilter 1.76.0
- bioconductor-genelendatabase 1.30.0
- bioconductor-geneplotter 1.72.0
- bioconductor-genomeinfodb 1.30.0
- bioconductor-genomeinfodbdata 1.2.7
- bioconductor-genomicalignments 1.30.0
- bioconductor-genomicfeatures 1.46.1
- bioconductor-genomicranges 1.46.1
- bioconductor-go.db 3.14.0
- bioconductor-goseq 1.46.0
- bioconductor-iranges 2.28.0
- bioconductor-keggrest 1.34.0
- bioconductor-limma 3.50.1
- bioconductor-matrixgenerics 1.6.0
- bioconductor-qvalue 2.26.0
- bioconductor-rhtslib 1.26.0
- bioconductor-rsamtools 2.10.0
- bioconductor-rtracklayer 1.54.0
- bioconductor-s4vectors 0.32.3
- bioconductor-seqlogo 1.60.0
- bioconductor-summarizedexperiment 1.24.0
- bioconductor-xvector 0.34.0
- bioconductor-zlibbioc 1.40.0
- blast 2.12.0
- boost-cpp 1.74.0
- boto3 1.21.21
- botocore 1.24.21
- bowtie 1.0.0
- bowtie2 2.3.4.3
- brotlipy 0.7.0
- bwidget 1.9.14
- bzip2 1.0.8
- c-ares 1.18.1
- ca-certificates 2022.6.15
- cachetools 5.0.0
- cairo 1.16.0
- certifi 2021.5.30
- cffi 1.14.6
- chardet 4.0.0
- charset-normalizer 2.0.12
- coin-or-cbc 2.10.8
- coin-or-cgl 0.60.6
- coin-or-clp 1.17.6
- coin-or-osi 0.108.7
- coin-or-utils 2.11.6
- coincbc 2.10.8
- configargparse 1.5.3
- connection_pool 0.0.3
- coreutils 8.25
- cryptography 35.0.0
- curl 7.83.1
- datrie 0.8.2
- decorator 4.4.2
- defusedxml 0.7.1
- docutils 0.17.1
- dropbox 11.32.0
- entrez-direct 16.2
- expat 2.4.8
- fastp 0.23.2
- fftw 3.3.10
- filechunkio 1.8
- filelock 3.4.1
- font-ttf-dejavu-sans-mono 2.37
- font-ttf-inconsolata 3.000
- font-ttf-source-code-pro 2.038
- font-ttf-ubuntu 0.83
- fontconfig 2.14.0
- fonts-conda-ecosystem 1
- fonts-conda-forge 1
- freetype 2.10.4
- fribidi 1.0.10
- ftputil 5.0.4
- gcc_impl_linux-64 10.3.0
- gcc_linux-64 10.3.0
- gdk-pixbuf 2.42.8
- gettext 0.19.8.1
- gfortran_impl_linux-64 10.3.0
- gfortran_linux-64 10.3.0
- ghostscript 9.54.0
- giflib 5.2.1
- gitdb 4.0.9
- gitpython 3.1.18
- gmp 6.2.1
- google-api-core 2.8.0
- google-api-python-client 2.51.0
- google-auth 2.8.0
- google-auth-httplib2 0.1.0
- google-cloud-core 2.3.1
- google-cloud-storage 2.4.0
- google-crc32c 1.1.2
- google-resumable-media 2.3.3
- googleapis-common-protos 1.53.0
- graphite2 1.3.13
- graphviz 3.0.0
- grpcio 1.38.1
- gsl 2.7
- gtk2 2.24.33
- gts 0.7.6
- gxx_impl_linux-64 10.3.0
- gxx_linux-64 10.3.0
- harfbuzz 4.2.0
- hdf5 1.10.6
- hmmer 3.3.2
- htslib 1.14
- httplib2 0.20.4
- icu 69.1
- idna 3.3
- idna_ssl 1.1.0
- imagemagick 7.1.0_29
- importlib-metadata 4.8.1
- importlib_metadata 4.8.1
- iniconfig 1.1.1
- ipython_genutils 0.2.0
- isa-l 2.30.0
- jbig 2.1
- jinja2 3.0.3
- jmespath 0.10.0
- jpeg 9e
- jsonschema 4.1.2
- jupyter_core 4.8.1
- kallisto 0.48.0
- kernel-headers_linux-64 3.10.0
- keyutils 1.6.1
- kmer-jellyfish 2.3.0
- krb5 1.19.3
- lcms2 2.12
- ld_impl_linux-64 2.36.1
- lerc 3.0
- libblas 3.9.0
- libcblas 3.9.0
- libcrc32c 1.1.2
- libcurl 7.83.1
- libdb 6.2.32
- libdeflate 1.10
- libedit 3.1.20191231
- libev 4.33
- libffi 3.4.2
- libgcc 7.2.0
- libgcc-devel_linux-64 10.3.0
- libgcc-ng 12.1.0
- libgd 2.3.3
- libgfortran-ng 12.1.0
- libgfortran5 12.1.0
- libglib 2.70.2
- libgomp 12.1.0
- libiconv 1.16
- libidn2 2.3.2
- libjemalloc 5.2.1
- liblapack 3.9.0
- liblapacke 3.9.0
- libnghttp2 1.47.0
- libnsl 2.0.0
- libopenblas 0.3.20
- libpng 1.6.37
- libprotobuf 3.18.0
- librsvg 2.52.5
- libsanitizer 10.3.0
- libsodium 1.0.18
- libssh2 1.10.0
- libstdcxx-devel_linux-64 10.3.0
- libstdcxx-ng 12.1.0
- libtiff 4.4.0
- libtool 2.4.6
- libunistring 0.9.10
- libuuid 2.32.1
- libwebp 1.2.2
- libwebp-base 1.2.2
- libxcb 1.13
- libxml2 2.9.12
- libzlib 1.2.12
- llvm-openmp 8.0.1
- logmuse 0.2.6
- lz4-c 1.9.3
- make 4.3
- markupsafe 2.0.1
- more-itertools 8.13.0
- multidict 5.2.0
- nbformat 5.1.3
- ncurses 6.3
- networkx 2.5.1
- numpy 1.19.5
- oauth2client 4.1.3
- openjdk 11.0.1
- openjpeg 2.4.0
- openmp 8.0.1
- openssl 1.1.1o
- packaging 21.3
- pandas 1.1.5
- pandoc 2.18
- pango 1.50.7
- paramiko 2.11.0
- pcre 8.45
- pcre2 10.37
- peppy 0.31.2
- perl 5.32.1
- perl-app-cpanminus 1.7039
- perl-archive-tar 2.40
- perl-business-isbn 3.007
- perl-business-isbn-data 20210112.006
- perl-carp 1.38
- perl-common-sense 3.75
- perl-compress-raw-bzip2 2.103
- perl-compress-raw-zlib 2.105
- perl-constant 1.33
- perl-cpan-meta 2.150010
- perl-cpan-meta-requirements 2.140
- perl-cpan-meta-yaml 0.018
- perl-data-dumper 2.183
- perl-db-file 1.855
- perl-dbd-sqlite 1.70
- perl-dbi 1.643
- perl-encode 3.17
- perl-exporter 5.72
- perl-exporter-tiny 1.002002
- perl-extutils-cbuilder 0.280230
- perl-extutils-makemaker 6.66
- perl-extutils-manifest 1.73
- perl-extutils-parsexs 3.44
- perl-file-find-rule 0.34
- perl-file-path 2.09
- perl-file-temp 0.2304
- perl-getopt-long 2.52
- perl-io-compress 2.106
- perl-io-zlib 1.11
- perl-ipc-cmd 1.04
- perl-json 4.06
- perl-json-pp 4.09
- perl-json-xs 2.34
- perl-list-moreutils 0.430
- perl-list-moreutils-xs 0.430
- perl-locale-maketext-simple 0.21
- perl-mime-base64 3.16
- perl-module-build 0.4231
- perl-module-corelist 5.20220527
- perl-module-load 0.34
- perl-module-load-conditional 0.68
- perl-module-metadata 1.000037
- perl-number-compare 0.03
- perl-params-check 0.38
- perl-parent 0.236
- perl-pathtools 3.75
- perl-perl-ostype 1.010
- perl-scalar-list-utils 1.62
- perl-text-abbrev 1.02
- perl-text-glob 0.11
- perl-text-parsewords 3.31
- perl-threaded 5.32.1
- perl-types-serialiser 1.01
- perl-uri 5.10
- perl-version 0.9924
- pip 21.3.1
- pixman 0.40.0
- pkg-config 0.29.2
- pluggy 1.0.0
- ply 3.11
- prettytable 2.5.0
- protobuf 3.18.0
- psutil 5.8.0
- pthread-stubs 0.4
- pulp 2.5.1
- py 1.11.0
- pyasn1 0.4.8
- pyasn1-modules 0.2.7
- pycparser 2.21
- pygments 2.12.0
- pygraphviz 1.6
- pynacl 1.4.0
- pyopenssl 22.0.0
- pyparsing 3.0.9
- pyrsistent 0.17.3
- pysftp 0.2.9
- pysocks 1.7.1
- pytest 6.2.5
- python 3.6.15
- python-dateutil 2.8.2
- python-irodsclient 1.1.3
- python_abi 3.6
- pytz 2022.1
- pyu2f 0.1.5
- pyyaml 5.4.1
- r-amap 0.8_18
- r-ape 5.6_2
- r-argparse 2.1.5
- r-askpass 1.1
- r-assertthat 0.2.1
- r-backports 1.4.1
- r-base 4.1.3
- r-base64enc 0.1_3
- r-bh 1.78.0_0
- r-biasedurn 1.07
- r-bit 4.0.4
- r-bit64 4.0.5
- r-bitops 1.0_7
- r-blob 1.2.3
- r-brio 1.1.3
- r-broom 0.8.0
- r-bslib 0.3.1
- r-cachem 1.0.6
- r-callr 3.7.0
- r-catools 1.18.2
- r-cellranger 1.1.0
- r-cli 3.3.0
- r-clipr 0.8.0
- r-cluster 2.1.3
- r-colorspace 2.0_3
- r-cpp11 0.4.2
- r-crayon 1.5.1
- r-curl 4.3.2
- r-data.table 1.14.2
- r-dbi 1.1.2
- r-dbplyr 2.2.0
- r-desc 1.4.1
- r-diffobj 0.3.5
- r-digest 0.6.29
- r-dplyr 1.0.9
- r-dtplyr 1.2.1
- r-ellipsis 0.3.2
- r-evaluate 0.15
- r-fansi 1.0.3
- r-farver 2.1.0
- r-fastcluster 1.2.3
- r-fastmap 1.1.0
- r-fastmatch 1.1_3
- r-filelock 1.0.2
- r-findpython 1.0.7
- r-forcats 0.5.1
- r-formatr 1.12
- r-fs 1.5.2
- r-futile.logger 1.4.3
- r-futile.options 1.0.1
- r-gargle 1.2.0
- r-generics 0.1.2
- r-ggplot2 3.3.6
- r-glue 1.6.2
- r-googledrive 2.0.0
- r-googlesheets4 1.0.0
- r-gplots 3.1.3
- r-gtable 0.3.0
- r-gtools 3.9.2.2
- r-haven 2.5.0
- r-highr 0.9
- r-hms 1.1.1
- r-htmltools 0.5.2
- r-httr 1.4.3
- r-hwriter 1.3.2.1
- r-ids 1.0.1
- r-igraph 1.3.0
- r-isoband 0.2.5
- r-jquerylib 0.1.4
- r-jsonlite 1.8.0
- r-kernsmooth 2.23_20
- r-knitr 1.39
- r-labeling 0.4.2
- r-lambda.r 1.2.4
- r-lattice 0.20_45
- r-lifecycle 1.0.1
- r-locfit 1.5_9.5
- r-lubridate 1.8.0
- r-magrittr 2.0.3
- r-mass 7.3_57
- r-matrix 1.4_1
- r-matrixstats 0.62.0
- r-memoise 2.0.1
- r-mgcv 1.8_40
- r-mime 0.12
- r-modelr 0.1.8
- r-munsell 0.5.0
- r-nlme 3.1_158
- r-openssl 2.0.2
- r-phangorn 2.9.0
- r-pillar 1.7.0
- r-pkgconfig 2.0.3
- r-pkgload 1.2.4
- r-plogr 0.2.0
- r-plyr 1.8.7
- r-png 0.1_7
- r-praise 1.0.0
- r-prettyunits 1.1.1
- r-processx 3.6.0
- r-progress 1.2.2
- r-ps 1.7.0
- r-purrr 0.3.4
- r-quadprog 1.5_8
- r-r6 2.5.1
- r-rappdirs 0.3.3
- r-rcolorbrewer 1.1_3
- r-rcpp 1.0.8.3
- r-rcpparmadillo 0.11.2.0.0
- r-rcurl 1.98_1.7
- r-readr 2.1.2
- r-readxl 1.4.0
- r-rematch 1.0.1
- r-rematch2 2.1.2
- r-reprex 2.0.1
- r-reshape2 1.4.4
- r-restfulr 0.0.14
- r-rjson 0.2.21
- r-rlang 1.0.2
- r-rmarkdown 2.14
- r-rprojroot 2.0.3
- r-rsqlite 2.2.8
- r-rstudioapi 0.13
- r-rvest 1.0.2
- r-sass 0.4.1
- r-scales 1.2.0
- r-selectr 0.4_2
- r-sm 2.2_5.7
- r-snow 0.4_4
- r-statmod 1.4.36
- r-stringi 1.7.6
- r-stringr 1.4.0
- r-survival 3.3_1
- r-sys 3.4
- r-testthat 3.1.4
- r-tibble 3.1.7
- r-tidyr 1.2.0
- r-tidyselect 1.1.2
- r-tidyverse 1.3.1
- r-tinytex 0.40
- r-tzdb 0.3.0
- r-utf8 1.2.2
- r-uuid 1.1_0
- r-vctrs 0.4.1
- r-vioplot 0.3.7
- r-viridislite 0.4.0
- r-vroom 1.5.7
- r-waldo 0.4.0
- r-withr 2.5.0
- r-xfun 0.31
- r-xml 3.99_0.9
- r-xml2 1.3.3
- r-xtable 1.8_4
- r-yaml 2.3.5
- r-zoo 1.8_10
- ratelimiter 1.2.0
- readline 8.1.2
- requests 2.28.0
- rsa 4.8
- s3transfer 0.5.2
- salmon 1.8.0
- samtools 1.15
- sed 4.8
- seqkit 2.2.0
- setuptools 58.0.4
- six 1.16.0
- slacker 0.14.0
- smart_open 6.0.0
- smmap 3.0.5
- snakemake 6.15.5
- snakemake-minimal 6.15.5
- spades 3.15.4
- sqlite 3.38.5
- stone 3.3.1
- stopit 1.1.2
- sysroot_linux-64 2.17
- tabulate 0.8.9
- tbb 2021.5.0
- tk 8.6.12
- tktable 2.10
- toml 0.10.2
- toposort 1.7
- traitlets 4.3.3
- transdecoder 5.5.0
- trimmomatic 0.39
- trinity 2.13.2
- trinotate 3.2.2
- typing-extensions 4.1.1
- typing_extensions 4.1.1
- ubiquerg 0.6.1
- uritemplate 4.1.1
- urllib3 1.26.9
- veracitools 0.1.3
- wcwidth 0.2.5
- wget 1.20.3
- wheel 0.37.1
- wrapt 1.13.1
- xorg-fixesproto 5.0
- xorg-inputproto 2.3.2
- xorg-kbproto 1.0.7
- xorg-libice 1.0.10
- xorg-libsm 1.2.3
- xorg-libx11 1.7.2
- xorg-libxau 1.0.9
- xorg-libxdmcp 1.1.3
- xorg-libxext 1.3.4
- xorg-libxfixes 5.0.3
- xorg-libxi 1.7.10
- xorg-libxrender 0.9.10
- xorg-libxt 1.2.1
- xorg-libxtst 1.2.3
- xorg-recordproto 1.14.2
- xorg-renderproto 0.11.1
- xorg-xextproto 7.3.0
- xorg-xproto 7.0.31
- xz 5.2.5
- yaml 0.2.5
- yarl 1.6.3
- zipp 3.6.0
- zlib 1.2.12
- zstd 1.5.2
.github/workflows/test_workflow.yml
actions
- actions/cache v2 composite
- actions/checkout v2 composite
- conda-incubator/setup-miniconda v2 composite