rna_seq_pipeline
Modular RNA-seq pipeline using Nextflow, Salmon, and DESeq2
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (13.2%) to scientific vocabulary
Repository
Modular RNA-seq pipeline using Nextflow, Salmon, and DESeq2
Basic Info
- Host: GitHub
- Owner: Cslaer2025
- License: mit
- Language: Nextflow
- Default Branch: main
- Size: 7.81 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
RNA-seq Analysis Pipeline
A modular, scalable RNA-seq pipeline for quantifying and analyzing gene expression data. Built using Nextflow, Salmon, and DESeq2, with reproducibility and clarity in mind.
🧪 Features
- Pseudo-alignment using Salmon
- Quantification to transcript- and gene-level
- Differential expression analysis with DESeq2
- Modular structure using Nextflow
- Easy setup via Conda environments
- Generates QC reports, PCA plots, and volcano plots
🔧 Requirements
- Nextflow
- Conda / Mamba
- Git
- Internet access (for pulling reference or demo data)
📁 Repo Structure
| Folder | Description |
|----------------|-----------------------------------------------|
| data/ | Input FASTQ files or sample accessions |
| results/ | All output files: quant, stats, plots |
| workflow/ | Nextflow scripts (main.nf, modules, etc.) |
| scripts/ | Custom R and Python scripts for plotting, QC |
| envs/ | Conda YAMLs for reproducibility |
| config/ | Sample sheet, pipeline settings |
▶️ Quick Start
```bash
clone the repo
git clone https://github.com/Cslaer2025/rnaseqpipeline.git cd rnaseqpipeline
run pipeline (demo mode)
nextflow run workflow/main.nf -profile conda
rnaseqpipeline
Modular RNA-seq pipeline using Nextflow, Salmon, and DESeq2
f6ec3f3659f13f73b855f193dcad2c5ce5b0522a
🚀 How to Run the Pipeline
Make sure you have Nextflow and Conda installed.
📁 Project Structure
rnaseqpipeline/ ├── config/ │ └── samplesheet.csv # Sample metadata ├── data/ │ └── transcriptome.fa # Reference transcriptome ├── scripts/ │ └── deseq2.R # DESeq2 analysis script ├── results/ ├── main.nf # Nextflow workflow └── .gitignore
▶️ Command to Run
```bash nextflow run main.nf -profile standard
By default, this will:
Build a Salmon index
Run quantification on paired-end reads
Perform DESeq2 differential expression analysis
Output: results/deseq2results.csv 🧪 Sample Sheet Format sampleid condition fastq1 fastq2 sample1 control data/sample1R1.fastq.gz data/sample1R2.fastq.gz sample2 treatment data/sample2R1.fastq.gz data/sample2R2.fastq.gz yaml Copy Edit
🔹 3. Save and exit nano:
- Press
Ctrl + O→ then Enter to save - Press
Ctrl + Xto exit
✅ Git Commands to Commit and Push the README Update
```bash cd ~/rnaseqpipeline git add README.md git commit -m "Add usage instructions and project layout to README" git push
Owner
- Login: Cslaer2025
- Kind: user
- Repositories: 1
- Profile: https://github.com/Cslaer2025
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this pipeline, please cite it as below."
title: "RNA-Seq Analysis Pipeline using Nextflow, Salmon, and DESeq2"
authors:
- family-names: Sarmiento
given-names: Cleofes
orcid: https://orcid.org/0000-0002-1234-5678
affiliation: University of Nebraska Medical Center
date-released: 2025-04-14
version: 1.0.0
doi: 10.0000/zenodo.1234567
repository-code: https://github.com/Cslaer2025/rna_seq_pipeline
license: MIT
GitHub Events
Total
- Push event: 11
- Create event: 2
Last Year
- Push event: 11
- Create event: 2