bioinformatics-templates
Reusable R/Bash scripts for RNA-Seq (DESeq2), VCF filtering, and multi-omics visualization (heatmaps, volcano plots)
Science Score: 57.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 3 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.4%) to scientific vocabulary
Keywords
Repository
Reusable R/Bash scripts for RNA-Seq (DESeq2), VCF filtering, and multi-omics visualization (heatmaps, volcano plots)
Basic Info
- Host: GitHub
- Owner: somayehsarirchi
- License: mit
- Language: HTML
- Default Branch: main
- Homepage: https://github.com/somayehsarirchi
- Size: 17.5 MB
Statistics
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 2
Topics
Metadata Files
README.md
Bioinformatics Analysis Templates
This repository contains modular R/bash pipelines and example datasets for common bioinformatics workflows. It includes end-to-end analysis scripts and results for:
- 🧬 Bulk RNA-Seq differential expression analysis (DESeq2)
- 🌡 Heatmap generation
- 🌋 Volcano plot visualization
- 🧬 VCF filtering for WES-derived variant files
- 🧪 Weighted Gene Co-expression Network Analysis (WGCNA)
- 🔁 Raw RNA-Seq data preprocessing from FASTQ to Count Matrix
⚠️ All data and results are included for demonstration purposes only and contain no real patient identifiers or confidential content.
📁 Folder Structure
Bioinformatics-Templates/
├── README.md
├── CITATION.cff
├── LICENSE
├── demo_analysis.Rmd
├── deseq2/ # DESeq2 pipeline + example data
├── heatmap/ # Heatmap template + matrix
├── volcano/ # Volcano plot script + matrix
├── vcf_filtering/ # VCF filtering script + demo data
├── Raw_FASTQ_to_CountMatrix/ # Raw FASTQ → Count Matrix Bash pipeline
│ └── scripts/
├── GSE148355_WGCNA/ # WGCNA analysis example on GEO dataset
📜 Scripts Overview
| Script Name | Description |
|----------------------------|-----------------------------------------------------------------------------|
| DESeq2_pipeline.R | Normalize RNA-Seq counts and identify DEGs |
| Heatmap_Template.R | Generate clustered heatmap from expression matrix |
| volcano_plot.R | Create volcano plot of DEGs from DESeq2 results |
| VCF_Filtering.R | Apply filters to VCF files based on quality/depth/etc. |
| pipeline.sh | Bash pipeline from fastq-dump to htseq-count |
| WGCNA_analysis.R | Full WGCNA pipeline using GSE148355 dataset |
📂 Sample Data
| File | Description |
|----------------------------|-----------------------------------------------------------|
| example_counts.txt | Simulated raw count matrix |
| example_metadata.csv | Sample metadata with group labels |
| expression_matrix.csv | Expression matrix for heatmap/volcano |
| myvcf.vcf | Demo VCF file for variant filtering |
| vcf_filtered.csv | Filtered VCF output |
| GSE148355_raw_counts.csv | Real RNA-Seq counts from GEO |
| traits.csv | Sample traits (e.g. group, gender, age) |
🚀 How to Use
Clone the repository:
bash git clone https://github.com/somayehsarirchi/Bioinformatics-Templates.git cd Bioinformatics-TemplatesNavigate to the relevant analysis folder (e.g.,
deseq2/,GSE148355_WGCNA/)Run the script in R, RStudio, or terminal:
r source("DESeq2_pipeline.R")Render the demo notebook (optional): Open
demo_analysis.Rmdin RStudio and click Knit to generate the report.
⚙️ Requirements
- R version ≥ 4.2
- Tools: sratoolkit, HISAT2, HTSeq
- Required R packages:
DESeq2,WGCNA,pheatmap,ggplot2,readr,Hmisc,enrichR, etc. - Bash and Java (for FASTQ processing)
📜 License
This project is licensed under the MIT License.
📦 Citation
If you use this repository or its scripts, please cite it as:
Somayeh Sarirchi (2025). Bioinformatics Analysis Templates: RNA-seq, VCF filtering, and Visualization Pipelines. GitHub repository: https://github.com/somayehsarirchi/Bioinformatics-Templates
DOI: 10.5281/zenodo.16493302
👩🔬 Author
Somayeh Sarirchi
Bioinformatics Scientist
📧 s.sarirchi@gmail.com
🔗 GitHub Profile
Owner
- Login: somayehsarirchi
- Kind: user
- Website: linkedin.com/in/somayeh-sarirchi-9b2b59171
- Repositories: 1
- Profile: https://github.com/somayehsarirchi
Bioinformatician | Transcriptomics| RNA-Seq Analysis | Computational Drug Design | Whole exome Sequencing | Meta-Analysis
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this repository, please cite it using the metadata below."
authors:
- family-names: Sarirchi
given-names: Somayeh
orcid: https://orcid.org/0000-0002-2964-8447
title: "Bioinformatics Analysis Templates: RNA-seq, VCF filtering, and Visualization Pipelines"
version: "1.0.1"
date-released: 2025-07-27
doi: 10.5281/zenodo.16493302
repository-code: https://github.com/somayehsarirchi/Bioinformatics-Templates
url: https://github.com/somayehsarirchi/Bioinformatics-Templates
GitHub Events
Total
- Release event: 1
- Push event: 88
- Create event: 3
Last Year
- Release event: 1
- Push event: 88
- Create event: 3