sns-ext

Minerva version for Seq-N-Slide pipeline (https://github.com/igordot/sns)

https://github.com/naiklab/sns-ext

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.1%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Minerva version for Seq-N-Slide pipeline (https://github.com/igordot/sns)

Basic Info
  • Host: GitHub
  • Owner: Naiklab
  • License: mit
  • Language: Shell
  • Default Branch: main
  • Homepage:
  • Size: 153 KB
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 2 years ago · Last pushed about 1 year ago
Metadata Files
Readme License Citation

README.md

SNS-EXT: Seq-N-Slide Extended for Minerva HPC

Athena Logo

A specialized adaptation of the Seq-N-Slide (SNS) pipeline optimized for Mount Sinai's Minerva High Performance Computing (HPC) environment with LSF job scheduling.

Overview

SNS-EXT provides automated workflows for common Illumina sequencing-based protocols, including:

  • RNA-seq: Differential gene expression analysis
  • ChIP-seq: Chromatin immunoprecipitation sequencing
  • ATAC-seq: Assay for transposase-accessible chromatin
  • WGBS/RRBS: Whole genome and reduced representation bisulfite sequencing
  • WES/WGS: Whole exome/genome variant detection
  • Species identification: Contaminant screening and quality control

Key Features

  • Minerva-optimized: Configured for LSF job scheduler and Minerva-specific resources
  • Modular design: Individual segments for flexible workflow construction
  • Quality control: Comprehensive QC metrics and reporting
  • Standardized outputs: Consistent file formats and directory structures
  • Resource management: Optimized memory and CPU allocation for HPC environment

Installation

```bash

Clone the repository

git clone https://github.com/Naiklab/sns-ext.git cd sns-ext

Make scripts executable

chmod +x run generate-settings gather-fastqs chmod +x routes/.sh chmod +x segments/.sh chmod +x scripts/*.sh ```

Quick Start

  1. Generate project settings:

bash ./generate-settings

  1. Gather FASTQ files:

bash ./gather-fastqs /path/to/fastq/directory

  1. Run analysis pipeline:

bash ./run [route] [sample-name]

Available Routes

| Route | Description | |-------|-------------| | rna-star | RNA-seq with STAR alignment | | rna-salmon | RNA-seq with Salmon quantification | | rna-rsem | RNA-seq with RSEM quantification | | chip | ChIP-seq analysis | | atac | ATAC-seq analysis | | wgbs | Whole genome bisulfite sequencing | | rrbs | Reduced representation bisulfite sequencing | | wes | Whole exome sequencing | | species | Species identification and contamination screening |

Directory Structure

text sns-ext/ ├── routes/ # Main analysis workflows ├── segments/ # Individual processing steps ├── scripts/ # Utility and analysis scripts ├── run # Main execution script ├── generate-settings # Project configuration utility └── gather-fastqs # FASTQ file organization utility

Configuration

The pipeline uses a settings file to configure analysis parameters. Key settings include:

  • Reference genome paths
  • Tool-specific parameters
  • Resource allocation (memory, CPU)
  • Output directory structure

Dependencies

  • LSF job scheduler (Minerva HPC)
  • Standard bioinformatics tools (STAR, Salmon, MACS2, etc.)
  • R with required packages
  • Python with necessary libraries

Original SNS Pipeline

This project is based on the original Seq-N-Slide pipeline:

GitHub: igordot/sns
Documentation: https://igordot.github.io/sns
DOI: DOI

Citation

If you use SNS-EXT in your research, please cite both this adaptation and the original SNS pipeline.

License

See LICENSE file for details.

Support

For issues specific to the Minerva adaptation, please open an issue in this repository. For general SNS questions, refer to the original documentation.

Owner

  • Name: Naik Lab
  • Login: Naiklab
  • Kind: organization
  • Location: United States of America

Citation (CITATION.cff)

cff-version: 1.0.0
message: "If you find this software useful, please cite it using these metadata."
title: "Seq-N-Slide (Minerva Version)"
abstract: "Alternative version of Seq-N-Slide, designed to work with Minerva HPC system at Mount Sinai. Automated workflows for common genomic sequencing-based protocols. Documentation is available at https://igordot.github.io/sns"
type: software
authors:
  - orcid: "https://orcid.org/0000-0003-4451-126X"
    given-names: Ikjot
    family-names: Sidhu
  - given-names: Igor
    family-names: Dolgalev
repository-code: "https://github.com/Naiklab/sns-ext"
url: "https://igordot.github.io/sns"
doi: 10.5281/zenodo.5550459
identifiers:
  - description: "Archived snapshots of Seq-N-Slide"
    type: doi
    value: "10.5281/zenodo.5550459"
  - description: "SciCrunch Research Resource Identifier for Seq-N-Slide"
    type: other
    value: "RRID:SCR_021752"
license: MIT

GitHub Events

Total
  • Push event: 7
  • Gollum event: 5
Last Year
  • Push event: 7
  • Gollum event: 5