cactus-snakemake

Snakemake workflows for performing whole genome alignment with Cactus efficiently on SLURM clusters

https://github.com/harvardinformatics/cactus-snakemake

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 11 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.2%) to scientific vocabulary
Last synced: 6 months ago · JSON representation ·

Repository

Snakemake workflows for performing whole genome alignment with Cactus efficiently on SLURM clusters

Basic Info
  • Host: GitHub
  • Owner: harvardinformatics
  • License: mit
  • Language: Python
  • Default Branch: main
  • Homepage:
  • Size: 33.4 MB
Statistics
  • Stars: 3
  • Watchers: 5
  • Forks: 1
  • Open Issues: 2
  • Releases: 4
Created about 3 years ago · Last pushed 8 months ago
Metadata Files
Readme Contributing License Citation

README.md

DOI

These pipelines facilitate the running of the Cactus whole genome alignment tool efficiently on SLURM (and possibly other) clusters.

Tutorials available on the FAS Informatics website:

<Whole genome alignment with Progressive Cactus>

<Adding a genome to a whole genome alignment>

<Adding an outgroup to a whole genome alignment>

<Replacing a genome in a whole genome alignment>

<Pangenome inference with Minigraph-Cactus>

⚠️ Important! cactus-snakemake v3.0.0 and later requires Cactus v2.9.9 or later.

Due to bug fixes in Cactus, v3.0.0+ of cactus-snakemake is only compatibile with Cactus v2.9.9 or later. Don't worry, cactus-snakemake will always use the latest version of Cactus available unless you specify otherwise in your config file. However, if you do wish to use an older version of Cactus, you can use cactus-snakemake v2.1.0.

Installation

Installation is done simply by cloning the repository:

{bash} git clone https://github.com/harvardinformatics/cactus-snakemake.git

Alternatively, you could just manually download the latest release and unzip it and it should be good to go.

However, Snakemake and Singularity are required as dependencies. For more information, see the setup instructions in any of the tutorials linked above.

Usage

Each pipeline has a different config file that is required to specify input and output options and cluster resources.

With the config file setup, the pipelines are generally run as:

{bash} snakemake -j <number of jobs to submit simultaneously> -e slurm -s </path/to/snakefile.smk> --configfile </path/to/your/snakmake-config.yml>

💡 Tip: Cannon cluster Snakemake plugin

If you are on the Harvard Cannon cluster, you can use the snakemake-executor-plugin-cannon to do automatic partition selection instead of the generic SLURM executor plugin. Install the plugin with pip or mamba and then use -e cannon in all of your commands instead of -e slurm.

For more information, see the setup and run instructions in each of the tutorials linked above.

Meta config options

Several meta config options exist across pipelines as pseudo-command line flags

| Command line flag | Description | | -------------------- | ----------- | | --config display=T | Print the current config settings and exit | | --config info=T | Display some information about the pipelines, including version and last commit date | | --config version=T | Display the version of the pipeline | | --config prep=T | Run all pre-processing steps and exit (e.g. output directory creation, cactus image download, running cactus-prepare). | | --config debug=T | The same as prep, but display extra information about the pre-processing steps. |

Citation

If you use this software, please cite:

Latest version

Thomas, G. (2025). Snakemake workflows for Cactus (Version 3.0.0) [Computer software]. https://doi.org/10.5281/zenodo.15699752

v2.1.0

Thomas, G. (2025). Snakemake workflows for Cactus (Version 2.1.0) [Computer software]. https://doi.org/10.5281/zenodo.15596990

To cite a specific version:
See the full list of versions and DOIs here.
(Find the version you used in the Zenodo version history.)

Owner

  • Name: Harvard Informatics
  • Login: harvardinformatics
  • Kind: organization
  • Location: Cambridge MA

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Thomas"
  given-names: "Gregg"
title: "Snakemake workflows for Cactus"
version: 3.0.0
doi: 10.5281/zenodo.15699752
date-released: 2025-06-05
url: "https://github.com/harvardinformatics/cactus-snakemake"

GitHub Events

Total
  • Create event: 10
  • Release event: 2
  • Issues event: 10
  • Watch event: 1
  • Delete event: 1
  • Issue comment event: 18
  • Push event: 64
  • Pull request event: 6
  • Fork event: 1
Last Year
  • Create event: 10
  • Release event: 2
  • Issues event: 10
  • Watch event: 1
  • Delete event: 1
  • Issue comment event: 18
  • Push event: 64
  • Pull request event: 6
  • Fork event: 1

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 6
  • Total pull requests: 4
  • Average time to close issues: 14 days
  • Average time to close pull requests: 33 minutes
  • Total issue authors: 5
  • Total pull request authors: 1
  • Average comments per issue: 0.83
  • Average comments per pull request: 0.0
  • Merged pull requests: 4
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 6
  • Pull requests: 4
  • Average time to close issues: 14 days
  • Average time to close pull requests: 33 minutes
  • Issue authors: 5
  • Pull request authors: 1
  • Average comments per issue: 0.83
  • Average comments per pull request: 0.0
  • Merged pull requests: 4
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • gwct (2)
  • faye-romero (1)
  • zcalamari (1)
  • KristinaGagalova (1)
  • jasongallant (1)
Pull Request Authors
  • gwct (4)
Top Labels
Issue Labels
bug (2)
Pull Request Labels