cactus-snakemake
Snakemake workflows for performing whole genome alignment with Cactus efficiently on SLURM clusters
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 11 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (12.2%) to scientific vocabulary
Repository
Snakemake workflows for performing whole genome alignment with Cactus efficiently on SLURM clusters
Basic Info
Statistics
- Stars: 3
- Watchers: 5
- Forks: 1
- Open Issues: 2
- Releases: 4
Metadata Files
README.md
These pipelines facilitate the running of the Cactus whole genome alignment tool efficiently on SLURM (and possibly other) clusters.
Tutorials available on the FAS Informatics website:
<Whole genome alignment with Progressive Cactus>
<Adding a genome to a whole genome alignment>
<Adding an outgroup to a whole genome alignment>
<Replacing a genome in a whole genome alignment>
<Pangenome inference with Minigraph-Cactus>
⚠️ Important! cactus-snakemake v3.0.0 and later requires Cactus v2.9.9 or later.
Due to bug fixes in Cactus, v3.0.0+ of cactus-snakemake is only compatibile with Cactus v2.9.9 or later. Don't worry, cactus-snakemake will always use the latest version of Cactus available unless you specify otherwise in your config file. However, if you do wish to use an older version of Cactus, you can use cactus-snakemake v2.1.0.
Installation
Installation is done simply by cloning the repository:
{bash}
git clone https://github.com/harvardinformatics/cactus-snakemake.git
Alternatively, you could just manually download the latest release and unzip it and it should be good to go.
However, Snakemake and Singularity are required as dependencies. For more information, see the setup instructions in any of the tutorials linked above.
Usage
Each pipeline has a different config file that is required to specify input and output options and cluster resources.
With the config file setup, the pipelines are generally run as:
{bash}
snakemake -j <number of jobs to submit simultaneously> -e slurm -s </path/to/snakefile.smk> --configfile </path/to/your/snakmake-config.yml>
💡 Tip: Cannon cluster Snakemake plugin
If you are on the Harvard Cannon cluster, you can use the snakemake-executor-plugin-cannon to do automatic partition selection instead of the generic SLURM executor plugin. Install the plugin with pip or mamba and then use
-e cannonin all of your commands instead of-e slurm.
For more information, see the setup and run instructions in each of the tutorials linked above.
Meta config options
Several meta config options exist across pipelines as pseudo-command line flags
| Command line flag | Description |
| -------------------- | ----------- |
| --config display=T | Print the current config settings and exit |
| --config info=T | Display some information about the pipelines, including version and last commit date |
| --config version=T | Display the version of the pipeline |
| --config prep=T | Run all pre-processing steps and exit (e.g. output directory creation, cactus image download, running cactus-prepare). |
| --config debug=T | The same as prep, but display extra information about the pre-processing steps. |
Citation
If you use this software, please cite:
Latest version
Thomas, G. (2025). Snakemake workflows for Cactus (Version 3.0.0) [Computer software]. https://doi.org/10.5281/zenodo.15699752
v2.1.0
Thomas, G. (2025). Snakemake workflows for Cactus (Version 2.1.0) [Computer software]. https://doi.org/10.5281/zenodo.15596990
To cite a specific version:
See the full list of versions and DOIs here.
(Find the version you used in the Zenodo version history.)
Owner
- Name: Harvard Informatics
- Login: harvardinformatics
- Kind: organization
- Location: Cambridge MA
- Repositories: 64
- Profile: https://github.com/harvardinformatics
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - family-names: "Thomas" given-names: "Gregg" title: "Snakemake workflows for Cactus" version: 3.0.0 doi: 10.5281/zenodo.15699752 date-released: 2025-06-05 url: "https://github.com/harvardinformatics/cactus-snakemake"
GitHub Events
Total
- Create event: 10
- Release event: 2
- Issues event: 10
- Watch event: 1
- Delete event: 1
- Issue comment event: 18
- Push event: 64
- Pull request event: 6
- Fork event: 1
Last Year
- Create event: 10
- Release event: 2
- Issues event: 10
- Watch event: 1
- Delete event: 1
- Issue comment event: 18
- Push event: 64
- Pull request event: 6
- Fork event: 1
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 6
- Total pull requests: 4
- Average time to close issues: 14 days
- Average time to close pull requests: 33 minutes
- Total issue authors: 5
- Total pull request authors: 1
- Average comments per issue: 0.83
- Average comments per pull request: 0.0
- Merged pull requests: 4
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 6
- Pull requests: 4
- Average time to close issues: 14 days
- Average time to close pull requests: 33 minutes
- Issue authors: 5
- Pull request authors: 1
- Average comments per issue: 0.83
- Average comments per pull request: 0.0
- Merged pull requests: 4
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- gwct (2)
- faye-romero (1)
- zcalamari (1)
- KristinaGagalova (1)
- jasongallant (1)
Pull Request Authors
- gwct (4)