vanishingglaciermags
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 3 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (10.9%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: michoug
- License: mit
- Language: Python
- Default Branch: main
- Size: 65.4 KB
Statistics
- Stars: 4
- Watchers: 2
- Forks: 1
- Open Issues: 0
- Releases: 2
Metadata Files
README.md
VanishingGlacierMAGs generation and analysis
This pipeline is used to generate MAGs from different individual assemblies and their respective reads for the VanishingGlaciers project. The pipeline is based on the Snakemake workflow management system and is designed to be run on a high-performance computing cluster.
Pipeline description
- This pipeline starts with different individual assemblies (
fastafiles) and their respective reads (mg.r{1,2}.preprocessed.fqfiles). - To reduce computational time, the reads are subsampled to 10% reads per sample and the contigs less than 1.5 kbp are removed.
- The subsampled reads are then mapped against the assemblies using BWA.
- The mapped reads are then used to bin the contigs using MetaBAT2, CONCOCT and MetaBinner.
- The bins are then optimized using DAS_Tool.
- CheckM2 is used to estimate the quality of the bins and only the ones that are 50% complete are kept.
- MDMCleaner reduces contamination from those bins.
- Next, bins are dereplicated with dRep to form MAGs and only bins with >70% completeness and < 10% contamination are kept.
- Read mapping against all the MAGs is done using BWA.
- And GtdbTk is used for the taxonomy.
- MGThermometer is used to measure the
optimal growth ratebased on the relative abundance ofFIVYWRELaminoacids- Optimal growth rate is measured as follows,
math OGT = 937 * F_{IVYWREL} − 335
- Optimal growth rate is measured as follows,
Setup
Conda
```bash
install miniconda3
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x8664.sh chmod u+x Miniconda3-latest-Linux-x8664.sh ./Miniconda3-latest-Linux-x86_64.sh # follow the instructions ```
Getting the repository including sub-modules
bash
git clone --recurse-submodules git@github.com:michoug/SnakemakeBinning.git
git checkout busi
Create the main snakemake environment
```bash
create venv
conda env create -f requirements.yaml -n "snakemake" ```
Run Setup
- Place your preprocessed/trim reads (e.g.
sample_r1.fastq.gzandsample_r2.fastq.gzfiles) in areadsfolder - Place the individual assemblies (e.g.
sample.fa) into anassemblyfolder - Modify the
config/config.yamlfile to change the different paths and eventually the different options - Modify the
config/all_samples.txtfile to include your samples
Without Slurm
snakemake -s workflow/Snakefile --configfile config/config.yaml --cores 28 --use-conda -rp
With Slurm
This part was mainly taken from @susheelbhanu nomis_pipeline
- Modify the
slurm.yamlfile by checkingpartition,qosandaccountthat heavily depends on your system - Modify the
sbatch.shfile by checking#SBATCH -p,#SBATCH --qos=and#SBATCH -Aoptions that heavily depends on your system
sbatch config/sbatch.sh
Owner
- Login: michoug
- Kind: user
- Repositories: 1
- Profile: https://github.com/michoug
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - family-names: "Michoud" given-names: "Gregoire" orcid: "https://orcid.org/0000-0003-1071-9900" - family-names: "Masimo" given-names: "Bourquin" orcid: "https://orcid.org/0000-0003-4187-7485" - family-names: "Susheel Banu" given-names: "Busi" orcid: "https://orcid.org/0000-0001-7559-3400" title: "VanishingGlacierMAGs" doi: 10.5281/zenodo.10648955 version: 1.0.0 date-released: 2024-02-11 url: "https://github.com/michoug/VanishingGlacierMAGs"
GitHub Events
Total
- Watch event: 3
Last Year
- Watch event: 3