symbnet

Teaching materials for SymbNET course: from MAGs to GEMs

https://github.com/franciscozorrilla/symbnet

Science Score: 59.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: ncbi.nlm.nih.gov, nature.com, zenodo.org
  • Committers with academic emails
    1 of 1 committers (100.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.3%) to scientific vocabulary

Keywords

assembled genome genomes metabolic metabolism metagem metagenome modeling models symbnet
Last synced: 6 months ago · JSON representation

Repository

Teaching materials for SymbNET course: from MAGs to GEMs

Basic Info
Statistics
  • Stars: 14
  • Watchers: 1
  • Forks: 3
  • Open Issues: 0
  • Releases: 1
Topics
assembled genome genomes metabolic metabolism metagem metagenome modeling models symbnet
Created about 4 years ago · Last pushed about 2 years ago
Metadata Files
Readme License Citation

README.md

SymbNET From Metagenomics to Metabolic Interactions
EMBL-EBI Virtual Course (Day 5)

DOI

Learning Outcomes

  • Generate genome-scale metabolic models (GEMs) from metagenome assembled genomes (MAGs)
  • Predict metabolic interactions within bacterial communities
  • Characterize communities using competition-cooperation plot
  • Explore uncertainty in GEM reconstruction and simulation
  • Pros and cons of using reference genomes vs metagenome-assembled or single-amplified genomes for metabolic modeling

Recommended Software

| Tool | Task | GitHub | Reference | | :--- | :--- | :--- | :--- | | CarveMe | Build GEMs from MAGs |Repo | Paper | | SMETANA | Predict metabolic interactions between GEMs | Repo | Paper | | metaGEM | Metagenomic metabolic modeling workflow | Repo | Paper | | Snakemake | Workflow management and reproducibility | Repo | Paper |

Pictured below is the metaGEM workflow for reconstructing and simulating metagenome based metabolic models. This training module will focus on how to generate and simulate communities of metabolic models using CarveMe and SMETANA.

Contents

  • /genomes/ ORF annotated protein fasta files (.faa)
  • /models/ pre-generated CarveMe GEMs for simulation (.xml)
  • /ensembles/ pre-generated ensemble models for network uncertainty (.xml)
  • /simulations/ pre-computed SMETANA simulations (.tsv)
  • /data/ various metadata, taxonomic assignments, media files, etc.
  • /scripts/ markdown, r-markdown, and jupyter notebooks for each exercise
  • /plots/ visualization of results

Exercises

Part I

  1. Start by cloning this repo
  2. Use CarveMe to generate GEMs for a bacterial community
  3. Visualize model metrics across species
  4. Use SMETANA detailed algorithm to predict metabolic interactions between species
  5. Visualize detailed interactions with alluvial diagrams

Part II

  1. Use SMETANA global algorithm to generate MIP & MRO metrics for community of GEMs
  2. Visualize communities on cooperation-competition plot
  3. Generate ensemble models (optional)
  4. Quantify network uncertainty (optional)
  5. Discussion of methods, results, and interpretation

Datasets

The following table describes in detail the 6 small bacterial communities of 5 species that we will consider for metabolic modeling. These include MAG,SAG, and reference genome-based communities; the samples also span the human gut, kefir, and soil habitats.

| Microbiome |Genome type| Condition | Species | Links | | :--- | :--- | :--- | :--- | :--- | | Human gut | MAGs |Normal Glucose Tolerance (NGT, ERR260255) |

  • B. uniformis
  • R. bromii
  • B. wexlerae
  • E. rectale
  • F. saccharivorans
| | | Human gut | MAGs |Impaired Glucose Tolerance (IGT, ERR260172) |
  • B. uniformis
  • R. bromii
  • B. wexlerae
  • E. rectale
  • F. saccharivorans
| | | Human gut | MAGs | Type II Diabetes (T2D, ERR260140) |
  • B. uniformis
  • R. bromii
  • B. wexlerae
  • E. rectale
  • F. saccharivorans
| | | Human gut| Reference genomes | Reference genomes taken from RefSeq |
  • B. uniformis
  • R. bromii
  • B. wexlerae
  • E. rectale
  • F. saccharivorans
| | | Kefir | SAGs | Fermented with German grains (GER6) |
  • L. mesenteroides
  • L. lactis
  • A. fabarum
  • L. kefiranofaciens
  • L. kefiri
| | | Soil | MAGs | Calcarosols from Uluru, Australia (ERR671933) |
  • fThermoleophilaceae
  • fHerpetosiphonaceae
  • fPhormidiaceae
  • fGeodermatophilaceae
  • f_Rubrobacteraceae
| |

Metabolic Modeling Repos

Tools

  • metaGEM: Reconstruction and simulation of genome scale metabolic models directly from metagenomes
  • DesignMC: Design microbial communities for production of specific target compounds using GEMs
  • HiOrCo: Compute higher order cooccurence using abundance across samples
  • Reframed: Metabolic modeling package

Resources

Please cite literature if you make use of relevant tools and/or resources.

Suggested Reading

  • Intro to FBA: What is flux balance analysis?
  • CarveMe: Fast automated reconstruction of genome-scale metabolic models for microbial species and communities
  • SMETANA: Metabolic dependencies drive species co-occurrence in diverse microbial communities
  • metaGEM: Reconstruction of genome scale metabolic models directly from metagenomes
  • Human gut study: Nutritional preferences of human gut bacteria reveal their metabolic idiosyncrasies
  • Kefir study: Metabolic cooperation and spatiotemporal niche partitioning in a kefir microbial community
  • Cooccurrence study: Polarization of microbial communities between competitive and cooperative metabolism

Software Requirements

The following software will be pre-installed in your virtual machines. In the future, you can set up these software requirements in a conda environment on your cluster or local machine using the recipe files under the /conda/ subdirectory. The exact dependencies and versions may vary based on your operating system. For example, to set up a conda environment on an M1 Macbook, assuming you are in the SymbNET repo root folder

bash $ conda env create -f conda/osx-64_permissive.yml

Alternatively, you can manually create an environment using conda create bash $ conda create --yes -n symbnet

You can activate the environment using source or conda command bash $ source activate symbnet

Then pip install software listed below as required, e.g. to install CarveMe and SMETANA bash $ pip install --user carveme smetana

Then conda install software listed below as required, e.g. to install prodigal and diamond bash $ conda install -c bioconda prodigal diamond

You will also need to obtain a free academic initiative license from IBM to use the academic version of CPLEX. You must then follow the installation instructions to set up CPLEX on your local machine or cluster. Refer to your local cluster's wiki page to see if they have a load-able CPLEX module that you can use. CPLEX versions 12.7-12.9 are recommended. Note that the free version of CPLEX that can be obtained with pip install DOES NOT WORK FOR BIOLOGICAL NETWORKS OF OUR SIZE.

Core

Dependencies

Additonal packages

About SymbNET

SymbNET is a European network for host-microbe interactions research, funded by the European Unions Horizon 2020 research and innovation programme. The project coordinated by FCG-IGC (Instituto Gulbenkian de Cincia, Portugal), brings together the world-leading research institutions EMBL (European Molecular Biology Laboratory, Germany), CAU (Christian-Albrechts-Universitt zu Kiel, Germany), and UNIL (Universit de Lausanne, Switzerland), and a local widening partner ITQB NOVA (Instituto de Tecnologia Qumica e Biolgica, Portugal).

This project has received funding from the European Unions Horizon 2020 research and innovation programme under grant agreement N 952537

Contributors

  • Francisco Zorrilla, MRC Toxicology Unit - University of Cambridge
  • Eva-Maria Geissen, Center for Biological Modelling - EMBL Heidelberg
  • Maria Zimmermann-Kogadeeva, EMBL Heidelberg
  • Kiran R. Patil, MRC Toxicology Unit - University of Cambridge

Owner

  • Name: Francisco Zorrilla
  • Login: franciscozorrilla
  • Kind: user
  • Location: Cambridge, UK
  • Company: MRC Toxicology, University of Cambridge

PhD Student in the Patil Lab. Interested in metagenomics, metabolic modeling of microbial communities, and machine learning

GitHub Events

Total
  • Watch event: 2
Last Year
  • Watch event: 2

Committers

Last synced: 8 months ago

All Time
  • Total Commits: 370
  • Total Committers: 1
  • Avg Commits per committer: 370.0
  • Development Distribution Score (DDS): 0.0
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Francisco Zorrilla f****4@c****k 370
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 8 months ago

All Time
  • Total issues: 0
  • Total pull requests: 1
  • Average time to close issues: N/A
  • Average time to close pull requests: less than a minute
  • Total issue authors: 0
  • Total pull request authors: 1
  • Average comments per issue: 0
  • Average comments per pull request: 0.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • franciscozorrilla (2)
Top Labels
Issue Labels
Pull Request Labels