https://github.com/bayer-group/seqteleporter

SeqTeleporter: A tool designed to optimize and streamline the design of combinatorial protein library for protein engineering.

https://github.com/bayer-group/seqteleporter

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.6%) to scientific vocabulary

Keywords

beat-undefined

Keywords from Contributors

interpretability standardization animal hack autograder report
Last synced: 9 months ago · JSON representation

Repository

SeqTeleporter: A tool designed to optimize and streamline the design of combinatorial protein library for protein engineering.

Basic Info
  • Host: GitHub
  • Owner: Bayer-Group
  • License: bsd-3-clause
  • Language: Jupyter Notebook
  • Default Branch: open-source
  • Homepage:
  • Size: 65 MB
Statistics
  • Stars: 6
  • Watchers: 1
  • Forks: 1
  • Open Issues: 3
  • Releases: 0
Topics
beat-undefined
Created over 1 year ago · Last pushed over 1 year ago
Metadata Files
Readme Contributing Codeowners

README.md

Icon_small.png

SeqTeleporter: A Rapid Sequence Space Explorer For Combinatorial Protein Library

CI Build Static Typing Demo Page

Tutorial

Setup environment

Compatible python version: 3.9, 3.10, 3.11

  1. pip install --upgrade pip
  2. pip install -r requirements-dev.txt

Run the tool

If you wish to run the tool please check out the tutorial here

Abstract

In the highly dynamic field of pharmaceutical R&D, the development of therapeutic biologics demands innovative solutions that enhance efficiency and reduce costs while maintaining robustness and reliability. We present SeqTeleporter, a state-of-the-art tool designed to address these needs by optimizing the process of multi-site mutagenesis, a crucial step in protein engineering. SeqTeleporter empowers the creation of any desired mutant combinations from multiple positions of interest, with numerous variations for each position. It intelligently divides sequences with mutations, allowing for their reuse across multiple design-build-test-learn cycles, thereby significantly accelerating the construction of protein engineering libraries. To illustrate the power of SeqTeleporter, consider the scenario of targeting any preferred combinations of mutants from 24 positions of interest, with 2 variations for each position. This allows for the creation of over 10^7 distinct combinations in the sequence space. The tool is designed to facilitate the one-step construction of any desired combination found within over 10^7 "teleportable" coordinates in the sequence space. This capability enables SeqTeleporter to move beyond the limitations of traditional protein library design. It provides a unique capability to instantly navigate, explore, and sample extensive sequence spaces in a single step, all while optimizing costs and ensuring a reliable process. SeqTeleporter is more than just a multi-site mutagenesis tool. It's potential for seamless integration with machine learning processes further elevates its significance, positioning it as a key player in next-generation protein engineering strategies. In summary, SeqTeleporter emerges as a valuable asset in the field of therapeutic biologics engineering. By enhancing the efficiency of the protein engineering process, it brings us one step closer to our goal: improving patient lives through the development of effective and affordable biological therapies.

concept_picture.jpg Figure 1. SeqTeleporter concept enabling instantaneous sampling of vast sequence space and integration to MoClo framework.\ (A) By distributing the mutations on interest evenly on each module of the gene of interest, we unlock the ability to probe a vast variant space by recombining modules containing part of the desired genotype. For instance, a gene with 24 mutations sites and 2 variations for each site results in around 16 million distinct genotypes. By distributing the mutation sites to six partitions of the gene, we only required 16 distinct fragments for each partition - in total 96 fragments – to cover the desired variant space of 16 million. (B) The variant space of the desired mutations can be conceptualized as the 3-dimentional space, with x and y axis being all 163 (=4096) possible genotype combinations of 3 of the six partitions, and the z-axis being the fitness of each of the variant. By combining one fragment of desired genotype from each partition, we can obtain any genotype of interest within the 16 million distinct genotypes. In another words, we conceptually “teleport” to any coordinate of interest in the variant space. (C) SeqTeleporter optimize the combinatorial library generation considering the integration of users existing MoClo framework for optimal protein expression and purification.

Process Overview

In this section you can find a detailed visual overview of the process of SeqTeleporter.

ProcessOverview.jpg Figure 2 SeqTeleporter Components and Workflow Overview.

References

  1. Codon Usage Data: Codon usage tabulated from the international DNA sequence databases: status for the year 2000. Nakamura, Y., Gojobori, T. and Ikemura, T. (2000) Nucl. Acids Res. 28, 292.

Owner

  • Name: Bayer Open Source
  • Login: Bayer-Group
  • Kind: organization

Science for a better life

GitHub Events

Total
  • Push event: 12
Last Year
  • Push event: 12

Committers

Last synced: 10 months ago

All Time
  • Total Commits: 49
  • Total Committers: 2
  • Avg Commits per committer: 24.5
  • Development Distribution Score (DDS): 0.224
Past Year
  • Commits: 49
  • Committers: 2
  • Avg Commits per committer: 24.5
  • Development Distribution Score (DDS): 0.224
Top Committers
Name Email Commits
Mu-En Chung c****n@g****m 38
github-actions[bot] 4****] 11

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 3
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 1
  • Total pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 3
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 1
  • Pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • chungmuen (3)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels

Dependencies

.github/workflows/build.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
.github/workflows/generate_demo.yml actions
  • actions/checkout v4 composite
  • actions4git/add-commit-push v1 composite
.github/workflows/static_typing.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
requirements-dev.txt pypi
  • biopython * development
  • coverage * development
  • dnachisel * development
  • docstr-coverage * development
  • flake8 * development
  • jupyter * development
  • mypy * development
  • numpy ==1.26.4 development
  • openpyxl ==3.1.2 development
  • pandas ==2.2.0 development
  • pandas-stubs * development
  • pyarrow ==15.0.0 development
  • pytest * development
  • python-dotenv ==1.0.1 development
  • python-pptx * development
  • pywin32 ==306 development
  • sphinx * development
  • sphinx-rtd-dark-mode * development
  • sphinx-rtd-theme * development
  • sphinx_mdinclude * development
  • static-frame * development
  • testbook * development
  • types-requests * development
  • xlrd * development