psnpbind-build

https://github.com/ammar257ammar/psnpbind-build

Last synced: 10 months ago · JSON representation ·

Repository

Basic Info

Host: GitHub
Owner: ammar257ammar
License: agpl-3.0
Default Branch: master
Size: 505 KB

Statistics

Stars: 2
Watchers: 1
Forks: 2
Open Issues: 0
Releases: 1

Created about 6 years ago · Last pushed over 2 years ago

Metadata Files

Readme License Citation Authors

PSnpBind, a workflow to construct binding site mutated protein-ligand database

GitHub GitHub release (latest by date)

This repository is part of the "PSnpBind, a database to highlight pocket SNPs effects on protein-ligand binding affinity" project and it is the main repository to reproduce the project methodology and results.

NOTE: all the following instructions are for Linux operating system and tested on Ubuntu 18. These tools are not tested on other systems like Windows or MacOS.

Clone this repository to the location of your preference, then follow the next sections!

First, build the Docker images for the required tools

Follow the instructions inside each one of the following repositories:

| Tool | More Info | | ------------------------------------------------------------ | ------------------------------------------------------------ | | PSBAP Core | GitHub top language Lines of code GitHub GitHub release (latest by date) | | PSBAP FoldX | GitHub | | PSBAP Gromacs | GitHub | | PSBAP OpenBabel | GitHub | | PSBAP Vina | GitHub |

Second, download the required datasets to the corresponding locations

Download ChEMBL database version 25 from the URL and unpack it to "data/chembl/chembl_25.sdf" :

ftp://ftp.ebi.ac.uk/pub/databases/chembl/ChEMBLdb/releases/chembl25/chembl25.sdf.gz

Download UniProt natural variants database from the URL and unpack it to "data/uniprotvariation/homosapiens_variation.txt":

ftp://ftp.uniprot.org/pub/databases/uniprot/currentrelease/knowledgebase/variants/homosapiens_variation.txt.gz

Register an account on http://www.pdbbind-cn.org and login to download CASF2016 from the URL and unpack it to "data/pdbbind/CASF2016/coreset":

http://www.pdbbind-cn.org/download/CASF-2016.tar.gz

The folders of the PDB entries should be immediately under the mentioned path.

Define the following environment variables in the terminal:

The path to the clone repository "pocket-snps-effect-binding-affinity" will be called PSBAP_ROOT for the remaining of this Readme file.

bash export CONFIG_PATH=PSBAP_ROOT/config export DATA_PATH=PSBAP_ROOT/data export TSV_PATH=PSBAP_ROOT/tsv export FEATURES_PATH=PSBAP_ROOT/features export PROCESSING_PATH=PSBAP_ROOT/processing

Third, start applying the PSBAP steps as following:

Filter CASF to include structure with quality <= 2.5 Angstrom, generate and download their (SIFTS, FASTA and DSSP), and map UniProt variant to the selected PDBbind protiens.

bash docker run -v $CONFIG_PATH:/config \ -v $PROCESSING_PATH:/processing \ -v $DATA_PATH:/data \ -v $TSV_PATH:/tsv \ -v $FEATURES_PATH:/features \ --name psbap-core --rm \ psbap-core -op init

Map the selected UniProt variants to PDBbind proteins pockets, and prepare the folder structure for FoldX:

bash docker run -v $CONFIG_PATH:/config \ -v $PROCESSING_PATH:/processing \ -v $DATA_PATH:/data \ -v $TSV_PATH:/tsv \ -v $FEATURES_PATH:/features \ --name psbap-core --rm \ psbap-core -op pocket-snps-mapping-and-foldx-prep

Run FoldX to introduce the mapped pocket's SNPs onto the proteins PDBs (choose NUMOFTHREADS depending on the amount of CPUs you want to allocate for FoldX):

```bash docker run -it -v $PROCESSINGPATH/foldx:/pdb \ --name psbap-foldx --rm \ psbap-foldx RepairPDB NUMOF_THREADS

# After finish, run the next command:

docker run -it -v $PROCESSINGPATH/foldx:/pdb \ --name psbap-foldx --rm \ psbap-foldx BuildModel NUMOF_THREADS ```

Generate FoldX report:

bash docker run -v $CONFIG_PATH:/config \ -v $PROCESSING_PATH:/processing \ -v $DATA_PATH:/data \ -v $TSV_PATH:/tsv \ -v $FEATURES_PATH:/features \ --name psbap-core --rm \ psbap-core -op foldx-report

Perform energy minimization on the proteins structures:

```bash docker run -it -v $PROCESSINGPATH/foldx:/pdb \ --name psbap-gromacs --rm \ psbap-gromacs prepare NUMOF_THREADS true

# After finish, run the next command:

docker run -it -v $PROCESSINGPATH/foldx:/pdb \ --name psbap-gromacs --rm \ psbap-gromacs em NUMOF_THREADS true

# After finish, run the next command:

docker run -it -v $PROCESSINGPATH/foldx:/pdb \ --name psbap-gromacs --rm \ psbap-gromacs export NUMOF_THREADS true ```

Prepare ligands folders for the corresponding selected PDBbind entries:

bash docker run -v $CONFIG_PATH:/config \ -v $PROCESSING_PATH:/processing \ -v $DATA_PATH:/data \ -v $TSV_PATH:/tsv \ -v $FEATURES_PATH:/features \ --name psbap-core --rm \ psbap-core -op prepare-ligands-folders

Select similar ligands, split, perform energy minimization (MMFF94) using OpenBabel:

```bash docker run -v $PROCESSINGPATH/ligands:/pdb \ -v $DATAPATH/chembl:/index \ --name psbap-openbabel --rm \ psbap-openbabel search-and-split

# After finish, run the next command:

docker run -v $PROCESSINGPATH/ligands:/pdb \ -v $DATAPATH/chembl:/index \ --name psbap-openbabel --rm \ psbap-openbabel minimize ```

Prepare ligands information (IDs, Tanimoto index):

bash docker run -v $CONFIG_PATH:/config \ -v $PROCESSING_PATH:/processing \ -v $DATA_PATH:/data \ -v $TSV_PATH:/tsv \ -v $FEATURES_PATH:/features \ --name psbap-core --rm \ psbap-core -op ligands-tanimoto-dataset

Prepare folders and configurations for AutoDock Vina:

bash docker run -v $CONFIG_PATH:/config \ -v $PROCESSING_PATH:/processing \ -v $DATA_PATH:/data \ -v $TSV_PATH:/tsv \ -v $FEATURES_PATH:/features \ --name psbap-core --rm \ psbap-core -op prepare-vina-folders-config

Perform docking using AutoDock Vina by running the following script inside a bash script file :

```bash

!/bin/bash

cd $PROCESSING_PATH/vina-docking

for PDB in *; do

docker run -it  -v $PROCESSING_PATH/vina-docking:/pdb \
                --name psbap-vina --rm \
                psbap-vina NUM_OF_THREADS PDB

done ```

Extract Vina docking results:

bash docker run -v $CONFIG_PATH:/config \ -v $PROCESSING_PATH:/processing \ -v $DATA_PATH:/data \ -v $TSV_PATH:/tsv \ -v $FEATURES_PATH:/features \ --name psbap-core --rm \ psbap-core -op generate-dockings-results

Owner

Name: Ammar Ammar
Login: ammar257ammar
Kind: user
Location: The Netherlands
Company: Maastricht University

Repositories: 14
Profile: https://github.com/ammar257ammar

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: Ammar
    given-names: Ammar
title: "PSnpBind, a workflow to construct binding site mutated protein-ligand database"
version: v1.0.0
date-released: 2021-07-18
url: "https://github.com/ammar257ammar/PSnpBind-Build"
preferred-citation:
  type: article
  authors:
    - family-names: Ammar
      given-names: Ammar
    - family-names: Cavill
      given-names: Rachel
    - family-names: Evelo
      given-names: Chris T.
    - family-names: Willighagen
      given-names: Egon L.
  title: "PSnpBind: a database of mutated binding site protein–ligand complexes constructed using a multithreaded virtual screening workflow"
  year: 2022
  month: 2
  day: 28
  journal: Journal of Cheminformatics
  doi: 10.1186/s13321-021-00573-5
  url: https://doi.org/10.1186/s13321-021-00573-5

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

psnpbind-build

Science Score: 54.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

PSnpBind, a workflow to construct binding site mutated protein-ligand database

First, build the Docker images for the required tools

Second, download the required datasets to the corresponding locations

Third, start applying the PSBAP steps as following:

!/bin/bash

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year