evans-yamamoto_et_al_2021
Jupyter notebooks for BFG-PCA related simulation and analysis.
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 1 DOI reference(s) in README -
✓Academic publication links
Links to: ncbi.nlm.nih.gov -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (14.2%) to scientific vocabulary
Keywords
Repository
Jupyter notebooks for BFG-PCA related simulation and analysis.
Basic Info
Statistics
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
Barcode Fusion Genetics related codes
This page explains how to install and execute the BFG-PCA codes. If you have any questions, please don't hesitate to post a question in the Discussions.
Please make sure you have appropriate Python and pip before starting.
sh
Python version >=3.5
pip version >= 1.1.0
Dependencies :
sh
numpy version >=1.19
tqdm version >=2.2.4
To install these pakcages, first clone this repository by
sh
git clone https://github.com/DanYamamotoEvans/BFG-PCA.git
Next, go to the location of the BFG-PCA folder in the terminal, and install the dependencies by
sh
pip install .
Other core programs to install:
- Jupyter-notebook
sh
pip install jupyterlab
- Commandline BLAST+
Follow the instruction manual for installation. Set the PATH of the binary file.
Execute the following to see if installation is complete.
sh
blastn -help
Overview
This script was built to perform experimental plans and data analysis for Barcode Fusion Genetics screenings. There are four main steps in this suit, which I have prepared jupyter-notebooks for each.
- Monte-Carlo simulation
- Barcode calling
- Normalization
- Performance measure
- (Visualization, You will need to install R.)
Monte-carlo simulation of BFG screening proccess
Since BFG screenings have multiple sampling steps while handling a complex pool of strains, we suimulate the sampling process with a Monte-Carlo simulation. This notebook follows the procedures of BFG screenings, and allows the user to estimate the nessesary paramaters for sampling.
Barcode calling
This notebook creates the BLAST databse based on your barcode database file, and performs BLAST on the fastq files you provide. The results will be parsed to combine the count data. Please look at the wiki for input file specifications.
Normalization
This notebook normalizes the count data, and compute raw PPI signals based on the barcode counts in the control condition and auto-activity level. It will also output some csv files for plotting stats.
Performance measure
Based on the normalized scores, this notebook computes the agreement against the BioGRID database. It willcomute the agreement for various percentile values of the PPI scores generated from multiple replicates.
Visualization
This notebook will help you plot the data you obtained from the BFG screening. Please go to the visualization page in the wiki for more details on the plots.
References
- Yachie et al, 2016 / Initial report of BFG. The codes here were built based on perl scripts provided from Dr. Nozomu Yachie.
- Evans-Yamamamto et al, 2022, Nucleic Acids Research / This repositry was created in part of this work to make BFG-PCA accessible.
Citation
Pleae cite Evans-Yamamamto et al, 2022, Nucleic Acids Research if you use this repository.
Owner
- Name: Landrylab
- Login: Landrylab
- Kind: organization
- Email: landrylaboratory@gmail.com
- Website: http://landrylab.ibis.ulaval.ca/
- Repositories: 13
- Profile: https://github.com/Landrylab
Evolutionary Cell and Systems Biology
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please our paper as below." authors: - family-names: "D. Evans-Yamamoto et al" given-names: " " orcid: "https://orcid.org/0000-0001-6467-3827" title: "BFG-PCA: tools and resources that expand the potential for binary protein interaction discovery" version: 1.0.0 doi: 10.1101/2021.07.27.453987 date-released: 2021-07-27 url: "https://github.com/DanYamamotoEvans/BFG-PCA"
GitHub Events
Total
Last Year
Dependencies
- numpy >=1.19
- tqdm >=2.2.4