https://github.com/broadinstitute/lincs-profiling-complementarity

Analyzing and comparing signal found in different profiling technologies

https://github.com/broadinstitute/lincs-profiling-complementarity

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 2 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.8%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Analyzing and comparing signal found in different profiling technologies

Basic Info
  • Host: GitHub
  • Owner: broadinstitute
  • License: bsd-3-clause
  • Language: Jupyter Notebook
  • Default Branch: master
  • Homepage:
  • Size: 1.65 GB
Statistics
  • Stars: 7
  • Watchers: 3
  • Forks: 5
  • Open Issues: 15
  • Releases: 0
Created almost 6 years ago · Last pushed almost 4 years ago
Metadata Files
Readme License

README.md

Morphology and gene expression profiling provide complementary information for mapping cell state

DOI

Assessing information contained in different profiling data modalities.

Assays

We evaluated two assays:

  • L1000 (gene expression)
  • Cell Painting (morphology)

Perturbations

We tested each assay in their ability to measure cell states from 1,327 compound perturbations across 6 doses (0.04M, 0.12M, 0.37M, 1.11M, 3.33M, and 10M) from the Drug Repurposing Hub.

Evaluations

We evaluated the two assays in several tasks:

  • Reproducibility
  • Sample diversity
  • Feature diversity
  • MOA prediction (unsupervised)
  • Gene target prediction (unsupervised)
  • MOA prediction (supervised)
  • Gene Ontology pathway prediction (supervised)

Data collection and data processing procedures

our workflow

Data collection and data processing workflows, related to Figure 1. (a) We cultured A549 lung cancer cells and exposed them to 1,327 different compound perturbations in about six doses per compound. We plated these cells in 384 well plates, and, using the same plate layout, measured gene expression (using the L1000 assay) and morphology (using the Cell Painting assay) in compound-perturbed A549 cells. (b) Our image-based profiling pipeline we used to process the Cell Painting images. We used pycytominer to process the single cell profiles. All processing code and profile data are available at https://github.com/broadinstitute/lincs-cell-painting. Image data available at Image Data Resource (accession: idr0125).

Findings summary

We find that each assay provides complementary information for mapping cell state. Cell Painting has generally higher reproducibility, but suffers from more technical artifacts. L1000 has a more diverse feature space, but contains less diverse samples. In general Cell Painting captures more MOAs by an unsupervised analysis, but L1000 performs better in deep learning predictions.

Importantly, each assay captures complementary cell states. By combining each data type, one can capture more mechanisms than either alone.

See our paper (linked below) for more details and extensive resources.

Reproduce computational environment

We use a combination of conda and pip to manage the proper python packages for data assessment and model predictions. To reproduce our environment run the following:

```bash

In the top folder of the directory

conda env create --force --file environment.yml && conda activate lincs-complimentarity && cd 2.MOA-prediction/ && python setup.py && cd .. ```

We also need to setup custom computational environments for tensorflow and pytorch for the MOA prediction analysis.

```bash

Navigate into the MOA prediction folder

cd 2.MOA-prediction

Step 1 - Tensorflow

Initialize a virtual environment

python3 -m venv tensorflow_env

Activate the environment

source tensorflow_env/bin/activate

Upgrade pip if necessary

python3 -m pip install --upgrade pip

Install tensorflow requirements

python3 -m pip install -r tensorflow_requirements.txt

Step 2 - Pytorch

python3 -m venv pytorchenv source pytorchenv/bin/activate python3 -m pip install -r pytorch_requirements.txt && python3 setup.py ```

Citation

For a complete discussion of our findings please view our preprint:

Morphology and gene expression profiling provide complementary information for mapping cell state. Gregory P. Way, Ted Natoli, Adeniyi Adeboye, Lev Litichevskiy, Andrew Yang, Xiaodong Lu, Juan C. Caicedo, Beth A. Cimini, Kyle Karhohs, David J. Logan, Mohammad Rohban, Maria Kost-Alimova, Kate Hartland, Michael Bornholdt, Niranj Chandrasekaran, Marzieh Haghighi, Shantanu Singh, Aravind Subramanian, Anne E. Carpenter. bioRxiv 2021.10.21.465335; doi: https://doi.org/10.1101/2021.10.21.465335

Owner

  • Name: Broad Institute
  • Login: broadinstitute
  • Kind: organization
  • Location: Cambridge, MA

Broad Institute of MIT and Harvard

GitHub Events

Total
  • Watch event: 2
Last Year
  • Watch event: 2

Issues and Pull Requests

Last synced: about 1 year ago

All Time
  • Total issues: 29
  • Total pull requests: 118
  • Average time to close issues: about 1 month
  • Average time to close pull requests: about 22 hours
  • Total issue authors: 5
  • Total pull request authors: 3
  • Average comments per issue: 2.55
  • Average comments per pull request: 0.23
  • Merged pull requests: 115
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • gwaybio (10)
  • AdeboyeML (3)
  • shntnu (2)
  • yhan8 (1)
  • shihanyu (1)
Pull Request Authors
  • gwaybio (57)
  • AdeboyeML (4)
  • shntnu (1)
Top Labels
Issue Labels
Pull Request Labels