https://github.com/broadinstitute/lincs-profiling-complementarity
Analyzing and comparing signal found in different profiling technologies
https://github.com/broadinstitute/lincs-profiling-complementarity
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
✓DOI references
Found 2 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (14.8%) to scientific vocabulary
Repository
Analyzing and comparing signal found in different profiling technologies
Basic Info
Statistics
- Stars: 7
- Watchers: 3
- Forks: 5
- Open Issues: 15
- Releases: 0
Metadata Files
README.md
Morphology and gene expression profiling provide complementary information for mapping cell state
Assessing information contained in different profiling data modalities.
Assays
We evaluated two assays:
- L1000 (gene expression)
- Cell Painting (morphology)
Perturbations
We tested each assay in their ability to measure cell states from 1,327 compound perturbations across 6 doses (0.04M, 0.12M, 0.37M, 1.11M, 3.33M, and 10M) from the Drug Repurposing Hub.
Evaluations
We evaluated the two assays in several tasks:
- Reproducibility
- Sample diversity
- Feature diversity
- MOA prediction (unsupervised)
- Gene target prediction (unsupervised)
- MOA prediction (supervised)
- Gene Ontology pathway prediction (supervised)
Data collection and data processing procedures

Data collection and data processing workflows, related to Figure 1. (a) We cultured A549 lung cancer cells and exposed them to 1,327 different compound perturbations in about six doses per compound. We plated these cells in 384 well plates, and, using the same plate layout, measured gene expression (using the L1000 assay) and morphology (using the Cell Painting assay) in compound-perturbed A549 cells. (b) Our image-based profiling pipeline we used to process the Cell Painting images. We used pycytominer to process the single cell profiles. All processing code and profile data are available at https://github.com/broadinstitute/lincs-cell-painting. Image data available at Image Data Resource (accession: idr0125).
Findings summary
We find that each assay provides complementary information for mapping cell state. Cell Painting has generally higher reproducibility, but suffers from more technical artifacts. L1000 has a more diverse feature space, but contains less diverse samples. In general Cell Painting captures more MOAs by an unsupervised analysis, but L1000 performs better in deep learning predictions.
Importantly, each assay captures complementary cell states. By combining each data type, one can capture more mechanisms than either alone.
See our paper (linked below) for more details and extensive resources.
Reproduce computational environment
We use a combination of conda and pip to manage the proper python packages for data assessment and model predictions. To reproduce our environment run the following:
```bash
In the top folder of the directory
conda env create --force --file environment.yml && conda activate lincs-complimentarity && cd 2.MOA-prediction/ && python setup.py && cd .. ```
We also need to setup custom computational environments for tensorflow and pytorch for the MOA prediction analysis.
```bash
Navigate into the MOA prediction folder
cd 2.MOA-prediction
Step 1 - Tensorflow
Initialize a virtual environment
python3 -m venv tensorflow_env
Activate the environment
source tensorflow_env/bin/activate
Upgrade pip if necessary
python3 -m pip install --upgrade pip
Install tensorflow requirements
python3 -m pip install -r tensorflow_requirements.txt
Step 2 - Pytorch
python3 -m venv pytorchenv source pytorchenv/bin/activate python3 -m pip install -r pytorch_requirements.txt && python3 setup.py ```
Citation
For a complete discussion of our findings please view our preprint:
Morphology and gene expression profiling provide complementary information for mapping cell state. Gregory P. Way, Ted Natoli, Adeniyi Adeboye, Lev Litichevskiy, Andrew Yang, Xiaodong Lu, Juan C. Caicedo, Beth A. Cimini, Kyle Karhohs, David J. Logan, Mohammad Rohban, Maria Kost-Alimova, Kate Hartland, Michael Bornholdt, Niranj Chandrasekaran, Marzieh Haghighi, Shantanu Singh, Aravind Subramanian, Anne E. Carpenter. bioRxiv 2021.10.21.465335; doi: https://doi.org/10.1101/2021.10.21.465335
Owner
- Name: Broad Institute
- Login: broadinstitute
- Kind: organization
- Location: Cambridge, MA
- Website: http://www.broadinstitute.org/
- Twitter: broadinstitute
- Repositories: 1,083
- Profile: https://github.com/broadinstitute
Broad Institute of MIT and Harvard
GitHub Events
Total
- Watch event: 2
Last Year
- Watch event: 2
Issues and Pull Requests
Last synced: about 1 year ago
All Time
- Total issues: 29
- Total pull requests: 118
- Average time to close issues: about 1 month
- Average time to close pull requests: about 22 hours
- Total issue authors: 5
- Total pull request authors: 3
- Average comments per issue: 2.55
- Average comments per pull request: 0.23
- Merged pull requests: 115
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- gwaybio (10)
- AdeboyeML (3)
- shntnu (2)
- yhan8 (1)
- shihanyu (1)
Pull Request Authors
- gwaybio (57)
- AdeboyeML (4)
- shntnu (1)