Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 77 DOI reference(s) in README -
✓Academic publication links
Links to: arxiv.org, biorxiv.org, nature.com -
✓Committers with academic emails
3 of 8 committers (37.5%) from academic institutions -
✓Institutional organization owner
Organization broadinstitute has institutional domain (www.broadinstitute.org) -
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.0%) to scientific vocabulary
Keywords from Contributors
Repository
Cell Painting Gallery
Basic Info
- Host: GitHub
- Owner: broadinstitute
- License: other
- Default Branch: main
- Homepage: https://broadinstitute.github.io/cellpainting-gallery/
- Size: 5.31 MB
Statistics
- Stars: 104
- Watchers: 6
- Forks: 14
- Open Issues: 14
- Releases: 0
Metadata Files
README.md
Cell Painting Gallery
This page provides a guide to the datasets that are available in the Cell Painting Gallery, hosted by the AWS Registry of Open Data (RODA): https://registry.opendata.aws/cellpainting-gallery
Citation/license
All the data is released with CC0 1.0 Universal (CC0 1.0). Still, professional ethics require that you cite the appropriate resources/publications, listed below, when using individual datasets, along with our Nature Methods publication announcing the Cell Painting Gallery (Weisbart et al., 2024). For example,
We used the dataset
cpg0000(Chandrasekaran et al., 2022), available from the Cell Painting Gallery (Weisbart et al., 2024) on the Registry of Open Data on AWS (https://registry.opendata.aws/cellpainting-gallery/).
Please also acknowledge the Registry of Open Data (RODA) on AWS for their support in hosting the data, e.g., "We thank the AWS Open Data Sponsorship Program for sponsoring data storage."
Documentation
Please see our documentation for extensive supporting information.
It includes:
- how to browse gallery data
- how to download gallery data (with AWS CLI, Quilt, or dataset-specific tools)
- how to contribute to the gallery
Available datasets
All datasets are generated using the Cell Painting assay unless indicated otherwise. Several updates to that protocol exist (Cell Painting wiki).
The datasets are stored with the prefix indicated by the dataset name.
e.g. the first dataset is located at s3://cellpainting-gallery/cpg0000-jump-pilot and can be listed using AWS CLI aws s3 ls --no-sign-request s3://cellpainting-gallery/cpg0000-jump-pilot/ (note the / at the end).
See browsing data in our documentation for more information on viewing the gallery in a browser and examples of how to list files using AWS CLI or boto3.
The datasets' accession numbers are the first seven characters of the dataset name.
e.g. the accession number of the first dataset is cpg0000.
| Dataset name | Description | Publication to cite | Associated repositories | Total size | Images size | Numerical data size | Cell Painting protocol | Other aliases | |------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:----------:|:-----------:|:-------------------:|:----------------------:|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------:| | cpg0000-jump-pilot | 300+ compounds and 160+ genes (CRISPR knockout and overexpression) profiled in A549 and U2OS cells, at two timepoints | (Chandrasekaran et al., 2024) Publication, Preprint, Description of Cell Painting v2.5. | data, JUMP-Cell Painting Hub | 12.3 TB | 6.1 TB | 6.1 TB | v2.5 | | | cpg0001-cellpainting-protocol | 300+ compounds profiled in U2OS cells using several different modifications of the Cell Painting protocol | (Cimini et al., 2022) Publication, Preprint Description of Cell Painting v3. | data, JUMP-Cell Painting Hub | 40.3 TB | 18.7 TB | 21.6 TB | v3 and experiments | | | cpg0002-jump-scope | 90 compounds (JUMP-MOA plate) profiled in U2OS using different microscopes and settings | (Tromans-Coia and Jamali et al., 2023) Publication, Preprint | data, analysis, JUMP-Cell Painting Hub | 16.7 TB | 12.5 TB | 4.2 TB | v2.5 | | | cpg0003-rosetta | 28,000+ genes and compounds profiled in Cell Painting and L1000 gene expression | (Haghighi et al., 2022) Publication, Preprint, | data | 8.5 GB | 0 | 8.5 GB | | | | cpg0004-lincs | 1,571 compounds across 6 doses in A549 cells | (Way et al., 2022) Publication, Preprint | data | 65.7 TB | 61.9 TB | 3.8 TB | v2 | idr0125 | | cpg0005-gerry-bioactivity | 30 related synthetic compounds across 6 doses in U2OS cells | (Gerry et al., 2016) Publication | | 356 GB | 356 GB | | v1 | | | cpg0010-caie-drugresponse | MCF-7 breast cancer cells treated with 113 small molecules at eight concentrations. | (Caie et al., 2010) Publication | | 239.2 GB | 98.4 GB | 140.8 GB | other variation | BBBC021 | | cpg0011-lipocyteprofiler | Variety of lipocytes in different metabolic states and with genetic and drug perturbations | (Laber and Strobel et al., 2023) Publication, Preprint Description of Cell Painting lipocyte variant. | Batch1 and Batch3 data, analysis | 1.2 TB | 1.2 TB | 16 MB | lipocyte | | | cpg0012-wawer-bioactivecompoundprofiling | 30,000 compound dataset in U2OS cells. Original images re-profiled in 2023 (original profiles available in workspace/gigascienceprofiles) | (Wawer et al., 2014) Publication Description of Cell Painting v1, (Bray et al., 2017) Publication Description of Cell Painting v2 | data | 10.7 TB | 3.1 TB | 7.6 TB | v1 | idr0016, CDRP, BBBC036, BBBC047 | | cpg0015-heterogeneity | 2,200+ compounds and 200+ genes profiles in U2OS cells | (Rohban et al., 2019) Publication | data | 204 GB | 0 | 204 GB | | idr0016, idr0036 | | cpg0016-jump | 116,000+ compounds and 16+ genes (CRISPR knockout and overexpression) profiled in U2OS cells. Over 8 million images (>126 TB), over 1.5 billion cells of numerical data (>126TB), for over 250 TB data in total. | (Chandrasekaran et al., 2023) Preprint | datasets resource, JUMP-Cell Painting Hub | 358.4 TB | | | v3 | | | cpg0017-rohban-pathways | 323 genes overexpressed in U2OS cells. Original images re-profiled in 2023 (original profiles not in gallery) | (Rohban et al, 2017) Publication, Preprint | re-profiled data, original data | 321 GB | 189 GB | 132 GB | v1 | BBBC037, TA-ORF, idr0033 | | cpg0018-singh-seedseq | U2OS cells treated with each of 315 unique shRNA sequences | (Singh et al. 2013) Publication | | 247.1 GB | 247.1 GB | 0 | | BBBC025 | | cpg0019-moshkov-deepprofiler | 8.3 million single cells from 232 plates, across 488 treatments from 5 public datasets, used for learning representations | (Moshkov et al., 2024) Publication, Preprint | data, software | 522 GB | 482 GB | 40 GB | dataset dependent | | | cpg0021-periscope | 30 million cells with 20,000 single-gene knockouts in pooled format. A549 cells and HeLa cells in two growth media | (Ramezani, Weisbart, Bauman, and Singh et al., 2025) Preprint, Publication, Description of Cell Painting pooled variant. Also has data from (Haghighi et al., 2023) Preprint, Paper. | analysis, A549 data, HeLa data, PLePI | 56.0 TB | 45.0 TB | 11.0 TB | pooled | | | cpg0022-cmqtl | 297 iPSC lines | (Tegtmeyer et al., 2024) Publication, Preprint | data | 3.7 TB | 2.8 TB | 945 GB | v2.5 | | | cpg0026-lacostehaghighi-rare-diseases | Protein localization of 3,448 missense variants in 1,269 genes in HeLa cells | (Lacoste and Haghighi et al., 2024) Publication, Preprint | analysis | 11 TB| 9.4 TB | 1.6 TB | Protein of interest, Hoechst, ConA, Mitotracker | | | cpg0028-kelley-resistance | Bortezomib resistant HCT116 clones | (Kelley et al., 2023) Publication | data | 4.1 TB | 1.9 TB | 2.2 TB | | | | cpg0030-gustafsdottir-cellpainting | U2OS cells treated with each of 1600 known bioactive compounds. Description of Cell Painting v1. | (Gustafsdottir et al., 2013) Publication | | 234 GB | 234 GB | .3 GB | v1 | BBBC022, idr0036 | | cpg0031-caicedo-cmvip | ORF over-expression of 596 alleles of 53 genes in A549 cells. Original images re-profiled in 2023 (original profiles available in workspace/profiles_orig) | (Caicedo et al., 2023) Publication, Preprint | original data, re-profiled data | 2.2 TB | 605 GB | 1.6 TB | v1 | BBBC043, LUAD | | cpg0034-arevalo-su-motive | A graph dataset comprising Cell Painting features for 11,000 genes and 3,600 compounds, along with their relationships extracted from seven publicly available databases | (Arevalo and Su et al., 2024) Publication, Preprint | analysis | 4.5 GB | 0 GB | 4.5 GB | v3 | | | cpg0036-EU-OS-bioactives | 2464 compounds from EU-OPENSCREEN Bioactive compound set, four imaging sites, two cell lines (HepG2 & U2OS) | (Wolff et al., 2025) Publication Preprint | aggregated profiles, analysis scripts, compound information | 3.5 TB | 3.5 TB | | v1 | Bioactives, EU-OS-Bioactives |
Owner
- Name: Broad Institute
- Login: broadinstitute
- Kind: organization
- Location: Cambridge, MA
- Website: http://www.broadinstitute.org/
- Twitter: broadinstitute
- Repositories: 1,083
- Profile: https://github.com/broadinstitute
Broad Institute of MIT and Harvard
GitHub Events
Total
- Create event: 16
- Issues event: 22
- Watch event: 28
- Delete event: 13
- Issue comment event: 41
- Push event: 49
- Pull request review comment event: 6
- Gollum event: 1
- Pull request review event: 22
- Pull request event: 44
- Fork event: 2
Last Year
- Create event: 16
- Issues event: 22
- Watch event: 28
- Delete event: 13
- Issue comment event: 41
- Push event: 49
- Pull request review comment event: 6
- Gollum event: 1
- Pull request review event: 22
- Pull request event: 44
- Fork event: 2
Committers
Last synced: 9 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| Erin Weisbart | 5****t | 103 |
| Shantanu Singh | s****h@b****g | 68 |
| Anne Carpenter | a****e@b****g | 4 |
| Christopher Schmied | 1****c | 3 |
| Beth Cimini | b****7 | 3 |
| emiglietta | e****a@g****m | 2 |
| Alán F. Muñoz | a****o@b****g | 2 |
| Suganya Sivagurunathan | s****n@g****m | 2 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 25
- Total pull requests: 49
- Average time to close issues: 6 months
- Average time to close pull requests: 6 days
- Total issue authors: 10
- Total pull request authors: 7
- Average comments per issue: 4.8
- Average comments per pull request: 0.24
- Merged pull requests: 39
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 15
- Pull requests: 38
- Average time to close issues: 8 days
- Average time to close pull requests: 7 days
- Issue authors: 7
- Pull request authors: 7
- Average comments per issue: 0.33
- Average comments per pull request: 0.13
- Merged pull requests: 30
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- ErinWeisbart (10)
- shntnu (4)
- bethac07 (3)
- jasperhyp (2)
- mengchengyao (1)
- danrlu (1)
- fleurvereijken1999 (1)
- gwaybio (1)
- AnneCarpenter (1)
- The-Real-JerryChen (1)
Pull Request Authors
- ErinWeisbart (23)
- shntnu (10)
- emiglietta (5)
- sugan89 (4)
- schmiedc (3)
- bethac07 (2)
- afermg (2)