Complete Datasets
| Dataset name | Description | Publication to cite | Associated repositories | Total size | Images size | Numerical data size | Cell Painting protocol | Other aliases | 
|---|---|---|---|---|---|---|---|---|
| cpg0000-jump-pilot | 300+ compounds and 160+ genes (CRISPR knockout and overexpression) profiled in A549 and U2OS cells, at two timepoints | (Chandrasekaran et al., 2024) Publication, Preprint, Description of Cell Painting v2.5. | | 12.3 TB | 6.1 TB | 6.1 TB | v2.5 | |
| cpg0001-cellpainting-protocol | 300+ compounds profiled in U2OS cells using several different modifications of the Cell Painting protocol | (Cimini et al., 2022) Publication, Preprint, Description of Cell Painting v3. | | 40.3 TB | 18.7 TB | 21.6 TB | v3 and experiments | |
| cpg0002-jump-scope | 90 compounds (JUMP-MOA plate) profiled in U2OS using different microscopes and settings | (Tromans-Coia and Jamali et al., 2023) Publication, Preprint | | 16.7 TB | 12.5 TB | 4.2 TB | v2.5 | |
| cpg0003-rosetta | 28,000+ genes and compounds profiled in Cell Painting and L1000 gene expression | (Haghighi et al., 2022) Publication, Preprint | | 8.5 GB | 0 | 8.5 GB | | |
| cpg0004-lincs | 1,571 compounds across 6 doses in A549 cells | (Way et al., 2022) Publication, Preprint | | 65.7 TB | 61.9 TB | 3.8 TB | v2 | idr0125 |
| cpg0005-gerry-bioactivity | 30 related synthetic compounds across 6 doses in U2OS cells | (Gerry et al., 2016) Publication | | 356 GB | 356 GB | | v1 | |
| cpg0010-caie-drugresponse | MCF-7 breast cancer cells treated with 113 small molecules at eight concentrations | (Caie et al., 2010) Publication | | 239.2 GB | 98.4 GB | 140.8 GB | other variation | |
| cpg0011-lipocyteprofiler | Variety of lipocytes in different metabolic states and with genetic and drug perturbations | (Laber and Strobel et al., 2023) Publication, Preprint, Description of Cell Painting lipocyte variant. | | 1.2 TB | 1.2 TB | 16 MB | lipocyte | |
| cpg0012-wawer-bioactivecompoundprofiling | 30,000-compound dataset in U2OS cells. Original images re-profiled in 2023 (original profiles available in workspace/gigascience_profiles) | (Wawer et al., 2014) Publication, Description of Cell Painting v1; (Bray et al., 2017) Publication, Description of Cell Painting v2 | | 10.7 TB | 3.1 TB | 7.6 TB | v1 | |
| cpg0015-heterogeneity | 2,200+ compounds and 200+ genes profiled in U2OS cells | (Rohban et al., 2019) Publication | | 204 GB | 0 | 204 GB | | |
| cpg0016-jump | 116,000+ compounds and 15,000+ genes (CRISPR knockout and overexpression) profiled in U2OS cells. Over 8 million images (>126 TB), over 1.5 billion cells of numerical data (>126 TB), for over 250 TB data in total. | (Chandrasekaran et al., 2023) Preprint | | 358.4 TB | | | v3 | |
| cpg0017-rohban-pathways | 323 genes overexpressed in U2OS cells. Original images re-profiled in 2023 (original profiles not in gallery) | (Rohban et al., 2017) Publication, Preprint | | 321 GB | 189 GB | 132 GB | v1 | |
| cpg0018-singh-seedseq | U2OS cells treated with each of 315 unique shRNA sequences | (Singh et al., 2013) Publication | | 247.1 GB | 247.1 GB | 0 | | |
| cpg0019-moshkov-deepprofiler | 8.3 million single cells from 232 plates, across 488 treatments from 5 public datasets, used for learning representations | (Moshkov et al., 2024) Publication, Preprint | | 522 GB | 482 GB | 40 GB | dataset dependent | |
| cpg0021-periscope | 30 million cells with 20,000 single-gene knockouts in pooled format. A549 cells and HeLa cells in two growth media | (Ramezani, Weisbart, Bauman, and Singh et al., 2025) Preprint, Publication, Description of Cell Painting pooled variant. Also has data from (Haghighi et al., 2023) Preprint, Paper. | | 56.0 TB | 45.0 TB | 11.0 TB | pooled | |
| cpg0022-cmqtl | 297 iPSC lines | (Tegtmeyer et al., 2024) Publication, Preprint | | 3.7 TB | 2.8 TB | 945 GB | v2.5 | |
| cpg0026-lacoste_haghighi-rare-diseases | Protein localization of 3,448 missense variants in 1,269 genes in HeLa cells | (Lacoste and Haghighi et al., 2024) Publication, Preprint | | 11 TB | 9.4 TB | 1.6 TB | Protein of interest, Hoechst, ConA, Mitotracker | |
| cpg0028-kelley-resistance | Bortezomib-resistant HCT116 clones | (Kelley et al., 2023) Publication | | 4.1 TB | 1.9 TB | 2.2 TB | | |
| cpg0029-chroma-pilot | Comparison of alternative Cell Painting dyes | (Sivagurunathan et al., 2025) Publication | | 1.5 TB | 495 GB | 1 TB | v3 and experiments | |
| cpg0030-gustafsdottir-cellpainting | U2OS cells treated with each of 1,600 known bioactive compounds. Description of Cell Painting v1. | (Gustafsdottir et al., 2013) Publication | | 234 GB | 234 GB | 0.3 GB | v1 | |
| cpg0031-caicedo-cmvip | ORF overexpression of 596 alleles of 53 genes in A549 cells. Original images re-profiled in 2023 (original profiles available in workspace/profiles_orig) | (Caicedo et al., 2023) Publication, Preprint | | 2.2 TB | 605 GB | 1.6 TB | v1 | BBBC043, LUAD |
| cpg0034-arevalo-su-motive | A graph dataset comprising Cell Painting features for 11,000 genes and 3,600 compounds, along with their relationships extracted from seven publicly available databases | (Arevalo and Su et al., 2024) Publication, Preprint | | 4.5 GB | 0 GB | 4.5 GB | v3 | |
| cpg0036-EU-OS-bioactives | 2,464 compounds from the EU-OPENSCREEN Bioactive compound set, four imaging sites, two cell lines (HepG2 and U2OS) | (Wolff et al., 2025) Publication, Preprint | | 3.5 TB | 3.5 TB | | v1 | Bioactives, EU-OS-Bioactives |
| cpg0038-tegtmeyer-neuropainting | Multiple brain cell types | (Tegtmeyer et al., 2024) Preprint, Description of Neuropainting variant | | 2.0 TB | 1.8 TB | 240 GB | neuropainting | |
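
The size columns mix MB, GB, and TB, so comparing rows or checking that "Images size" plus "Numerical data size" roughly equals "Total size" takes a unit conversion first. The following is a minimal sketch of that check for a few rows copied from the table above. It assumes decimal units (1 TB = 1000 GB), which the table does not state, and the helper names `to_gb` and `components_match_total` are hypothetical, not part of any gallery tooling.

```python
# Sizes copied from the table above: (total, images, numerical data).
SIZES = {
    "cpg0000-jump-pilot": ("12.3 TB", "6.1 TB", "6.1 TB"),
    "cpg0004-lincs": ("65.7 TB", "61.9 TB", "3.8 TB"),
    "cpg0011-lipocyteprofiler": ("1.2 TB", "1.2 TB", "16 MB"),
    "cpg0022-cmqtl": ("3.7 TB", "2.8 TB", "945 GB"),
}

# Assumption: decimal units. The table does not say whether sizes are
# decimal (TB) or binary (TiB).
UNIT_TO_GB = {"MB": 1e-3, "GB": 1.0, "TB": 1e3}


def to_gb(size: str) -> float:
    """Convert a string such as '945 GB' or '3.7 TB' to gigabytes."""
    value, unit = size.split()
    return float(value) * UNIT_TO_GB[unit]


def components_match_total(total: str, images: str, numerical: str,
                           rel_tol: float = 0.05) -> bool:
    """Return True if images + numerical is within rel_tol of total.

    The tolerance absorbs rounding, since the table reports sizes to
    one decimal place.
    """
    expected = to_gb(total)
    actual = to_gb(images) + to_gb(numerical)
    return abs(actual - expected) <= rel_tol * expected


if __name__ == "__main__":
    for name, (total, images, numerical) in SIZES.items():
        ok = components_match_total(total, images, numerical)
        print(f"{name}: components match total = {ok}")
```

Rows with a blank size cell are left out of the sketch, because a blank cell cannot be distinguished from a zero without checking the dataset itself.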