Workflows using the Cell Painting Gallery#
Data in the Cell Painting Gallery can be used for many different purposes in many different ways. We provide here links to external workflows that access Cell Painting Gallery data to serve as inspiration for others in making their own workflows.
CellPose#
Workflow: Segment images in cpg0016-jump using CellPose in a SnakeMake pipeline.
CellPose is an open-source generalist algorithm for cellular segmentation. Snakemake is an open-source workflow engine that provides a readable Python-based workflow definition language and execution environment.
This workflow was used to create single-cell crops from the entire cpg0016-jump
dataset and can be customized to run on subsets filtered by Metadata such as Source, Batch, Plate, Well Site, and InChIKey.
Distributed-CellProfiler#
Distributed-CellProfiler is simple, open-source Python software for running encapsulated docker containers with CellProfiler in the Amazon Web Services (AWS) infrastructure. CellProfiler is an open-source image analysis software that excels in GUI or headless handling of high-content workflows. CellProfiler has dozens of native image and object handling modules as well as the ability to run custom modules (i.e. plugins). CellProfiler-plugins include modules that call out to other softwares such as open-source image analysis software ImageJ/FIJI, including open-source Deep Learning softwares such as CellPose and Stardist.
This workflow is a minimal example used to demonstrate data access from the CellPainting-Gallery. Distributed-CellProfiler workflows have been used to generate most of the illumination correction files and classic morphological profiles found in the CellPainting-Gallery.
Though we don’t have separate tutorials at this point in time, other softwares in the DistributedScience suite (such as Distributed-FIJI or Distributed-OMEZarrCreator) can be similarly used to access data in the Cell Painting Gallery.
cpg0021-periscope representative cell image retrieval#
The PERISCOPE dataset provides a tool for retrieving single cell image crops for each of the three whole-genome screens included in the dataset. The number of cells returned and bounding box size are configurable. Cells can either be representative or random for any given gene or CRISPR guide. The tool can be found in the paper repository.