Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Pipeline Overview

Pipeline Architecture

The pipeline processes data through two parallel arms (the Cell Painting arm and the ISS arm) that operate independently on separate stacks of images before converging for final analysis. A mermaid diagram follows:

Cell Painting Arm (Phenotype)

Barcoding Arm (Genotype)

Combined Analysis

Once both arms pass quality control, the final Analysis pipeline aligns Cell Painting and barcoding images, segments cells from the phenotypic stains, measures morphological features, reads an SBS barcodes for each SBS focus and selects a best match from the barcode library, and assigns barcode foci to cells — linking each cell’s genotype to its phenotype.

The “Stop-and-Check” Workflow

Making morphological measurements in high-content imaging data is computationally expensive. To avoid wasting resources on poor-quality data, the pipeline implements a “Stop-and-Check” workflow controlled by two parameters:

Phase 1 - Initial Processing: The pipeline runs through the QC checkpoints for both arms (SegCheck QC for painting, Preprocess QC for barcoding), generates QC montages, and stops before Stitch & Crop.

Phase 2 - Manual Review: You review QC outputs in results/workspace/qc_reports/.

Phase 3 - Production Run: If data looks good, set --qc_painting_passed true and --qc_barcoding_passed true, then resume with -resume. The pipeline continues from cached results.

Data Hierarchy

Understanding how the pipeline organizes data:

LevelDescriptionExample
BatchCollection of plates processed together. Assumed to share some amount of technical artifacts.Batch1
PlatePhysical multi-well plate. Typically a smaller number of larger wells than for an arrayed experiment.Plate1
WellSingle experimental unit. All cells within a well share a common pool of genetic perturbations and may share a common secondary perturbation (such as an additional drug treatment).A01
SiteField of view within a well1, 2, 3...
CycleSequencing round for barcoding. Also supports multi-round phenotype acquisition1, 2, 3...