RegenotypeCNVs
Re-genotypes probable mosaic variants across multiple batches. This is a "cohort-level" workflow that operates on all batches.
The following diagram illustrates the recommended invocation order:
Inputs
All array inputs of batch data must match in order. For example, the order of the batches
array should match that of
depth_vcfs
, batch_depth_vcfs
, etc.
depth_vcfs
Array of genotyped depth caller variants for all batches, generated in GenotypeBatch.
cohort_depth_vcf
Merged depth caller variants for the cohort, generated in MergeBatchSites.
batch_depth_vcfs
Array of filtered depth caller variants for all batches, generated in FilterBatch. Order must match that of depth_vcfs.
coveragefiles
Array of merged RD evidence files for all batches from GatherBatchEvidence. Order must match that of depth_vcfs.
medianfiles
Array of median coverage tables for all batches from GatherBatchEvidence. Order must match that of depth_vcfs.
RD_depth_sepcutoffs
Array of "depth_depth" genotype cutoff files (depth evidence for depth-based calls) generated in GenotypeBatch. Order must match that of depth_vcfs.
n_per_split
Records per shard when scattering variants. Decrease to increase parallelism if the workflow is running slowly.
n_RD_genotype_bins
Number of depth genotyping bins. Most users should leave this at the default value.
batches
Array of batch identifiers. Should match the name used in GatherBatchEvidence. Order must match that of depth_vcfs.
cohort
Cohort name. May be alphanumeric with underscores.
regeno_coverage_medians
Array of regenotyping metrics generated in GenotypeBatch.
Outputs
regenotyped_depth_vcfs
Array of batch depth VCFs after regenotyping.