Skip to main content

GVS AoU Reblock gVCF

Pipeline VersionDate UpdatedDocumentation AuthorQuestions or Feedback
aou_9.0.0August, 2025WARP PipelinesFile an issue

Introduction to the GVS AoU Reblock gVCF workflow

GvsAoUReblockGvcf reblocks existing per-sample gVCFs using GATK ReblockGVCF and optionally copies results to site-specific AoU research buckets.

This workflow is commonly used as a preparation step before external joint calling systems (e.g., GVS-based aggregation workflows).

Quickstart table

Pipeline FeatureDescriptionSource
Analysis typegVCF reblocking and optional site-bucket copy
Workflow languageWDL 1.0openWDL
Data input file formatgVCF path + reference bundle
Data output file formatReblocked gVCF + index
Primary softwareGATK ReblockGVCF + gsutilGATK

Set-up

GVS AoU Reblock gVCF installation and requirements

The workflow code can be downloaded by cloning the WARP GitHub repository. For the latest release, please see the GvsAoUReblockGvcf changelog.

The pipeline can be deployed using Cromwell, a GA4GH-compliant workflow management system.

Inputs

Input descriptions

Input variable nameDescriptionType
gvcfGCS path to input gVCF (.g.vcf.gz).String
gvcf_index(Optional) Index path for input gVCF. Defaults to gvcf + ".tbi".String?
ref_dictReference sequence dictionary.File
ref_fastaReference FASTA.File
ref_fasta_indexFASTA index.File
requester_pays_project(Optional) GCS requester-pays project passed to GATK.String?
site_id(Optional) Destination site code (bi, bcm, uw).String?
docker_imageDocker image for GATK ReblockGVCF. Default: us.gcr.io/broad-gatk/gatk:4.2.6.1.String

GVS AoU Reblock gVCF tasks and tools

This workflow runs a single task.

  1. Reblock and optionally copy output
Task name and WDL linkToolSoftwareDescription
ReblockAndCopyGATK ReblockGVCFconfigurable (docker_image)Reblocks input gVCF, then optionally copies output and index to a site-specific bucket prefix.

1. Reblock and optionally copy output

ReblockAndCopy runs GATK with fixed GQ bands and optional requester-pays settings. If site_id is provided, output files are copied to the mapped destination bucket path.

Outputs

Output variable nameFilename, if applicableOutput format and description
reblocked_gvcf<basename>.reblocked.g.vcf.gzReblocked gVCF output (optionally delocalized to site-specific bucket path).
reblocked_gvcf_index<basename>.reblocked.g.vcf.gz.tbiTabix index for reblocked gVCF.

Versioning

All GvsAoUReblockGvcf releases are documented in the changelog.

Feedback

Please help us make our tools better by filing an issue in WARP; we welcome pipeline-related suggestions or questions.