Fingerprinting
ListGenotypedVCFs
- description
- List all VCFs that have been genotyped
Inputs
Required
fingerprint_store(String, required): A bucket that holds all fingerprinting VCFs
Outputs
vcf_gs_paths(File)
PickGenotypeVCF
- description
- Pick a subset of VCFs that have been genotyped
Inputs
Required
fingerprinting_vcf_gs_paths(File, required): A file holding GS paths to fingerprinting GT'ed VCFs
Optional
vcf_name(String?): an expression used for picking up VCFs, the filter will be applied to VCF names, a match will lead to the VCF to be included
Outputs
vcfs(Array[String])
FilterGenotypesVCF
- description
- Filter out unwanted genotypes from a VCF
Inputs
Required
fingerprint_vcf(File, required): A VCF file that has been genotyped
Defaults
filters(Array[String], default=['_random\t', '_decoy\t', '_alt\t', '^chrUn', '^HLA', '^EBV']): An array of chromosome names to filter out when verifying fingerprints
Outputs
ready_to_use_vcf(File)
ExtractGenotypingSites
- description
- Extract genotyping sites from a VCF
Inputs
Required
fingerprint_vcf(File, required): A VCF file that has been genotyped
Outputs
sites(File)
MergeGenotypingSites
- description
- Merge genotyping sites from multiple VCFs
Inputs
Required
all_sites(Array[File], required): An array of genotyping sites
Outputs
merged_sites(File)
ExtractRelevantGenotypingReads
- description
- Based on genotyping (SNP) sites, extract reads that overlap those places
Inputs
Required
aligned_bai(File, required)aligned_bam(File, required); localization_optional: truegenotyping_sites_bed(File, required)
Optional
runtime_attr_override(RuntimeAttr?)
Outputs
relevant_reads(File)relevant_reads_bai(File)
ResetCLRBaseQual
- description
- Reset base quality scores to a fixed value
Inputs
Required
arbitrary_bq(Int, required): A fixed base quality scorebai(File, required): A BAI filebam(File, required): A BAM file
Optional
runtime_attr_override(RuntimeAttr?)
Outputs
barbequed_bam(File)barbequed_bai(File)
CheckFingerprint
- description
- Uses Picard tool CheckFingerprint to verify if the samples in provided VCF and BAM arise from the same biological sample
Inputs
Required
aligned_bai(File, required)aligned_bam(File, required); description: GCS path to aligned BAM file, supposed to be of the same sample as from the fingerprinting VCF; localization_optional: truefingerprint_vcf(File, required): Fingerprint VCF file from local database; note that sample name must be the same as in BAMhaplotype_map(File, required): Happlotype map file for the reference build used. See https://bit.ly/3QyZbwtvcf_sample_name(String, required): Sample name in VCF, possibly different from that in the BAM.
Optional
runtime_attr_override(RuntimeAttr?)
Outputs
summary_metrics(File)detail_metrics(File)metrics_map(Map[String,String])
CheckCLRFingerprint
- description
- Uses Picard tool CheckFingerprint to verify if the samples in provided VCF and the CLR BAM arise from the same biological sample.
Inputs
Required
aligned_bai(File, required)aligned_bam(File, required)fingerprint_vcf(File, required)haplotype_map(File, required): Haplotype map file for the reference build used. See https://bit.ly/3QyZbwtvcf_sample_name(String, required): Sample name in VCF, possibly different from that in the BAM.
Optional
runtime_attr_override(RuntimeAttr?)
Defaults
min_base_q(Int, default=0)
Outputs
summary_metrics(File)detail_metrics(File)metrics_map(Map[String,String])
ReheaderFullGRCh38VCFtoNoAlt
- desciption
- Reheader the fingperint VCF that's generated with full GRCh38 reference to the no_alt header; project specific.
Inputs
Required
full_GRCh38_vcf(File, required): Full GRCh38 VCF file.
Outputs
reheadered_vcf(File)