gnomad_qc.v5.resources.annotations

Script containing annotation related resources.

Module Functions

gnomad_qc.v5.resources.annotations.get_trio_stats([...])

Get gnomAD v5 (AoU genomes only) trio stats VersionedTableResource.

gnomad_qc.v5.resources.annotations.get_sib_stats([...])

Get the gnomAD v5 (AoU genomes only) sibling stats VersionedTableResource.

gnomad_qc.v5.resources.annotations.get_aou_downsampling([...])

Get the downsampling annotation table.

gnomad_qc.v5.resources.annotations.group_membership([...])

Get the group membership Table for coverage, AN, quality histograms, and frequency calculations.

gnomad_qc.v5.resources.annotations.qual_hists([...])

Get the quality histograms annotation table.

gnomad_qc.v5.resources.annotations.coverage_and_an_path([...])

Fetch filepath for all sites coverage or allele number Table.

gnomad_qc.v5.resources.annotations.get_freq([...])

Get the frequency annotation Table for v5.

gnomad_qc.v5.resources.annotations.get_info_ht([...])

Get the gnomAD v5 (AoU genomes only) info VersionedTableResource.

Script containing annotation related resources.

gnomad_qc.v5.resources.annotations.get_trio_stats(test=False, environment='rwb')[source]

Get gnomAD v5 (AoU genomes only) trio stats VersionedTableResource.

Parameters:
  • test (bool) – Whether to use a temporary path for testing.

  • environment (str) – Environment to use. Default is “rwb”. Must be one of “rwb”, “batch”, or “dataproc”.

Return type:

VersionedTableResource

Returns:

AoU trio stats VersionedTableResource.

gnomad_qc.v5.resources.annotations.get_sib_stats(test=False, environment='rwb')[source]

Get the gnomAD v5 (AoU genomes only) sibling stats VersionedTableResource.

Parameters:
  • test (bool) – Whether to use a tmp path for testing.

  • environment (str) – Environment to use. Default is “rwb”. Must be one of “rwb”, “batch”, or “dataproc”.

Return type:

VersionedTableResource

Returns:

AoU sibling stats VersionedTableResource.

gnomad_qc.v5.resources.annotations.get_aou_downsampling(test=False, environment='rwb')[source]

Get the downsampling annotation table.

v5 downsamplings only applies to the AoU dataset.

Parameters:
  • test (bool) – Whether to use a tmp path for tests. Default is False.

  • environment (str) – Environment to use. Default is “rwb”. Must be one of “rwb”, “batch”, or “dataproc”.

Return type:

VersionedTableResource

Returns:

Hail Table containing downsampling annotations.

gnomad_qc.v5.resources.annotations.group_membership(test=False, data_set='aou', environment='rwb')[source]

Get the group membership Table for coverage, AN, quality histograms, and frequency calculations.

Parameters:
  • test (bool) – Whether to use a tmp path for tests. Default is False.

  • data_set (str) – Data set of annotation resource. Default is “aou”.

  • environment (str) – Environment to use. Default is “rwb”. Must be one of “rwb”, “batch”, or “dataproc”.

Return type:

VersionedTableResource

Returns:

Hail Table containing group membership annotations.

gnomad_qc.v5.resources.annotations.qual_hists(test=False, environment='rwb')[source]

Get the quality histograms annotation table.

Parameters:
  • test (bool) – Whether to use a tmp path for tests. Default is False.

  • environment (str) – Environment to use for quality histograms. Must be one of “rwb”, “batch”, or “dataproc”.

Return type:

VersionedTableResource

Returns:

Hail Table containing quality histogram annotations.

gnomad_qc.v5.resources.annotations.coverage_and_an_path(test=False, data_set='aou', environment='rwb')[source]

Fetch filepath for all sites coverage or allele number Table.

Note

If data_set is ‘gnomAD’, the returned table only contains coverage and AN for consent drop samples.

Parameters:
  • test (bool) – Whether to use a tmp path for testing. Default is False.

  • data_set (str) – Dataset identifier. Must be one of “aou” or “gnomad”. Default is “aou”.

  • environment (str) –

Return type:

VersionedTableResource

Returns:

Coverage and allele number Hail Table.

gnomad_qc.v5.resources.annotations.get_freq(version='5.0', data_type='genomes', test=False, data_set='aou', environment='rwb')[source]

Get the frequency annotation Table for v5.

Parameters:
  • version (str) – Version of annotation path to return.

  • data_type (str) – Data type of annotation resource (“genomes” or “exomes”).

  • test (bool) – Whether to use a tmp path for testing.

  • data_set (str) – Data set of annotation resource. Default is “aou”.

  • environment (str) – Environment to use. Default is “rwb”. Must be one of “rwb”, “batch”, or “dataproc”.

Return type:

TableResource

Returns:

Hail Table containing frequency annotations.

gnomad_qc.v5.resources.annotations.get_info_ht(test=False, environment='rwb')[source]

Get the gnomAD v5 (AoU genomes only) info VersionedTableResource.

Parameters:
  • test (bool) – Whether to use a tmp path for testing.

  • environment (str) – Environment to use. Default is “rwb”. Must be one of “rwb”, “batch”, or “dataproc”.

Return type:

VersionedTableResource

Returns:

Info VersionedTableResource.