Skip to main content

ATAC Library Metrics Overview

The ATAC pipeline uses SnapATAC2 to generate library-level metrics in CSV format.

MetricDescription
NHash_IDA unique identifier used to track and reference the specific sample or dataset.
sequenced_readsThe total number of reads generated from the sequencing process, which includes both reads that are mapped and unmapped.
sequenced_read_pairsThe total number of read pairs (two reads per pair) generated from the sequencing process. This is typically half of the total sequenced reads if all reads are paired.
fraction_valid_barcodeThe fraction of reads that contain a valid barcode, indicating the proportion of reads that are correctly assigned to a specific cell or sample.
fraction_Q30_bases_in_read_1The proportion of bases in Read 1 that have a Phred quality score of 30 or higher, indicating high-confidence base calls.
fraction_Q30_bases_in_read_2The proportion of bases in Read 2 that have a Phred quality score of 30 or higher, indicating high-confidence base calls.
number_of_cellsThe estimated number of cells captured and sequenced in the experiment, based on the barcodes identified.
mean_raw_read_pairs_per_cellThe average number of raw read pairs associated with each cell, providing an indication of the sequencing depth per cell.
median_high-quality_fragments_per_cellThe median number of high-quality (e.g., confidently mapped) fragments associated with each cell, representing typical fragment quality across cells.
fraction of high-quality fragments in cellsThe fraction of high-quality fragments that are associated with identified cells, indicating the proportion of good-quality data that is cell-associated.
fraction_of_transposition_events_in_peaks_in_cellsThe fraction of transposition events within identified cells that occur within peaks, which are regions of accessible chromatin.
fraction_duplicatesThe fraction of sequenced fragments that are duplicates, which can result from PCR amplification or other factors, indicating the redundancy in the sequencing data.
fraction_confidently_mappedThe fraction of sequenced fragments that are confidently mapped to the reference genome, indicating the proportion of reads that align well to the genome.
fraction_unmappedThe fraction of sequenced fragments that could not be mapped to the reference genome, which can indicate sequencing errors, contamination, or regions not covered by the reference.
fraction_nonnuclearThe fraction of sequenced fragments that are mapped to non-nuclear (e.g., mitochondrial or other organellar) DNA, providing insight into contamination or organellar activity.
fraction_fragment_in_nucleosome_free_regionThe fraction of sequenced fragments that map to nucleosome-free regions, which are indicative of accessible chromatin.
fraction_fragment_flanking_single_nucleosomeThe fraction of sequenced fragments that map to regions flanking single nucleosomes, indicating regions with partial chromatin accessibility.
tss_enrichment_scoreA measure of the enrichment of transposition events at transcription start sites (TSS), indicating the accessibility of promoters across the genome.
fraction_of_high-quality_fragments_overlapping_TSSThe fraction of high-quality fragments that overlap transcription start sites (TSS), providing insight into promoter accessibility.
Number_of_peaksThe total number of peaks, or regions of accessible chromatin, identified in the dataset, representing potential regulatory elements.
fraction_of_genome_in_peaksThe fraction of the genome that is covered by identified peaks, indicating the extent of chromatin accessibility across the genome.
fraction_of_high-quality_fragments_overlapping_peaksThe fraction of high-quality fragments that overlap with identified peaks, providing an indication of the efficiency of the assay in capturing accessible regions.
atac_percent_targetPercent of cells recovered; value is calculated as estimated_cells/expected_cells.