What tags are included in an Optimus BAM file?
The Optimus Pipeline outputs a barcoded BAM file of aligned reads. There are multiple tags within the BAM file, including standard tags from 10X genomics and Sequence Alignment Map (SAM) files. The table below details the tags used by the Optimus Pipeline and the relevant sources/tools from which the pipeline obtains the tags.
Optimus Pipeline BAM Tag | Details | Source |
---|---|---|
AS | Alignment score generated by aligner | SAM |
CB | Chromium cellular barcode sequence that is error-corrected and confirmed against a list of known-good barcode sequences | 10X Genomics |
CR | Chromium cellular barcode sequence as reported by the sequencer | 10X Genomics |
CY | Chromium cellular barcode read quality. Phred scores as reported by sequencer | 10X Genomics |
GX | Gene ID | STAR aligner |
GN | Gene names | STAR aligner |
GS | Reserved for backwards compatibility reasons | SAM |
HI | Query hit index | SAM |
MD | String for mismatching indexes | SAM |
NH | Number of reported alignments that contain the query in the current record | SAM |
NM | Edit difference to the reference | SAM |
RG | Read group | SAM |
UB | Chromium molecular barcode sequence that is error-corrected among other molecular barcodes with the same cellular barcode and gene alignment | 10X Genomics |
UG | Read group ID | Umi-tools |
UR | Chromium molecular barcode sequence as reported by the sequencer | 10X Genomics |
UY | Chromium molecular barcode read quality. Phred scores as reported by sequencer | 10X Genomics |
jI | [jI:B:I,Start1,End1,Start2,End2,...] Start and End of introns for all junctions (1-based) | STAR |
jM | [jM:B:c,M1,M2,...] Intron motifs for all junctions (i.e. N in CIGAR) | STAR |
nM | The number of mismatches per (paired) alignment | STAR |
sF | The feature type and number of genes for each read | STAR v2.7.11a and later |