public abstract class AbstractOpticalDuplicateFinderCommandLineProgram extends CommandLineProgram
Modifier and Type | Field and Description |
---|---|
protected static htsjdk.samtools.util.Log | LOG |
int | OPTICAL_DUPLICATE_PIXEL_DISTANCE |
protected OpticalDuplicateFinder | opticalDuplicateFinder |
java.lang.String | READ_NAME_REGEX |
Fields inherited from class CommandLineProgram: COMPRESSION_LEVEL, CREATE_INDEX, CREATE_MD5_FILE, GA4GH_CLIENT_SECRETS, MAX_RECORDS_IN_RAM, QUIET, REFERENCE_SEQUENCE, TMP_DIR, VALIDATION_STRINGENCY, VERBOSITY
Constructor and Description |
---|
AbstractOpticalDuplicateFinderCommandLineProgram() |
Modifier and Type | Method and Description |
---|---|
protected java.lang.String[] | customCommandLineValidation() Put any custom command-line validation in an override of this method. |
void | setupOpticalDuplicateFinder() |
Methods inherited from class CommandLineProgram: doWork, getCommandLine, getCommandLineParser, getDefaultHeaders, getMetricsFile, getNestedOptions, getNestedOptionsForHelp, getStandardUsagePreamble, getVersion, instanceMain, instanceMainWithExit, parseArgs, setDefaultHeaders
protected static htsjdk.samtools.util.Log LOG
@Option(doc="Regular expression that can be used to parse read names in the incoming SAM file. Read names are parsed to extract three variables: tile/region, x coordinate and y coordinate. These values are used to estimate the rate of optical duplication in order to give a more accurate estimated library size. Set this option to null to disable optical duplicate detection. The regular expression should contain three capture groups for the three variables, in order. It must match the entire read name. Note that if the default regex is specified, a regex match is not actually done; instead, the read name is split on the colon character. For 5-element names, the 3rd, 4th and 5th elements are assumed to be tile, x and y values. For 7-element names (CASAVA 1.8), the 5th, 6th and 7th elements are assumed to be tile, x and y values.", optional=true) public java.lang.String READ_NAME_REGEX
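The two parsing paths described for READ_NAME_REGEX can be illustrated with a minimal, self-contained sketch. This is not Picard's implementation: the class name `ReadNameParsingSketch`, both method names, and the sample read names are hypothetical, and the code only mirrors the rules stated in the option text (colon split for 5- or 7-element names, or a custom regex with three capture groups that must match the entire read name).

```java
import java.util.Arrays;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch of the documented read-name rules; not Picard code.
class ReadNameParsingSketch {

    /** Colon-split path: returns {tile, x, y}, or null if the name has neither 5 nor 7 elements. */
    static int[] parseBySplit(String readName) {
        String[] fields = readName.split(":");
        int offset;
        if (fields.length == 5) {
            offset = 2;            // 3rd, 4th and 5th elements
        } else if (fields.length == 7) {
            offset = 4;            // 5th, 6th and 7th elements (CASAVA 1.8)
        } else {
            return null;
        }
        return toInts(fields[offset], fields[offset + 1], fields[offset + 2]);
    }

    /** Custom-regex path: the pattern must match the entire name and expose three capture groups. */
    static int[] parseByRegex(Pattern pattern, String readName) {
        Matcher m = pattern.matcher(readName);
        if (!m.matches() || m.groupCount() < 3) {
            return null;
        }
        return toInts(m.group(1), m.group(2), m.group(3));
    }

    // Converts the three captured strings to ints; null if any is not numeric.
    private static int[] toInts(String tile, String x, String y) {
        try {
            return new int[] {Integer.parseInt(tile), Integer.parseInt(x), Integer.parseInt(y)};
        } catch (NumberFormatException e) {
            return null;
        }
    }

    public static void main(String[] args) {
        // Made-up read names, for illustration only.
        System.out.println(Arrays.toString(parseBySplit("M01:7:1101:15589:1332")));         // [1101, 15589, 1332]
        System.out.println(Arrays.toString(parseBySplit("INST:42:FC1:1:1101:15589:1332"))); // [1101, 15589, 1332]
        Pattern custom = Pattern.compile("[^_]+_(\\d+)_(\\d+)_(\\d+)");
        System.out.println(Arrays.toString(parseByRegex(custom, "run7_1101_15589_1332")));  // [1101, 15589, 1332]
    }
}
```

Returning null for names that do not fit either rule is a choice made for this sketch; the option text does not say how unparseable names are handled.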
@Option(doc="The maximum offset between two duplicate clusters in order to consider them optical duplicates. The default is appropriate for unpatterned versions of the Illumina platform. For the patterned flowcell models, 2500 is more appropriate. For other platforms and models, users should experiment to find what works best.") public int OPTICAL_DUPLICATE_PIXEL_DISTANCE
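To make the role of OPTICAL_DUPLICATE_PIXEL_DISTANCE concrete, here is a minimal sketch of one plausible distance test. It assumes that two clusters count as optical duplicates only when they share a tile and differ by at most the configured distance on each axis; the option text does not spell out the exact comparison, so treat the per-axis rule, the class name `PixelDistanceSketch`, and the method `withinPixelDistance` as assumptions for illustration.

```java
// Hypothetical sketch; the per-axis comparison is an assumption, not taken from the documentation above.
class PixelDistanceSketch {

    /**
     * Returns true when both clusters are on the same tile and their x and y
     * offsets are each no larger than maxPixelDistance.
     */
    static boolean withinPixelDistance(int tileA, int xA, int yA,
                                       int tileB, int xB, int yB,
                                       int maxPixelDistance) {
        return tileA == tileB
                && Math.abs(xA - xB) <= maxPixelDistance
                && Math.abs(yA - yB) <= maxPixelDistance;
    }

    public static void main(String[] args) {
        // Same tile, offsets of 50 and 90 pixels, threshold 100.
        System.out.println(withinPixelDistance(1101, 1000, 1000, 1101, 1050, 1090, 100)); // true
        // Same coordinates as above but a larger x offset (200 pixels).
        System.out.println(withinPixelDistance(1101, 1000, 1000, 1101, 1200, 1090, 100)); // false
        // The larger threshold suggested for patterned flowcells accepts the same pair.
        System.out.println(withinPixelDistance(1101, 1000, 1000, 1101, 1200, 1090, 2500)); // true
    }
}
```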
protected OpticalDuplicateFinder opticalDuplicateFinder
public AbstractOpticalDuplicateFinderCommandLineProgram()
public void setupOpticalDuplicateFinder()
protected java.lang.String[] customCommandLineValidation()
Overrides: customCommandLineValidation in class CommandLineProgram
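A subclass might override customCommandLineValidation() along the following lines. The sketch is self-contained and does not compile against Picard: `ProgramStandIn` is a hypothetical stand-in for the real base class, `RegexValidatingProgram` is an invented subclass, and the convention of returning null when the arguments are valid and an array of messages otherwise is an assumption, as are the specific checks.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;
import java.util.regex.PatternSyntaxException;

// Hypothetical stand-in for the real base class, so the example runs on its own.
abstract class ProgramStandIn {
    /** Assumed convention: null means "valid"; otherwise one message per problem. */
    protected String[] customCommandLineValidation() {
        return null;
    }
}

// Invented subclass that validates the two options documented on this page.
class RegexValidatingProgram extends ProgramStandIn {
    public String READ_NAME_REGEX;
    public int OPTICAL_DUPLICATE_PIXEL_DISTANCE;

    @Override
    protected String[] customCommandLineValidation() {
        List<String> errors = new ArrayList<>();
        if (READ_NAME_REGEX != null) {   // null disables optical duplicate detection
            try {
                // groupCount() reports the number of capture groups in the pattern.
                int groups = Pattern.compile(READ_NAME_REGEX).matcher("").groupCount();
                if (groups != 3) {
                    errors.add("READ_NAME_REGEX should contain three capture groups, found " + groups);
                }
            } catch (PatternSyntaxException e) {
                errors.add("READ_NAME_REGEX is not a valid regular expression: " + e.getDescription());
            }
        }
        if (OPTICAL_DUPLICATE_PIXEL_DISTANCE < 0) {
            errors.add("OPTICAL_DUPLICATE_PIXEL_DISTANCE must not be negative");
        }
        return errors.isEmpty() ? null : errors.toArray(new String[0]);
    }
}
```

Collecting every problem before returning, rather than stopping at the first, lets a caller report all invalid options in one pass.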