CLEANDUPLICATES - winkusch/Easy2 GitHub Wiki
Cleans duplicates. The number of duplicates is written to the report in the variable numDuplicates.
FUNCTION | PARAMETER | DEFAULT | DESCRIPTION |
---|---|---|---|
CLEANDUPLICATES | --colInMarker | | Column name of the input marker column. |
CLEANDUPLICATES | --strMode | keepfirst | Mode for handling duplicates. Optional. Use ‘keepfirst’ to keep the first occurrence of a SNP and exclude all later occurrences; ‘removeall’ to remove all duplicated SNPs; ‘keepall’ to keep all duplicated SNPs without removing any rows; ‘samplesize’ to keep the duplicated SNP with the highest sample size. |
CLEANDUPLICATES | --colN | | Column name of the sample size column. Must be defined if strMode is set to ‘samplesize’. |
CLEANDUPLICATES | --colCrit | | Column on which the criterion (min/max) is applied. |
CLEANDUPLICATES | --strCritDir | | Direction for duplicate exclusion. |
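
The four `--strMode` values can be illustrated with a small Python sketch. This is not Easy2's implementation, only a reading of the mode descriptions in the table. The function name `clean_duplicates`, the column names `SNP` and `N`, and the example rows are hypothetical. How ties in sample size are resolved is not documented here; the sketch assumes the first of the tied rows is kept. `--colCrit` and `--strCritDir` are not covered.

```python
def clean_duplicates(rows, marker_col, mode="keepfirst", n_col=None):
    """Return the rows that remain after handling duplicated markers.

    rows       : list of dicts, one dict per input row
    marker_col : key of the marker column (the role of --colInMarker)
    mode       : 'keepfirst', 'removeall', 'keepall' or 'samplesize' (--strMode)
    n_col      : key of the sample size column (the role of --colN)
    """
    if mode not in ("keepfirst", "removeall", "keepall", "samplesize"):
        raise ValueError(f"unknown mode: {mode!r}")
    if mode == "samplesize" and n_col is None:
        raise ValueError("n_col must be given when mode is 'samplesize'")
    if mode == "keepall":
        return list(rows)  # no rows are removed

    # Collect the row indices of each marker, in order of appearance.
    groups = {}
    for idx, row in enumerate(rows):
        groups.setdefault(row[marker_col], []).append(idx)

    keep = set()
    for indices in groups.values():
        if len(indices) == 1 or mode == "keepfirst":
            keep.add(indices[0])
        elif mode == "samplesize":
            # Assumption: on ties, max() returns the first of the tied rows.
            keep.add(max(indices, key=lambda i: rows[i][n_col]))
        # mode == 'removeall': every row of a duplicated marker is dropped

    # Preserve the original row order in the output.
    return [row for idx, row in enumerate(rows) if idx in keep]


# Hypothetical example input: rs1 appears twice.
rows = [
    {"SNP": "rs1", "N": 100},
    {"SNP": "rs2", "N": 250},
    {"SNP": "rs1", "N": 300},
    {"SNP": "rs3", "N": 50},
]

for mode in ("keepfirst", "removeall", "keepall", "samplesize"):
    result = clean_duplicates(rows, "SNP", mode=mode, n_col="N")
    print(mode, [(r["SNP"], r["N"]) for r in result])
```

With this input, ‘keepfirst’ retains the rs1 row with N = 100, ‘samplesize’ retains the rs1 row with N = 300, ‘removeall’ drops both rs1 rows, and ‘keepall’ returns all four rows.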