CLEANDUPLICATES - winkusch/Easy2 GitHub Wiki

Clean duplicates. Number of duplicates will be written to the report into variable numDuplicates.

FUNCTION PARAMETER DEFAULT DESCRIPTION
CLEANDUPLICATES --colInMarker Column name of the input marker column.
CLEANDUPLICATES --strMode keepfirst "Set mode for handling of the duplicates. Optional. Default: keepfirst Please use: ‘keepfirst’ to keep the first and to exclude all latter appearances of a SNP; ‘removeall’ to remove all duplicated SNPs; ‘keepall’ to keep all duplicated SNPs and to not remove any rows; ‘samplesize’ to keep the duplicated SNP with the highest sample size;
CLEANDUPLICATES --colN If strMode is set to “samplesize”,the column name of the samplesize column needs to be defined here.
CLEANDUPLICATES --colCrit column on which the criterium (min/max) is applied
CLEANDUPLICATES --strCritDir direction for duplicate exclusion