duplicate_status - AtlasOfLivingAustralia/ala-dataquality GitHub Wiki

Superseded - replaced by duplicateStatus

Short description

Identifies suspected duplicates (D), and what appears to be the best representation of suspected duplicate records (R)

Description

Provides information on whether a record is a suspected duplicate. Values are: "R" for the representative record for a set of suspected duplicates; "D" for suspected duplicates; and, "Not Supplied" where there is no suspected duplication.

Relevant standards

Expert vocabulary

ALA usage

Values are:

D - Duplicate

R - Representative record

Not supplied (for no value)

Technical description, provenance, code

https://github.com/AtlasOfLivingAustralia/biocache-store/blob/651ecb50870dfeb15b2be91dfb6b3641d4166b7f/src/main/scala/au/org/ala/biocache/tool/DuplicationDetection.scala