Anonymize or Deidentify Data - dcm4che/dcm4chee-arc-light GitHub Wiki

This page briefly guides and acquaints the user with coercion type to anonymize or de-identify DICOM dataset attributes' values.

Content

Brief Description

A DICOM dataset coming in or going out of the archive may require anonymization or de-identification of certain attribute values, to allow DICOM data to be used for research or teaching purposes or to share with any third party systems. Anonymize or de-identify certain DICOM attributes with help of a Anonymize Data coercion type.

Anonymization or de-identification of DICOM dataset attribute values is done in conformance with DICOM Part 15's Attribute Confidentiality Profiles

Refer

Applicability using Archive Attribute Coercion rules

New Archive Attribute Coercion can be used to anonymize or de-identify DICOM attributes in :

  • incoming C-STORE requests on receive of DICOM objects to archive sent from external systems
  • outgoing C-STORE requests on sending DICOM objects from archive to external systems
  • incoming (MWL) C-FIND requests to archive
  • incoming (MWL) C-FIND responses to archive
  • outgoing (MWL) C-FIND requests from archive
  • incoming MPPS N-CREATE requests to archive

Legacy Archive Attribute Coercions can be used to anonymize or de-identify DICOM attributes in :

  • outgoing C-STORE requests on sending DICOM objects from archive to external systems

Configurable fields on Archive Attribute Coercion rules

Anonymizing or de-identifying DICOM dataset attributes can be applied using either :

(Recommended) Specifying one or more Supported De-identification Profiles as comma separated in Attribute Coercion URI, f.e. deidentify:RetainDeviceIdentityOption,RetainUIDsOption on New Archive Attribute Coercion

OR

Selecting one or more Supported De-identification Profiles in field De-identification(s) on Legacy Archive Attribute Coercion

See

De-identification Action Codes

Each de-identification profile or option protects or retains specific DICOM attributes in a dataset, based on specific de-identification action codes.

Action Code Meaning
D replace with a non-zero length value that may be a dummy value and consistent with the Value Representation
Z replace with a zero length value, or a non-zero length value that may be a dummy value and consistent with the Value Representation
X remove the attribute value
U replace with a non-zero length UID that is internally consistent within a set of Instances
K keep (unchanged for non-Sequence Attributes, cleaned for Sequences)

Action codes are applicable to both Sequence and non Sequence attributes. In the case of Sequences, the action is applicable to the Sequence and all of its contents.

Supported De-identification Profiles

If this option is used in the coercion i.e. Attribute Coercion URI is set as deidentify:RetainLongitudinalTemporalInformationFullDatesOption, following date time fields in incoming DICOM objects are retained in the DB.

  • Curve Date (0008,0025)
  • Curve Time (0008,0035)
  • Expected Completion Date Time (0040,4011)
  • Instance Coercion Date Time (0008,0015)
  • Instance Creation Date (0008,0012)
  • Instance Creation Time (0008,0013)
  • Last Menstrual Date (0010,21D0)
  • Observation Date Time (0040,A032)
  • Observation Date (Trial) (0040,A192)
  • Observation Time (Trial) (0040,A193)
  • Overlay Date (0008,0024)
  • Overlay Time (0008,0034)
  • Performed Procedure Step End Date (0040,0250)
  • Performed Procedure Step End Date Time (0040,4051)
  • Performed Procedure Step End Time (0040,0251)
  • Performed Procedure Step Start Date (0040,0244)
  • Performed Procedure Step Start Date Time (0040,4050)
  • Performed Procedure Step Start Time (0040,0245)
  • Procedure Step Cancellation Date Time (0040,4052)
  • Scheduled Procedure Step End Date (0040,0004)
  • Scheduled Procedure Step End Time (0040,0005)
  • Scheduled Procedure Step Modification Date Time (0040,4010)
  • Scheduled Procedure Step Start Date (0040,0002)
  • Scheduled Procedure Step Start Date Time (0040,4005)
  • Scheduled Procedure Step Start Time (0040,0003)
  • Timezone Offset From UTC (0008,0201)
  • Acquisition Date (0008,0022)
  • Acquisition Time (0008,0032)
  • Admitting Date (0038,0020)
  • Admitting Time (0038,0021)
  • Series Date (0008,0021)
  • Series Time (0008,0031)
  • Study Date (0008,0020)
  • Study Time (0008,0030)
  • Acquisition Date Time (0008,002A)
  • Content Date (0008,0023)
  • Content Time (0008,0033)
  • End Acquisition Date Time (0018,9517)
  • Start Acquisition Date Time (0018,9516)
  • Verification Date Time (0040,A030)

If this option is used in the coercion i.e. Attribute Coercion URI is set as deidentify:RetainDeviceIdentityOption, following device identity specific fields in incoming DICOM objects are retained in the DB.

  • Cassette ID (0018,1007)
  • Gantry ID (0018,1008)
  • Generator ID (0018,1005)
  • Performed Station AE Title (0040,0241)
  • Performed Station Geographic Location Code Sequence (0040,4030)
  • Performed Station Name (0040,0242)
  • Performed Station Name Code Sequence (0040,4028)
  • Plate ID (0018,1004)
  • Scheduled Procedure Step Location (0040,0011)
  • Scheduled Station AE Title (0040,0001)
  • Scheduled Station Geographic Location Code Sequence (0040,4027)
  • Scheduled Station Name (0040,0010)
  • Scheduled Station Name Code Sequence (0040,4025)
  • Scheduled Study Location (0032,1020)
  • Scheduled Study Location AE Title (0032,1021)
  • Source Serial Number (3008,0105)
  • Detector ID (0018,700A)
  • Device Serial Number (0018,1000)
  • Station Name (0008,1010)
  • Device UID (0018,1002)

If this option is used in the coercion i.e. Attribute Coercion URI is set as deidentify:RetainInstitutionIdentityOption, following institution identity specific fields in incoming DICOM objects are retained in the DB.

  • Institution Name (0008,0080)
  • Institution Code Sequence (0008,0082)
  • Institution Address (0008,0081)
  • Institutional Department Name (0008,1040)
  • Institutional Department Type Code Sequence (0008,1041)

If this option is used in the coercion i.e. Attribute Coercion URI is set as deidentify:RetainUIDsOption, SOP Class & SOP Instance UIDs within following fields in incoming DICOM objects are retained in the DB.

  • Referenced Performed Procedure Step Sequence (0008,1111)
  • Referenced Study Sequence (0008,1110)

6. Retain Patient ID Hash Option (proprietary to DCM4CHEE 5)

If this option is used in the coercion i.e. Attribute Coercion URI is set as deidentify:RetainPatientIDHashOption, patient identifier values in incoming DICOM objects are hashed and are retained in the DB.

  • Patient ID (0010,0020)
  • Issuer of Patient ID (0010,0021)
  • Issuer of Patient ID Qualifiers Sequence (0010,0024)
⚠️ **GitHub.com Fallback** ⚠️