Home - jacquesfauquex/DCKV GitHub Wiki

DCKV representation of DICOM dataset

Background

DICOM binary (DICM) metadata format improves the key/value paradigm of any basic metadata in three axis:

the attributes are orderered within lists in function of 4-bytes tags identifying each attribute.
an attribute can be multivalued,
an attribute can be the root of an array of enclosed ordered lists of attributes

A translation of DICM into XML facilitates the discrete access to any attribute of the root ordered list, or of enclosed ones, thanks to the XML tool XPath.

Another translation of the binary model into JSON simplifies the parsing of subsets of metadata in QIDO responses using ecmascript (javascript) and the many other languages which support maps (also called associative array, or dictionary).

Both the XML and JSON translations are text-based representations derived from the explicit binary syntax. They replace the binary structuring glue by textual markup, which allows to replicate :

the association of multiple values to one key,
the encapsulation of various items into one sequence,
the nesting of the various attributes of a dataset into one item

The parsing of an attribute implies the parsing of the complete context preceding it, in order to discover its chain of encapsulation. For instance, the triplet attributes code,scheme,meaning only make sense when related to their enclosing sequence attribute.

This constraint is burdensome, especially in simple use cases. Even when the attribute should be found at the root level, previous sequences alter the classification into the serialized file and force the parser to dig into them before reaching the root level attribute of interest.

Novelty of our new representation

Our new representation aims at incorporating the context into the key part for the attribute, so that each attribute is fully defined individually by the key.

We call this new representation "Dicom Contextualized Key Value" (DCKV). We manage two blends of it:

_DKV : refers to the result of the parsing of one DICM instance
EDKV : refers to the result of the parsing of one or more DICM instances of a same Study (which we also call exam, with the initial letter E, to differentiate it from the initial letter of a Series).

DCKV can be translated back and forth from and to the already existing DICM, XML and JSON representations of DICOM datasets. It has been designed to easily serialize to anyone of the three others.

Implementation of the "ascending order" rule

In DICOM representations, the attributes shall be ordered in tag ascending order within the base dataset (and also within any encapsulated dataset).

A tag in its binary form is a sequence of two two-bytes words (group and unit). The order of the bytes within the two-bytes words depends on the endianness of the computer. But as for now, big endian has been deprecated, and the canonical little endian binary representation of the tag in DICOM binary is a sequence of four bytes as follows:

0 group less significant byte (g)
1 group most significant byte (G)
2 unit less significant byte (u)
3 unit most significant byte (U)

Such serialization makes ordering tags difficult, because it implies permuting the byte order in each of the tags before ordering them. This is in fact what is performed in the text representation of a tag by means of a chain of 8 hexadecimal chars (two consecutive ones represent one byte) which represents the order GgUu, ready for ordering.

So our internal model of a tag is a chain of 4 bytes ordered GgUu.

Key format

### SI

A list of attributes (A) is called an item (I). A special attribute type sequence (S) contains a positional list of items (I 1 comes before I 2, which comes before I 3 and so on).

SI*

Sequence containment recursion is authorized.

When the attribute is buried into one or more levels of encapsulation of sequence, a chain of sequence-item is required to locate it fully.

AR

In order to interpret correctly an attribute, its representation (R) including value representation and charset need to be known. we make this information available in 4 bytes :

two ascii letters VR (value representation) datatype
an uint16 index of charset defined in attribute (0008,0005) of the dataset. Index defined here

We label AR (8 bytes long) the attribute followed by representation details.

PF

Root tags in the dataset of a DICOM instance are not prefixed by any item number. This is so because the standard builds up on instances.

But as far as DCKV in its EDKV blend is concerned, that's the Exam which is fundamental. That's why, in order to keep together all the attributes of all the instances of an Exam, we prepend to the key an 8 bytes prefix PF (SI)* AR

PF, SI and AR are 8 bytes length each, which is a nice size for 64 bit computing.

Sequences and items delimiters

With the purpose of simplifying the serialization into binary DICOM, we also materialize each item start, item end and sequence end of the dataset as if they were attributes. To make it possible, we created 4 private representations vr.

- S start      : GGggUUuu00000000
- I start tag  :                 000000002B2B0000
- I end tag    :                 FFFFFFFF5F5F0000
- S end        : GGggUUuuFFFFFFFF

0x2B2B in ASCII is ++ 0x5F5F in ASCII is --

Example: Sequence with an empty item and an item with contents :

GGggUUuu 00000000
GGggUUuu 00000001 00000000-2B2B0000
GGggUUuu 00000001 FFFFFFFF-5F5F0000
GGggUUuu 00000002 00000000-2B2B0000
GGggUUuu 00000002 GGggUUuu-44410000
GGggUUuu 00000002 FFFFFFFF-5F5F0000
GGggUUuu FFFFFFFF

Range selection

The key ordered list allows for range selection. A specific range selection type, the sharedPrefix one is usefull for the sellection of:

groups. For instance, group 0002, private groups (odds with the exception of 0001,0003,0005,0007,FFFF)
sequence contents (all the items). Also works for encapsulated sequences
item contents

Lazy parsing and value format

We register values as are in the binary DICM representation (including padding), that is as a byte chain.

Our parsing is lazy. value parsing is differed until representation needs. This optimizes the parsing process, since many attributes will never be represented, which implies that their value doesn´t need to be parsed.

Another benefit of lazy parsing is that serialization of the values is merely a copy operation.