1.1 General Design and Architecture - PRIDE-Archive/ms-data-core-api GitHub Wiki

This is the general design and architecture of the library:

ms-data-core-api Design

The current Data Object Model supports file formats containing spectrum, identification and quantitation information. Specifically, it supports three major PSI (Proteomics Standards Initiative) data standard formats: mzML, mzIdentML and mzTab. Each file format is read by its own file-specific reader and translated into the Data Object Model by Transformers. The Data Object Model consists of classes representing the main data types in proteomics studies, such as chromatogram, spectrum, peptide and protein. A novel cache system was implemented to improve the performance and reduce the memory usage of the library. This cache system is especially useful for GUI (Graphical User Interface) components that require concurrent operations on the same data. Finally, a set of controllers that extend a general DataAccessController interface enables data retrieval from the Data Object Model.
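
The layering described above (reader, Transformer, Data Object Model, cache, controller) can be illustrated with a minimal Java sketch. It is not the library's actual code: only the name DataAccessController comes from the text above, and its methods here are assumed. Every other class, method and the toy "space-separated m/z values" record format are hypothetical stand-ins chosen to keep the example self-contained.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Usage example: the caller only talks to the controller.
final class ArchitectureSketch {
    public static void main(String[] args) {
        InMemoryReader reader = new InMemoryReader().put("scan=1", "100.1 200.2 300.3");
        DataAccessController controller = new CachedSpectrumController(reader, 100);
        Spectrum first = controller.getSpectrumById("scan=1");  // read + transform
        Spectrum again = controller.getSpectrumById("scan=1");  // served from cache
        System.out.println(first.mz.length + " peaks, same instance: " + (first == again));
    }
}

// Hypothetical Data Object Model class: a spectrum with an id and its m/z values.
final class Spectrum {
    final String id;
    final double[] mz;
    Spectrum(String id, double[] mz) { this.id = id; this.mz = mz; }
}

// Hypothetical file-specific reader: hands out raw, format-specific records.
interface RawReader {
    List<String> spectrumIds();
    String readRawSpectrum(String id);
}

// Stand-in for a real file reader (e.g. for mzML); keeps records in memory.
final class InMemoryReader implements RawReader {
    private final Map<String, String> records = new LinkedHashMap<>();
    private int reads = 0;
    InMemoryReader put(String id, String peaks) { records.put(id, peaks); return this; }
    public List<String> spectrumIds() { return new ArrayList<>(records.keySet()); }
    public String readRawSpectrum(String id) { reads++; return records.get(id); }
    int readCount() { return reads; }  // lets us observe cache hits vs. misses
}

// Hypothetical Transformer: converts a raw record into a Data Object Model object.
final class SpectrumTransformer {
    static Spectrum transform(String id, String raw) {
        String[] parts = raw.trim().split("\\s+");
        double[] mz = new double[parts.length];
        for (int i = 0; i < parts.length; i++) mz[i] = Double.parseDouble(parts[i]);
        return new Spectrum(id, mz);
    }
}

// Bounded least-recently-used cache; an assumed policy, not the library's.
final class LruCache<K, V> extends LinkedHashMap<K, V> {
    private final int capacity;
    LruCache(int capacity) { super(16, 0.75f, true); this.capacity = capacity; }
    @Override
    protected boolean removeEldestEntry(Map.Entry<K, V> eldest) { return size() > capacity; }
}

// Interface named in the text; the two methods below are assumptions.
interface DataAccessController {
    List<String> getSpectrumIds();
    Spectrum getSpectrumById(String id);
}

// Controller that answers from the cache and falls back to reader + Transformer.
final class CachedSpectrumController implements DataAccessController {
    private final RawReader reader;
    private final Map<String, Spectrum> cache;
    CachedSpectrumController(RawReader reader, int cacheSize) {
        this.reader = reader;
        this.cache = new LruCache<>(cacheSize);
    }
    public List<String> getSpectrumIds() { return reader.spectrumIds(); }
    // synchronized so that concurrent callers (e.g. GUI components) share one cache safely
    public synchronized Spectrum getSpectrumById(String id) {
        Spectrum cached = cache.get(id);
        if (cached != null) return cached;
        String raw = reader.readRawSpectrum(id);
        if (raw == null) return null;  // unknown id
        Spectrum spectrum = SpectrumTransformer.transform(id, raw);
        cache.put(id, spectrum);
        return spectrum;
    }
}
```

The point of the sketch is the separation of concerns: supporting another file format would only require a new reader and Transformer, while callers keep using the same controller interface and benefit from the shared cache.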