CitationCorpus Integration - renepickhardt/related-work.net GitHub Wiki

Project Structure

The project is naturally structured in three layers:

  • DataImport
  • CorpusDatastore
  • FrontEnd Applications

DataImport

This layer covers:

  • download of source files
  • cleanup of data
  • conversion into DataStoreFormat
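
The three steps above can be sketched as a small pipeline. This is only an illustration under stated assumptions: the `Record` fields, the pipe-separated input format, and all function names are hypothetical, since the wiki does not specify the actual DataStoreFormat or the source file layout.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, List


@dataclass
class Record:
    """Hypothetical stand-in for one entry in DataStoreFormat."""
    identifier: str
    title: str
    cites: List[str]


def download(fetch: Callable[[], Iterable[str]]) -> Iterator[str]:
    """Step 1: obtain raw source lines. The fetcher is injected, so this
    sketch makes no assumption about where the source files live."""
    yield from fetch()


def cleanup(lines: Iterable[str]) -> Iterator[str]:
    """Step 2: collapse runs of whitespace and drop empty lines."""
    for line in lines:
        line = " ".join(line.split())
        if line:
            yield line


def convert(lines: Iterable[str]) -> Iterator[Record]:
    """Step 3: parse 'id | title | cited_id, cited_id' lines (an assumed
    format) into records."""
    for line in lines:
        identifier, title, cited = (part.strip() for part in line.split("|"))
        cites = [c.strip() for c in cited.split(",") if c.strip()]
        yield Record(identifier, title, cites)


def run_import(fetch: Callable[[], Iterable[str]]) -> List[Record]:
    """Chain download, cleanup, and conversion."""
    return list(convert(cleanup(download(fetch))))
```

Keeping each step a separate generator means a new data source would only need its own `fetch` and `convert` functions, while the cleanup step could be shared.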

CorpusDatastore

A standalone database application that stores the authoritative versions of the citation corpus data. It provides APIs for data import and export in well-defined semantic formats (most likely using SPAR).

FrontEnd Applications

This layer covers the RelatedWork web application and the data indexing and presentation/visualization tools to be developed by <cottagelabs.com>.

The data is read from the CorpusDatastore and imported into customized databases, each suited to its individual application.
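
As a rough illustration of that flow, the sketch below loads exported records into an application-specific SQLite database and answers one query a front end might need. Everything here is an assumption made for the example: the record shape (`id` and `cites` keys), the `citation` table, and the function names are hypothetical and do not describe the actual CorpusDatastore export API.

```python
import sqlite3
from typing import Dict, Iterable, List, Tuple


def load_into_app_db(records: Iterable[Dict]) -> sqlite3.Connection:
    """Import exported records into an in-memory database whose schema is
    tailored to one application (here: counting incoming citations)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE citation (citing TEXT, cited TEXT)")
    rows = [(rec["id"], cited) for rec in records for cited in rec["cites"]]
    conn.executemany("INSERT INTO citation VALUES (?, ?)", rows)
    conn.commit()
    return conn


def citation_counts(conn: sqlite3.Connection) -> List[Tuple[str, int]]:
    """Return (paper id, number of incoming citations), most cited first."""
    cursor = conn.execute(
        "SELECT cited, COUNT(*) FROM citation "
        "GROUP BY cited ORDER BY COUNT(*) DESC, cited"
    )
    return cursor.fetchall()
```

Because each application keeps its own copy in a schema that fits its queries, the CorpusDatastore remains the single authoritative source and the application databases can be rebuilt from it at any time.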