File structure - VincTheSecond/rextractor GitHub Wiki

File structure of a RExtractor installation:

  • /RExtractor -- Perl module (.pm) files

  • /data -- documents processed by RExtractor

    • /submitted -- original documents submitted to the system

    • /converted -- internal XML format created from the original submitted document

    • /treex -- output of the Treex NLP system (XML Treex files)

    • /serialized -- simple export of the most important NLP data from the Treex XML files

    • /logs -- individual log files for each submitted document

    • /exported -- output of the RExtractor analysis. It contains the original files modified with annotations and a description file listing the entities and relations.

  • /database -- Database of Entities (DBE) and Database of Relations (DBR)

  • /images -- images for web interface

  • /javascript -- JavaScript files for web interface

  • /styles -- CSS files for web interface

  • /servers -- temporary and internal files for RExtractor components

    • /logs -- component log files

    • /pids -- PIDs of running daemons

    • /tmp -- temporary files
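
The layout above can be checked with a short script. The following is a minimal sketch, not part of RExtractor itself: the names `EXPECTED_DIRS` and `missing_dirs` are hypothetical, and the paths are simply the directories from the listing, taken relative to an installation root that you supply.

```python
from pathlib import Path

# Directories from the listing above, relative to the installation root.
# Assumption: the sub-items are nested under /data and /servers as shown.
EXPECTED_DIRS = [
    "RExtractor",
    "data/submitted",
    "data/converted",
    "data/treex",
    "data/serialized",
    "data/logs",
    "data/exported",
    "database",
    "images",
    "javascript",
    "styles",
    "servers/logs",
    "servers/pids",
    "servers/tmp",
]


def missing_dirs(root):
    """Return the expected directories that do not exist under root."""
    root = Path(root)
    return [d for d in EXPECTED_DIRS if not (root / d).is_dir()]
```

Calling `missing_dirs("/path/to/installation")` returns an empty list when every listed directory is present, and otherwise the relative paths that are missing.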