Install local data - presemt-ntnu/transglobal GitHub Wiki

All locally cached data is located in the _local directory. It contains preprocessed data such as pickled dictionaries and lemmatized evaluation data. There are two ways to obtain the local data: (1) download a copy of all local data or (2) build all local data yourself.

Download local data

To download the local data, run:

$ env/download_local_data.sh

(It may be convenient to set up passwordless SSH access to the server translate.idi.ntnu.no.)
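One common way to get passwordless access is key-based SSH login. The following is a minimal sketch, not part of the transglobal repository: the function name setup_passwordless_ssh is made up for illustration, the key type and key path are assumptions, and it relies on the standard OpenSSH tools ssh-keygen and ssh-copy-id being available (ssh-copy-id may be missing on older Mac OS X versions).

```shell
#!/bin/sh
# Hypothetical helper (not part of transglobal): install an SSH public key
# on a remote host so that later logins do not prompt for a password.
setup_passwordless_ssh() {
    # $1: login in the form user@host
    if [ -z "${1:-}" ]; then
        echo "usage: setup_passwordless_ssh user@host" >&2
        return 2
    fi

    # Assumed key location; adjust if you already use a different key.
    key="$HOME/.ssh/id_ed25519"

    # Generate a key pair only if this one does not exist yet.
    if [ ! -f "$key" ]; then
        mkdir -p "$HOME/.ssh" && chmod 700 "$HOME/.ssh" || return 1
        ssh-keygen -t ed25519 -f "$key" || return 1
    fi

    # Copy the public key to the server; this asks for the password once.
    ssh-copy-id -i "$key.pub" "$1"
}
```

After sourcing the function, it could be called as `setup_passwordless_ssh yourname@translate.idi.ntnu.no`, where `yourname` stands for your own account on the server.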

Build local data

So far, the local data has only been built on Mac OS X. However, it should also work on Linux, provided the taggers are installed.

To build the local data yourself, you need:

  • private data installed
  • TreeTagger installed for English and German
  • Oslo-Bergen Tagger installed
  • an internet connection (to access the web service for annotating Greek input)
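Before starting a build, it can help to check that the tagger executables are actually on your PATH. The sketch below is an illustration only: missing_commands is a hypothetical helper, not something shipped with transglobal, and the command names you pass to it depend on how the taggers were installed on your machine.

```shell
#!/bin/sh
# Hypothetical helper (not part of transglobal): print every named command
# that cannot be found on PATH and return non-zero if any is missing.
missing_commands() {
    missing=0
    for cmd in "$@"; do
        if ! command -v "$cmd" >/dev/null 2>&1; then
            echo "missing: $cmd"
            missing=1
        fi
    done
    return "$missing"
}
```

For example, `missing_commands tree-tagger` checks for the TreeTagger binary under its usual name; the executables of the Oslo-Bergen Tagger vary by installation, so add whichever names your setup uses. The check covers only executables, not the private data or the web service for Greek.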

Then run:

$ env/setup_local_data.py