Import Centre - freedict/fd-dictionaries GitHub Wiki

Import Centre

The dictionaries in FreeDict have a variety of sources, including external projects or manually written dictionaries. This page lists the tools that FreeDict uses to import the latest version of a dictionary. Please note that not all dictionaries are in the fd-tools repository, generated ones are on https://download.freedict.org/generated.

In most cases, dictionaries from external sources are encoded in their own format, requiring a specialised importer. Others use common formats. FreeDict's advantage is the common pivot format (TEI XML) from which we can export to any dictionary format, given we support the conversion from TEI.

Please document importers here Include the used language, usage hints, the location in the Git repository and other instructions. All importers are located below tools/. Please create a separate wiki site, with an entry from here pointing to it, if your explanations are too long.

  • dict2tei.py - conversion of an already formatted dictd database into TEI format

  • ding2tei: Haskell programm to import the DING eng-deu / deu-eng dictionary

  • hd2tei.pl - conversion of the "hd" format (which dictfmt also understands) into TEI format

  • JMdict*.xsl: Style sheets to create dictionaries out of the JMdict project. The process is documented here.

  • tab2tei.pl - conversion of tab delimited plain text file into TEI format