UFAL - sporedata/researchdesigneR GitHub Wiki

General description

UFAL Medical Corpus v. 1.0 (https://ufal.mff.cuni.cz/ufal_medical_corpus) is a collection of parallel corpora that aims at a more reliable machine translation of medical texts. It contains parallel sentences including Czech, French, German, Hungarian, Polish, Romanian Spanish, and Swedish; each language paired with English.

UFAL Medical Corpus v.1.0 also serves as the training data for WMT17 Biomedical Task.

Data access

Upon registration, you will receive a unique username. This unique username along with a shared password "ufalmedi" will be requested at the following link: