finnpos–eval - mpsilfve/FinnPos GitHub Wiki

Usage

finnpos-eval sys_data gold_data (model)

Purpose

Evaluate a tagged file against a gold standard file.

Notes

The data files sys_data and gold_data need to conform the the FinnPos data format. Additionally, they need to represent the same input data. that is their word forms need to agree.

If a model is provided, finnpos-eval can provide statistics separately for in-vocabuary and out-of-vocabulary words.

Examples

$ finnpos-eval ftb.test.sys ftb.test.gold ftb.model
Comparing ftb.test.sys and ftb.test.gold (gold standard).
Label accuracy for all words: 0.932939
Label accuracy for IV words:  0.970814
Label accuracy for OOV words: 0.787924
Lemma accuracy for all words: 0.902333
Lemma accuracy for IV words:  0.989879
Lemma accuracy for OOV words: 0.567137

$ finnpos-eval ftb.test.sys ftb.test.gold
Comparing ftb.test.sys and ftb.test.gold (gold standard).
Label accuracy for all words: 0.932939
Label accuracy for IV words:  -1
Label accuracy for OOV words: -1
Lemma accuracy for all words: 0.902333
Lemma accuracy for IV words:  -1
Lemma accuracy for OOV words: -1
-1 denotes unknown value.