treealigneval - Helsinki-NLP/Lingua-Align GitHub Wiki

Table of Contents

NAME

treealigneval - a script for computing precision and recall scores for tree aligmnent

SYNOPSIS

DESCRIPTION

Both gold-standard-file and tree-alignment-file should be in Stockholm Tree Aligner Format. Here is an example:

The `treealigneval` script will read both files and compare the links. It will output precision, recall and F values. Here is an example output:

`NT` refers to non-terminal nodes and `T` refers to terminal nodes (treealigneval uses type attributes in the alignment file to determine if a node is a terminal node or a non-terminal node; if this attribute is not included it assumes that all nodes with an I500 is a terminal node). Precision and recall values for specific link types may be lower than the overall numbers because the proposed link type has to match whereas in the total numbers all proposed links are considered.

OPTIONS

 * -b firstSentId Start evaluating at this source language sentence ID. If you don't specify -b the evaluation script will use all sentences for which at least one link has been proposed. That means that the scores might be too high because the aligner may just not have aligned anything for in some sentence pairs (usually it will be fine).
 * -e lastSentId Stop evaluating at this source language sentence ID. This is for the same reason as for -b.
 * -g format This specifies the format of the gold standard file. Default is `sta` (Stockholm Tree Aligner format). Other formats are not really included/tested yet. An alternative would be, for example, the format of the Dublin Subtree Aligner.
 * -s format The format of the tree aligmnets proposed by the system. Default is again `sta`.

SEE ALSO

Lingua::treealign, Lingua::Align::Trees

AUTHOR

Joerg Tiedemann

COPYRIGHT AND LICENSE

Copyright (C) 2009 by Joerg Tiedemann

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.8 or, at your option, any later version of Perl 5 you may have available.

POD ERRORS

Hey! The above document had some coding errors, which are explained below:

 * Around line 368: Unterminated D<...> sequence

Deleting unknown formatting code D<>

⚠️ **GitHub.com Fallback** ⚠️