The script implements the BLEU score as described in the original paper by Papineni et al.:

Papineni, Kishore, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2001. Bleu: a method for automatic evaluation of machine translation. Research Report RC22176, IBM Research
Division, Thomas J. Watson Research Center.

The four figures "91.3/65.9/50.9/39.5" are the precisions of 1-grams, 2-grams, 3-grams, and 4-grams, respectively,

and the global score (BLEU) is computed as the geometric mean of these precisions multiplied by the Brevity Penalty (BP),
which is calculated from the length ratio (printed as "ration" in the script output).
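
As a rough check, the computation can be sketched in a few lines of Python. This is only an illustration, not a transcription of the Perl script: the function names are mine, the BP formula is the one from the paper (1 when the output is at least as long as the reference, exp(1 - 1/ratio) otherwise), and I am assuming the ratio is output length divided by reference length.

```python
import math


def brevity_penalty(ratio):
    """BP from the output/reference length ratio (formula from the paper)."""
    if ratio >= 1.0:
        return 1.0
    return math.exp(1.0 - 1.0 / ratio)


def bleu_from_precisions(precisions_pct, bp):
    """Geometric mean of the n-gram precisions (given in percent) times BP."""
    if any(p <= 0 for p in precisions_pct):
        # The geometric mean is zero as soon as one precision is zero.
        return 0.0
    log_mean = sum(math.log(p / 100.0) for p in precisions_pct) / len(precisions_pct)
    return 100.0 * bp * math.exp(log_mean)


# Figures quoted in this thread for Pair 1.
bp = brevity_penalty(0.814)
score = bleu_from_precisions([91.3, 65.9, 50.9, 39.5], bp)
print(f"BP={bp:.3f} BLEU={score:.2f}")
```

With the rounded figures from the thread this lands very close to the reported BLEU = 46.94 and BP = 0.796. Small differences in the last digit are expected, because the script works with the unrounded counts.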

Look at the paper for more details.

The prefix "multi" means that the script computes the BLEU score against multiple references.
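
To illustrate what multiple references change, here is a minimal Python sketch of the modified n-gram precision from the paper: each n-gram of the output is credited at most as many times as it occurs in the single reference where it is most frequent. The function names and the whitespace tokenization are my own assumptions, and the Perl script may differ in details.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(candidate, references, n):
    """Clipped n-gram precision of a candidate against several references."""
    cand_counts = ngrams(candidate, n)
    # For each n-gram, keep the highest count seen in any single reference.
    max_ref_counts = Counter()
    for ref in references:
        for gram, count in ngrams(ref, n).items():
            max_ref_counts[gram] = max(max_ref_counts[gram], count)
    # Clip each candidate count by that maximum.
    clipped = sum(min(count, max_ref_counts[gram]) for gram, count in cand_counts.items())
    total = sum(cand_counts.values())
    return clipped / total if total else 0.0


# Example taken from the paper: "the" occurs at most twice in a single reference.
candidate = "the the the the the the the".split()
references = [
    "the cat is on the mat".split(),
    "there is a cat on the mat".split(),
]
print(modified_precision(candidate, references, 1))  # 2 clipped matches out of 7
```

Adding a reference can only raise the clipping limits, so scores obtained with different numbers of references are not directly comparable.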

best regards
Nicola


On Nov 25, 2008, at 11:15 PM, Alok Kothari wrote:

How should I interpret the multi-bleu score?

These are my results for certain language pairs:
 Pair 1 ->  BLEU = 46.94, 91.3/65.9/50.9/39.5 (BP=0.796, ration=0.814)
 Pair 2-> BLEU = 0.47, 28.2/3.3/0.5/0.3 (BP=0.236, ration=0.409)
using the Perl script given here:

http://www.statmt.org/wmt06/shared-task/multi-bleu.perl

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
