[Moses-support] Cumulative BLEU scores

Nat Gillin Mon, 24 Oct 2016 21:01:39 -0700

Dear Moses Community,

When using mteval-13a.pl, we note that the output looks like this:


length ratio: 1.07303974221267 (1998/1862), penalty (log): 0

NIST score = 5.0564  BLEU score = 0.2318 for system "Google"


# ------------------------------------------------------------------------


Individual N-gram scoring

        1-gram   2-gram   3-gram   4-gram   5-gram   6-gram   7-gram
8-gram   9-gram

        ------   ------   ------   ------   ------   ------   ------
------   ------

 NIST:  4.4488   0.5554   0.0477   0.0045   0.0000   0.0000   0.0000
0.0000   0.0000  "Google"


 BLEU:  0.5415   0.2972   0.1752   0.1025   0.0626   0.0354   0.0193
0.0085   0.0017  "Google"


# ------------------------------------------------------------------------

Cumulative N-gram scoring

        1-gram   2-gram   3-gram   4-gram   5-gram   6-gram   7-gram
8-gram   9-gram

        ------   ------   ------   ------   ------   ------   ------
------   ------

 NIST:  4.4488   5.0043   5.0520   5.0564   5.0564   5.0564   5.0564
5.0564   5.0564  "Google"


 BLEU:  0.5415   0.4012   0.3044   0.2318   0.1784   0.1362   0.1031
0.0754   0.0493  "Google"

And at
https://github.com/moses-smt/mosesdecoder/blob/master/scripts/generic/mteval-v13a.pl#L823,
it tries to calculate the cumulative score by accumulate the individual
ngram precisions and at each order of ngram add to it and do a
normalization before calculating the cumulative score for each order of
nrgram.

The question is why does it add the brevity penalty? (i.e. $len_score)

Also, is this score discussed in any paper?

Thanks in advance for the clarifications!

Regards,
Nat

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

[Moses-support] Cumulative BLEU scores

Reply via email to