Hi all, I have a very naive question, as I guess so.
I'm currently try to write an BLEU score measurement program. I just have a very naive wonder when I don't calculate BLEU score for a * FILE, *which contains from 1,000 to 2,000 sentences, in the traditional way. I calculate the BLEU score for each one, and assign the BLEU score for the test File by the average value of every pair. As the result, I will obtain a quite low result in compare to the traditional way. So, which way is better ? Thanks and best regards, C. Hoang -- Hoàng Cường SMTNerd
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
