hi arda
I think an email by Chris Dyer sums up the issue that it's pretty hard
to beat the phrase-based BLEU for many language pairs.
http://www.mail-archive.com/[email protected]/msg01995.html
here's Edinburgh's attempt from this years WMT10:
http://aclweb.org/anthology-new/W/W10/W10-1715.pdf
The straightforward way of adding syntax severely reduces BLEU, you have
to add something extra to get any gains. Off the top of my head, the
main ways that i've seen so far is
1. Add alternative parses, eg. forest decoding
2. Mix up the parse tree, eg. SAMT
3. Soft constrain instead of hard constraints, eg
http://www.isi.edu/~chiang/papers/acl2010-chiang.pdf
4. Occasionally ignoring syntax, eg.
http://aclweb.org/anthology-new/W/W10/W10-1761.pdf
There's loads of other ways & papers i haven't mentioned
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support