Hi all,
 
I am running experiments in which my BLEU scores come out a little lower 
than the BLEU scores published in recent papers. I am trying to figure out 
why, but I am running out of ideas. Any help would be appreciated. Below is 
a list of points that might be relevant.
 
I am doing FR -> EN translation on WMT08 Europarl v3b.
 
Koehn (2003) reports a BLEU score of 0.3389 with a 3-gram language model.
Mylonakis (2008) reports a BLEU score of 0.3303 with a 5-gram language model.
 
My model gets a BLEU score of 0.3145 with a 4-gram language model and 0.3178 
with a 5-gram LM.
 
I am using a Moses build from 8-5-2008 and following the standard baseline 
instructions found under the WMT08 baseline system.
 
Questions:
1) Is it possible that my Moses build is faulty? Should I download a fresh 
version from svn?
2) Does choosing a different n-gram length during training affect the 
reordering model?
My moses.ini contains:
 
============================
# limit on how many phrase translations e for each phrase f are loaded
# 0 = all elements loaded
[ttable-limit]
20
0
# distortion (reordering) files
[distortion-file]
0-0 msd-bidirectional-fe 6 /scratch/res_10/mosespl5pc1.0/model/reordering-table.msd-bidirectional-fe.0.5.0-0.gz
===================================
 
3) Why are there two lines under [ttable-limit]? Should there be just one?
4) What do the 6 and the 5 mean in the reordering table line?
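
In case it helps anyone reproduce what I am looking at, here is a rough 
sketch of how the two sections above can be split into their individual 
values. It is not part of Moses: parse_moses_ini is a hypothetical helper 
name, and the sample assumes the reordering-table path is a single line that 
only got wrapped by my mail client. It only lists the values; it does not 
tell you what they mean.

```python
from collections import OrderedDict


def parse_moses_ini(text):
    """Group the non-comment, non-blank lines of a moses.ini-style
    file under the [section] header that precedes them.

    Hypothetical helper for inspection only; not a Moses API.
    """
    sections = OrderedDict()
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        if line.startswith("[") and line.endswith("]"):
            current = line[1:-1]
            sections[current] = []
        elif current is not None:
            sections[current].append(line)
    return sections


# Excerpt from the message, with the wrapped path joined (assumption).
SAMPLE = """\
# limit on how many phrase translations e for each phrase f are loaded
# 0 = all elements loaded
[ttable-limit]
20
0
# distortion (reordering) files
[distortion-file]
0-0 msd-bidirectional-fe 6 /scratch/res_10/mosespl5pc1.0/model/reordering-table.msd-bidirectional-fe.0.5.0-0.gz
"""

sections = parse_moses_ini(SAMPLE)
for name, values in sections.items():
    print(name, values)

# Whitespace-separated fields of the single [distortion-file] entry.
fields = sections["distortion-file"][0].split()
print(fields)
```

Run on the excerpt, this shows two separate values under [ttable-limit] and 
four whitespace-separated fields in the [distortion-file] entry, the third 
of which is the 6 I am asking about.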
 
regards,
Sanne
 
 
 


