On Tue, Feb 24, 2015 at 10:03 PM, Matthias Huck  wrote:

Please try to investigate what's going on (if you have the time).

 So far, I have been able to obtain a list of 50 unique hypotheses using
either of these two methods in v3.0:

1.  Manually adding the "distinct" option to the -n-best-list parameter
when calling moses.

Note that my version of mert-moses.pl does not contain the "distinct"
keywords. I could not understand why keeping them in the first place did
not produce a unique n-best list, though.

2. Manually changing the PhrasePenalty parameter to exp(1)   (=2.718)
Comparing the test.filtered.ini.1 file and the phrase table to those of the
same experiment I had done back in version 0.x, I noticed that the phrase
penalty value has been removed from the phrase table and included in the
ini file as a standard feature function. For my example, this value was
computed to be -0.999959. I changed this value to 2.718 and rerun the moses
command (without even using the "distinct" option), which produced 50
unique hypotheses.

I must also add that the n-best lists generated by these two methods are
not exactly the same. For my application, I find the hypotheses output by
method2 more useful.

Machine translation is not my area of specialization, so I do not know
whether setting the phrase penalty to a fixed value is a bad practice. But
at least, it works for me. Is there a way to set this value in the
configuration file so that I do not have to change the ini file each time I
run the experiment?

Thanks,

ED
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to