On Tue, Feb 24, 2015 at 10:03 PM, Matthias Huck wrote:
> Please try to investigate what's going on (if you have the time).

So far, I have been able to obtain a list of 50 unique hypotheses in v3.0 using either of these two methods:

1. Manually adding the "distinct" option to the -n-best-list parameter when calling moses. Note that my version of mert-moses.pl does not contain the "distinct" keywords. I still do not understand, though, why keeping them in the first place did not produce a unique n-best list.

2. Manually changing the PhrasePenalty parameter to exp(1) (approximately 2.718).

Comparing the test.filtered.ini.1 file and the phrase table with those of the same experiment I had run back in version 0.x, I noticed that the phrase penalty value has been removed from the phrase table and is now included in the ini file as a standard feature function. For my example, this value was computed to be -0.999959. I changed it to 2.718 and reran the moses command (without even using the "distinct" option), which produced 50 unique hypotheses.

I must also add that the n-best lists generated by these two methods are not exactly the same. For my application, I find the hypotheses output by method 2 more useful.

Machine translation is not my area of specialization, so I do not know whether setting the phrase penalty to a fixed value is bad practice. But at least it works for me. Is there a way to set this value in the configuration file so that I do not have to change the ini file each time I run the experiment?

Thanks,
ED
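
For reference, the manual ini edit described in method 2 could be scripted rather than done by hand. The following is only a minimal sketch, assuming the weight appears in the ini file as a line of the form "PhrasePenalty0= <value>". The helper name set_weight and the sample ini contents (including the WordPenalty0 value) are hypothetical and are not part of Moses.

```python
import re


def set_weight(ini_text: str, feature: str, value: float) -> str:
    """Return ini_text with the weight line for `feature` set to `value`.

    Assumes weight lines look like 'PhrasePenalty0= -0.999959'
    (an assumption about the ini layout, not a Moses API).
    """
    # Match only lines that start with '<feature>=' and keep the original
    # spacing after the '=' sign; [ \t]* avoids swallowing a newline.
    pattern = re.compile(
        rf"^({re.escape(feature)}=[ \t]*).*$", flags=re.MULTILINE
    )
    new_text, count = pattern.subn(lambda m: f"{m.group(1)}{value}", ini_text)
    if count == 0:
        raise ValueError(f"no weight line found for {feature}")
    return new_text


# Hypothetical ini fragment, made up for illustration only.
sample = """[feature]
PhrasePenalty
[weight]
PhrasePenalty0= -0.999959
WordPenalty0= -1
"""

updated = set_weight(sample, "PhrasePenalty0", 2.718)
print(updated)
```

Run against a copy of the filtered ini file, this would leave every other line untouched and change only the phrase penalty weight.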
