I finally went through the whole Baseline process with the KenLM model. The results are mixed, so what would be the best practices from here?
1) I saw online that a number of corpora are available from the European Union. Should these be used to train both the translation system AND the language model, or just one of the two?
2) Is there a benchmark comparing the different language models (KenLM, IRSTLM, ...)? That is, is there a big difference in the observed results, and is it worth trying several?
3) I read an article mentioning that the results after tuning were not as good as before. Does this make any sense?

Thanks.
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support