Kenneth Heafield wrote: > kenlm's query tool implicitly places <s> at the beginning. It doesn't > appear in the output, but you can see the effect because the n-gram > length after the is 2, not 1.
Does this happen when kenlm is called from Moses as well? There seem to me to be many reasons not to do this: How do you know whether full sentences are being translated? What if the translation model already includes sentence boundary tokens? (See my recent message about why this might be desirable) But most importantly: How do you know whether the language model was trained that way? - John Burger MITRE _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
