It's OK to put <s> and </s> into the language model.

However, don't put them into the phrase table for the phrase-based decoder. It's OK to put them into the phrase table for the chart decoder, which is what the SVN update refers to.
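
As a rough illustration of the distinction above, here is a minimal Python sketch that wraps each line in <s> and </s> for the language model corpus and strips the markers from text meant for phrase-based phrase-table training. The function names add_boundaries and strip_boundaries are hypothetical helpers, not part of Moses. Some language model toolkits add the markers themselves, so check yours before adding them by hand.

```python
BOS, EOS = "<s>", "</s>"  # sentence boundary markers


def add_boundaries(line):
    """Wrap one sentence in <s> ... </s> for the language model corpus.

    Hypothetical helper: markers already in 1st/last place are not duplicated.
    """
    tokens = line.split()
    if not tokens:
        return ""  # leave empty lines empty
    if tokens[0] != BOS:
        tokens.insert(0, BOS)
    if tokens[-1] != EOS:
        tokens.append(EOS)
    return " ".join(tokens)


def strip_boundaries(line):
    """Remove <s> and </s> from text used for phrase-based phrase-table training."""
    return " ".join(t for t in line.split() if t not in (BOS, EOS))


if __name__ == "__main__":
    print(add_boundaries("the house is small"))
    print(strip_boundaries("<s> the house is small </s>"))
```

Both helpers work on whitespace-tokenised text, one sentence per line.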


On 09/06/2011 11:53, Tom Hoar wrote:
Hieu,

Your most recent SVN update comment says:
     "dont process unknown words for 1st or last place. They're the <s> & </s> and
should only be handled by the glue rules"

We generate language model text corpus files with the <s> and </s> tags on each
line, but we do not add them in 1st/last place. Should they be there? Is it a problem if
they're not there?
Thanks,
Tom
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
