Hi, if the translation model also maps into individual Chinese characters as words in the translation table, then this is works out of the box. If you use different tokenization for translation model and language model, you need to add some code in the language model scoring, which should not too much work either.
-phi On Thu, May 21, 2009 at 10:18 AM, kpzhang <[email protected]> wrote: > Hello everyone, > I set up an English to Chinese translation system, in which I use > different tokenization schemes on chinese sentences in parallel training > text and chinese sentences for language model traing. Can I use a > character-based chinese language model (instead of word-based model) in > decoding stage? Thanks in advance. > _______________________________________________ > Moses-support mailing list > [email protected] > http://mailman.mit.edu/mailman/listinfo/moses-support > > _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
