Lane Schwartz <dowobeha@...> writes: > > I have a number of distinct monolingual corpora. I've been training them as separate LMs. I now want to run a variant where they are all concatenated together, and then trained as a single LM. The EMS walkthrough says this should be possible (http://www.statmt.org/moses/?n=FactoredTraining.EMS#ntoc19), but doesn't give the requisite syntax. What is the EMS syntax to do this? > > Thanks, > Lane
Hi Lane, check commit 27fd45d - it implements basic support for concatenation of LM corpora in EMS. Feel free to tinker with it to make it more configurable - being able to override which corpora to concatenate would be nice, for example. best wishes, Rico _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
