Hi Vincent

On 28/07/15 10:18, Vincent Nguyen wrote:
> Thanks Barry. Answers and other questions below.
>
> Le 28/07/2015 10:25, Barry Haddow a écrit :
>> Hi Vincent
>>
>>> 2 bugs report :
>>> in the LM Corpus definition for Europarl : the $pair-extension is
>>> missing before .$output-extension
>>> in the step 5 (maybe for others too) generation of the 
>>> moses.tuned.ini.5
>>> file there is a missing ".gz" at the end of phrase-table.5
>>> in the PhraseDictionaryMemory definition.
>> These seem OK to me. For europarl, it points to the monolingual 
>> corpus, and for the phrase table the .gz is implicitly added. Did 
>> they not work for you?
>
> I am NOT talking about the [CORPUS:europarl] section but
> the [LM:europarl] I think in this section you need the $pair-extension
> same as [LM:nc] where it was fine
> Anyway :yes I had an error

Europarl releases usually contain the parallel files (e.g. 
europarl-v7.fr-en.fr) and monolingual files (e.g. europarl-v7.en).

>
> Also : when.gz is missing yes it stops and give an error message.

OK, this used to work.

>
>
>>
>>> I tried to remove the "IGNORE" for the Interpolated-LM section
>>> I am still using KenLM.
>>> BUT I get a message saying I need to define srilm-dir
>>> is SRILM mandatory to turn on the interpolated-lm with KenLM only ?
>> That's right, the interpolated LM uses some code from SRILM. You can 
>> still use KenLM to create the individual language models, and use 
>> KenLM during decoding,
>
> OK. But related question :
> If I do not interpolate, and if I keep the 2 (or more) LM in the 
> moses.ini file
> does the decoder work similarly as if I had interpolated 2 LMs ?

If you do not interpolate using EMS then both LMs will be features in 
the model - i.e. you get log-linear interpolation. See here for an early 
comparison of linear and log-linear interpolation - 
https://aclweb.org/anthology/W/W07/W07-0717.pdf - there has been other 
work since then. Note that SRILM does not do linear interpolation correctly,


cheers - Barry


-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to