Hi,

Ok, I understand this approach now. I generally like the idea of backoffs and 
it give more opportunity to further tune the engines. And it's even more 
powerful when multiple LMs goes in place.  Just I think the MERT might have 
difficulties to handle such huge numbers of parameters to be tuned (especially 
for Czech where we hit the sparse issues ).

Tomas

-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of Philipp Koehn
Sent: Monday, November 3, 2014 11:47 PM
To: Tomas Fulajtar
Cc: moses-support ([email protected])
Subject: Re: [Moses-support] Combine models with backoff

Hi,

you should just add multiple language models - tuning will find proper weights 
for them. There is no way to do backoff with language models, just 
interpolation.

-phi

On Mon, Oct 27, 2014 at 12:49 PM, Tomas Fulajtar <[email protected]> wrote:
> Hi all,
>
>
>
> I would like to combine two phrase based engines. One, smaller is 
> trained on desired domain data, but with limited corpus size. The 
> second is the legacy one with huge phrase table and LM, but with kind 
> of older/more obsolete terminology.  Thus the idea is to combine both 
> to preserve domain/language style from the first engine, but also 
> reduce OOV with application of the second engine.
>
>
>
> I think what I am looking for is the Back-off model - use the small 
> one as a preferred one , and then the second in case of phrases not 
> found. I have setup such a config in accordance with 
> http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#ntoc25,.
>
>
>
> [feature]
>
> PhraseDictionaryCompact name=A
>
> PhraseDictionaryCompact name=BackOff
>
>
>
> [mapping]
>
> 0 T 0
>
> 1 T 1
>
>
>
> [decoding-graph-backoff]
>
> 0
>
> 1
>
>
>
> [weight]
>
> A = 0 0 0 0
>
> BackOff = 0 0 0 0
>
>
>
> And it seems to work (weights were tuned afterwards with mert).
>
>
>
> I have also read the
> http://comments.gmane.org/gmane.comp.nlp.moses.user/10099. However 
> there is not mentioned how the LMs  combination could be managed. I 
> can add both to ini file and perform the weights tuning, or is it 
> better to set the weights manually? I believe that phrase table 
> backoff would ensure the preference of model A terminology, while 
> combination of both LMs would make the translation smoother as it can benefit 
> from the second, bigger LM.
>
>
>
> Could you please correct my assumptions?  I hope the explanation does 
> make some sense…
>
>
>
> Thank you very much,
>
>
>
> Tomas
>
>
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
>

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to