Hi Stefan,
On 07/11/2011 18:16, Stefan Dumitrescu wrote:
> Hi all,
> I am rather new to the MT domain, and I have a few theoretical questions:
> 1. In the manual, at the factored training, there is the following
> example (from language de to en):
> --translation-factors 0-0 \
> --generation-factors 0-2 \
> My question: when translating, what happens after the translation of
> surface form to surface form (T0)? How does the generation table of
> conditional probabilities p(surface_en|pos_en) affect the previous
> translation? I mean, is the generation table used during the hypothesis
> expansion of the 0-0 translation, or is the G0 generation table only
> used somehow after the translation results (after the beam search, etc.)?
With this translation + generation model, the model probability is
p(target 0,2 | source 0). A cartesian product of the candidates from the
translation and generation steps is computed BEFORE hypothesis expansion.
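
To make that concrete, here's a toy sketch in Python (made-up words and
probabilities, not the actual Moses code or data structures) of how one
fully-factored translation option is built before any search happens:

# Toy sketch: candidates from the T0-0 step are combined with
# candidates from the G0-2 step, and each combination is scored
# p(surface_en | surface_de) * p(pos_en | surface_en) BEFORE
# hypothesis expansion sees it.

# hypothetical candidate tables for one source word "haus"
translation_candidates = {          # T0-0: p(surface_en | surface_de)
    "house": 0.7,
    "home": 0.2,
    "building": 0.1,
}
generation_candidates = {           # G0-2: p(pos_en | surface_en)
    "house": {"NN": 0.9, "VB": 0.1},
    "home": {"NN": 0.8, "RB": 0.2},
    "building": {"NN": 0.6, "VBG": 0.4},
}

# cartesian product of both steps -> full translation options
options = []
for surface, p_t in translation_candidates.items():
    for pos, p_g in generation_candidates[surface].items():
        options.append(((surface, pos), p_t * p_g))

# these fully-factored options are what hypothesis expansion works with
for target, score in sorted(options, key=lambda x: -x[1]):
    print(target, score)
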
I think there's some explanation of it here:
http://homepages.inf.ed.ac.uk/pkoehn/publications/emnlp2007-factored.pdf
For a more long-winded description, try chapter 2 of my thesis:
http://www.statmt.org/~s0565741/ddd.pdf
> I am confused because the previous example in the manual was just
> --translation-factors 0-0,2, where I kind of understand that during
> hypothesis expansion, a hypothesis chain that is less probable from the
> factor 2 (POS) point of view will get a lower score (because of a lower
> probability in the POS LM). But how does the process work when adding
> a generation table?
> I'm trying to understand why I should choose to go with a t0-0 and
> then a g0-2 instead of going directly with a t0-0,2 (for example).
Good question ;) IMO, you should only do that when there are extreme
morphological differences between the two languages and t0-0,2 has
data-sparsity problems. Even then, it's not a good idea to depend on it
solely.
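
A toy illustration of the sparsity point (hypothetical tables, invented
words): the joint t0-0,2 table can only propose (surface, POS) pairs it
has actually seen for a given source word, whereas t0-0 + g0-2 can
recombine a seen surface translation with any POS the generation table
knows for that surface form:

# The joint table t0-0,2 only produces (surface, pos) pairs observed
# aligned to the source word in training; the factored route
# t0-0 + g0-2 combines a seen surface translation with any POS the
# generation table has for it. (Hypothetical data.)

joint_table = {
    ("arbeiten", ("work", "VB")),   # only pairing seen in training
}
surface_table = {"arbeiten": ["work"]}      # t0-0
generation_table = {"work": ["VB", "NN"]}   # g0-2

def joint_options(src):
    return [tgt for (s, tgt) in joint_table if s == src]

def factored_options(src):
    return [(surf, pos)
            for surf in surface_table.get(src, [])
            for pos in generation_table.get(surf, [])]

print(joint_options("arbeiten"))     # [('work', 'VB')] only
print(factored_options("arbeiten"))  # [('work', 'VB'), ('work', 'NN')]
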
> Another example: for a chain of t1-1 g1-2 t3-2 g1,2-0, how do
> the different translations and generations interact? Are they
> sequential, parallel? Is there some resource/book/article where they
> are explained in more detail?
You'll get into big trouble doing that. The steps in a decoding path are
applied in sequence, and the cartesian product will blow up the decoder
during decoding unless you severely prune it. Then you'll have issues
with search errors.
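
Back-of-the-envelope, with purely made-up per-step candidate counts:

# How the cartesian product grows across chained mapping steps
# (hypothetical candidate counts, not measured on any real model).
from functools import reduce

candidates_per_step = {
    "t1-1":   20,  # translation candidates per source phrase
    "g1-2":    5,  # POS candidates per target lemma
    "t3-2":   20,
    "g1,2-0": 10,
}

total = reduce(lambda a, b: a * b, candidates_per_step.values(), 1)
print("options per source span before pruning:", total)  # 20000

# Pruning (e.g. moses.ini's ttable-limit caps the candidates kept
# per table) keeps this tractable, but aggressive pruning is exactly
# where the search errors come from.
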
> 2. For multiple decoding paths like:
> --translation-factors 0-0,2+1-0,2 \
> --decoding-steps t0:t1
> how is the best translation chosen? If we consider that both the
> surface forms and the lemmas are found in the phrase tables (so no
> backoff is necessary), and thus each decoding path outputs a valid
> answer, how is the best answer chosen? Is it always the translation
> from the first table?
Hypothesis expansion is performed with translation rules from both paths,
and the best hypothesis is simply the one with the highest (weighted)
probability.
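
In other words, something like this rough sketch (invented scores, and
not how Moses actually stores its translation options): candidates from
both paths go into one pool, and the search scores them uniformly, with
no built-in preference for the first table:

# Options from both decoding paths are pooled; the search compares
# them on the weighted model score alone. (Hypothetical log scores
# for the source word "häuser".)

path_t0 = {"houses": -1.2, "homes": -1.9}      # surface-based path
path_t1 = {"houses": -1.5, "dwellings": -2.3}  # lemma-based path

pooled = []
for path_name, table in [("t0", path_t0), ("t1", path_t1)]:
    for target, score in table.items():
        pooled.append((score, target, path_name))

best = max(pooled)   # tuple comparison: highest score wins
print(best)          # (-1.2, 'houses', 't0')
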
> Thank you for taking the time to read this far :) and thanks again for
> answering, because I don't really know where else to ask this.
> Stefan
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support