Hi,

I am trying to build a factored translation system for English-Turkish
language pair, but I ran into some problems. I would be really pleased if
you can answer my questions.

1. My parallel data is formatted like:

y+POS|y|POS -----> English side (source)
x+morph|x|morph ----> Turkish side (target)

2. I am training the system with these options:

-translation-factors 0-0+1-1+2-2
-reordering-factors 0,1,2-0,1,2
-generation-factors 0-1,2+1-2+1,2+0
-decoding-steps t0,g0:t1,g1,t2,g2

So I first translate from surface to surface and generate target lemma and
morphology from surface. Another path is, translation from lemma to lemma,
generation of morph. from lemma, translation from POS to morph and
generation of surface from lemma and morph.

3. I do not know how to add the reordering into this so i just added
"-reordering-factors 0,1,2-0,1,2" with "-alignment grow-diag-final-and
-reordering msd-bidirectional-fe" options.
Is this right?

4. After the training I am trying to run testing but immediately I get a
segmentation fault.
The segmentation fault comes when it tries to translate the first sentence
while calculating options.
I feel I have done something wrong but I don't know what.

5. Another problem is if I use generation models, tuning also throws
segmentation fault. So I cannot tune with MERT. It throws the error while
loading generation table. What should I do to make it run?

I would be really pleased if you can help me.

Kind regards,
Haydar Imren
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to