Hi, I am trying to build a factored translation system for the English-Turkish language pair, but I have run into some problems. I would be grateful if you could answer my questions.

1. My parallel data is formatted as follows:

   y+POS|y|POS      (English side, source)
   x+morph|x|morph  (Turkish side, target)

2. I am training the system with these options:

   -translation-factors 0-0+1-1+2-2
   -reordering-factors 0,1,2-0,1,2
   -generation-factors 0-1,2+1-2+1,2+0
   -decoding-steps t0,g0:t1,g1,t2,g2

   So I first translate from surface to surface and generate the target lemma and morphology from the surface form. The other path translates lemma to lemma, generates morphology from the lemma, translates POS to morphology, and generates the surface form from lemma and morphology.

3. I do not know how to add reordering to this setup, so I just added "-reordering-factors 0,1,2-0,1,2" along with the "-alignment grow-diag-final-and -reordering msd-bidirectional-fe" options. Is this right?

4. After training, I try to run testing but immediately get a segmentation fault. It occurs when the decoder tries to translate the first sentence, while calculating options. I feel I have done something wrong but I don't know what.

5. Another problem: if I use generation models, tuning also throws a segmentation fault, so I cannot tune with MERT. The error is thrown while loading the generation table. What should I do to make it run?

I would be really grateful for any help.

Kind regards,
Haydar Imren
