Hi all,

I am trying to do factored translation.

I have an input language (en) with these factors:

  0. word
  1. lemma
  2. pos
  3. rel

I have a target language (nl) with these factors:

  0. word
  1. lemma
  2. pos
  3. postag
  4. rel

TRAINING

I want to use word(0), lemma(1) and pos(2) from source language (en),
and word(0), lemma(1), postag(3) from target language (nl).

I run this command:

nohup /net/aps/64/opt/moses/mosesdecoder/scripts/training/train-model.perl \
  -cores 12 \
  -root-dir train \
  -corpus /net/aistaff/kleiweg/moses/europarl/factored/corpus/train-small.clean 
\
  -f en -e nl -alignment grow-diag-final-and -reordering msd-bidirectional-fe \
  --lm 0:3:/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-word:8 \
  --lm 3:3:/net/aistaff/kleiweg/moses/europarl/factored/lm/blm.nl-postag:8 \
  --translation-factors 1-1+2-3+0-0,3 \
  --generation-factors 1-3+1,3-0 \
  --decoding-steps t0,g0,t1,g1:t2 \
  -external-bin-dir /net/aps/64/opt/moses/mgizapp/bin \
  -mgiza \
  -mgiza-cpus=12 \
   > training.out \
   2> training.err &

Did I specify the right translation-factors, generation-factors
and decoding-steps?

Did I specify the right language models?

Should I specify --alignment-factors, like this:

  --alignment-factors 1,2-1,3

Training ends with these lines on stderr:

(9) create moses.ini @ Sat Mar 15 17:19:20 CET 2014
Use of uninitialized value $___REORDERING_FACTORS in split at 
/net/aps/64/opt/moses/mosesdecoder/scripts/training/train-model.perl line 1997.

Should I specify --reordering-factors ? What does this do? What
do I specify? All I can find in the manual is this:

"Reordering tables can be trained with --reordering-factors.
Syntax is the same as for translation factors."

That isn't very helpful.

The model that was created seems to be working (even though step
9 of training gave an error), but it's slow, 10 minutes or more
for a sentence, where unfactored translation only took a few seconds. 

TUNING

What type of data should I give mert-moses.pl to work with? I
noticed the source language should only have factors 0, 1 and 2,
or I get an error. What factors should there be in the target
language?



-- 
Peter Kleiweg
http://pkleiweg.home.xs4all.nl/
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to