Hello,

I'm training factored models but I get the next error:

$SCRIPTS_ROOTDIR/training/train-model.perl -scripts-root-dir
$SCRIPTS_ROOTDIR -root-dir $f-$e -corpus data/tagged/train.cln -f $f -e $e
-lm 0:$Gram:$WD/lm/train.$e.qblm.mm:1 -lm 1:$Gram:$WD/lm/train-pos.$
e.qblm.mm:1 --translation-factors 0-0,1
(...)
Reading more sentence pairs into memory ...
ERROR: Forbidden zero sentence length 0
ERROR: Forbidden zero sentence length 0
ERROR: Forbidden zero sentence length 0
ERROR: Forbidden zero sentence length 0
ERROR: Forbidden zero sentence length 0
(...)
ERROR: Forbidden zero sentence length 0
ERROR: Forbidden zero sentence length 0
ERROR: Execution of: /Data/moses/tools/bin/GIZA++  -CoocurrenceFile
en-ja/giza.ja-en/ja-en.cooc -c en-ja/corpus/ja-en-int-train.snt -m1 5 -m2 0
-m3 3 -m4 3 -model1dumpfrequency 1 -model4smoothfactor 0.4 -nodumps 1
-nsmooth 4 -o en-ja/giza.ja-en/ja-en -onlyaldumps 1 -p0 0.999 -s
en-ja/corpus/en.vcb -t en-ja/corpus/ja.vcb
 died with signal 11, without coredump

My tagged corpora are like this:
head -n 1 data/tagged/train.??
==> data/tagged/train.en <==
known|VBN as|IN sesshu|NN (|-LRB- 1420|CD -|: 1506|CD )|-RRB- ,|, he|PRP
was|VBD an|DT ink|JJ painter|NN and|CC zen|NN monk|NN active|JJ in|IN the|DT
muromachi|JJ period|NN in|IN the|DT latter|JJ half|NN of|IN the|DT 15th|JJ
century|NN ,|, and|CC was|VBD called|VBN a|DT master|NN painter|NN .|.

==> data/tagged/train.ja <==
雪舟|名詞 (|記号 せっしゅう|名詞 、|記号 年|名詞 (|記号 年|名詞 )|記号 年|名詞 (|記号 永|名詞 正|名詞 年|名詞 )|記号
)|記号 は|助詞 号|名詞 で|助詞 、|記号 世紀|名詞 後半|名詞 室町|名詞 時代|名詞 に|助詞 活躍|名詞 し|動詞 た|助動詞 水墨|名詞
画家|名詞 ・|記号 禅僧|名詞 で|助詞 、|記号 画聖|名詞 と|助詞 も|助詞 称え|動詞 られる|動詞 。|記号

Anyone know why it fails?

Best regards
-- 
Alex Helle
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to