Hello, I'm training factored models but I get the next error:
$SCRIPTS_ROOTDIR/training/train-model.perl -scripts-root-dir $SCRIPTS_ROOTDIR -root-dir $f-$e -corpus data/tagged/train.cln -f $f -e $e -lm 0:$Gram:$WD/lm/train.$e.qblm.mm:1 -lm 1:$Gram:$WD/lm/train-pos.$ e.qblm.mm:1 --translation-factors 0-0,1 (...) Reading more sentence pairs into memory ... ERROR: Forbidden zero sentence length 0 ERROR: Forbidden zero sentence length 0 ERROR: Forbidden zero sentence length 0 ERROR: Forbidden zero sentence length 0 ERROR: Forbidden zero sentence length 0 (...) ERROR: Forbidden zero sentence length 0 ERROR: Forbidden zero sentence length 0 ERROR: Execution of: /Data/moses/tools/bin/GIZA++ -CoocurrenceFile en-ja/giza.ja-en/ja-en.cooc -c en-ja/corpus/ja-en-int-train.snt -m1 5 -m2 0 -m3 3 -m4 3 -model1dumpfrequency 1 -model4smoothfactor 0.4 -nodumps 1 -nsmooth 4 -o en-ja/giza.ja-en/ja-en -onlyaldumps 1 -p0 0.999 -s en-ja/corpus/en.vcb -t en-ja/corpus/ja.vcb died with signal 11, without coredump My tagged corpora are like this: head -n 1 data/tagged/train.?? ==> data/tagged/train.en <== known|VBN as|IN sesshu|NN (|-LRB- 1420|CD -|: 1506|CD )|-RRB- ,|, he|PRP was|VBD an|DT ink|JJ painter|NN and|CC zen|NN monk|NN active|JJ in|IN the|DT muromachi|JJ period|NN in|IN the|DT latter|JJ half|NN of|IN the|DT 15th|JJ century|NN ,|, and|CC was|VBD called|VBN a|DT master|NN painter|NN .|. ==> data/tagged/train.ja <== 雪舟|名詞 (|記号 せっしゅう|名詞 、|記号 年|名詞 (|記号 年|名詞 )|記号 年|名詞 (|記号 永|名詞 正|名詞 年|名詞 )|記号 )|記号 は|助詞 号|名詞 で|助詞 、|記号 世紀|名詞 後半|名詞 室町|名詞 時代|名詞 に|助詞 活躍|名詞 し|動詞 た|助動詞 水墨|名詞 画家|名詞 ・|記号 禅僧|名詞 で|助詞 、|記号 画聖|名詞 と|助詞 も|助詞 称え|動詞 られる|動詞 。|記号 Anyone know why it fails? Best regards -- Alex Helle
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
