hi , professor Now I want to translate the German to English, I first run the baseline system .Then i want to use the transliteration-module to handle the UNK: I do as the website(http://www.statmt.org/moses/?n=Advanced.OOVs) says: first I run this scripts: as this (https://docs.google.com/document/d/1G9RjczZXWGHU6byJFORf6uToItph1jU_piL53wQhGXg/edit) " /nobackup/f1/asr/wulong/MOSESHEJUNQIN/Moses/mosesdecoder/scripts/Transliteration/train-transliteration-module.pl --corpus-f /nobackup/f1/asr/wulong/MOSESHEJUNQIN/Moses/mosesdecoder/HEJUNQIN/corpus/train.clean.de --corpus-e /nobackup/f1/asr/wulong/MOSESHEJUNQIN/Moses/mosesdecoder/HEJUNQIN/corpus/train.clean.en --alignment /nobackup/f1/asr/wulong/MOSESHEJUNQIN/Moses/mosesdecoder/HEJUNQIN/working/train/model/aligned.grow-diag-final-and --moses-src-dir /nobackup/f1/asr/wulong/MOSESHEJUNQIN/Moses/mosesdecoder/ --external-bin-dir /nobackup/f1/asr/wulong/MOSESHEJUNQIN/Moses/mosesdecoder/mgizacombinedtools/ --input-extension de --output-extension en --srilm-dir /nobackup/f1/asr/wulong/srilm-1.7.2/bin/i686-m64/ --out-dir translit-en-hi > translit.out 2>&1 " I use all Absolute paths. But there erros in the mail attachments . It says " mosesdecoder/mgizacombinedtools/mkcls -c50 -n2 -p/working/translit-en-hi/training/corpus.de"Then I check the file and find that ./working/translit-en-hi/training/corpus.de is empty .then I check "/working/translit-en-hi/"and I find that "1-1.de-en.cleaned?? 1-1.de-en.mined-pairs?? 1-1.de-en.pair-probs" are also empty which lead to corpus.de to be empty . Then I check the train-transliteration-module.pl and find the reason why "/working/translit-en-hi/"and I find that "1-1.de-en.cleaned?? 1-1.de-en.mined-pairs?? 1-1.de-en.pair-probs" are empty is the line 268 " `$MOSES_SRC_DIR/scripts/Transliteration/clean.pl $OUT_DIR/1-1.$inp_ext-$op_ext > $OUT_DIR/1-1.$inp_ext-$op_ext.cleaned`;" then I check $MOSES_SRC_DIR/scripts/Transliteration/clean.pl and find that all the print...... statements have become #print ..... so it cannot output to $OUT_DIR/1-1.$inp_ext-$op_ext.cleaned. so I think if there is a bug in the clean.pl . Thank you very much !
clean.pl
Description: Binary data
translit.out
Description: Binary data_______________________________________________ Moses-support mailing list Moses-support@mit.edu http://mailman.mit.edu/mailman/listinfo/moses-support