Hi Yared, It seems to be that you used the -n-gram-count switch which only works with SRI LMs. Thanks to Jehan Pages, you can use -lm=IRSTLM and -build-lm=/path/to/build-lm.sh to train a recasing model using IRSTLM. Prerequisite for this is the a "proper-cased/mixed-cased" IRST LM containing <s> elements. The -corpus switch should point to the your cased data. John Burger gives a nice general overview for the recasing process: http://www.mail-archive.com/[email protected]/msg00696.html
Of course, you might want to only evaluate lowercased data - that's up to your approach. Then there is no need to train a recasing model. Hope this helps. Best, Daniel -----Ursprüngliche Nachricht----- Von: Yared Mekuria [mailto:[email protected]] Gesendet: 27 April 2012 07:52 An: [email protected] Betreff: To ask for steps for Evaluation of MT system when IRSTLM used. Hello Daniel, I am on the evaluation part of the MT system, and I don't understand how evaluation is performed when IRSTLM language model is used. I use the The following command to train the recaser /home/admin1/mose/moses-scripts/scripts-20120409-0748/recaser/train-recaser. perl -train-script /home/admin1/mose/moses-scripts/scripts-20120409-0748/training/train-model.p erl -ngram-count mose/bin/irstlm/bin/build-lm.sh -corpus worked/corpus/news-commentary.tok.en -dir /home/admin1/worked/recaser -scripts-root-dir /home/admin1/mose/moses-scripts/scripts-20120409-0748 and the I got this error, ERROR: Language model file not found or empty: /home/admin1/worked/recaser/cased.irstlm.gz at /home/admin1/mose/moses-scripts/scripts-20120409-0748/training/train-model.p erl line 324. I don't have cased data, and is it necessary to use cased data for evaluation when IRST LM used? Please suggest me on it. Yared. Best regards. _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
