Hi Yared,

It seems that you used the -ngram-count switch, which only works with
SRI LMs. Thanks to Jehan Pages, you can use -lm=IRSTLM and
-build-lm=/path/to/build-lm.sh to train a recasing model using IRSTLM.
The prerequisite for this is a "proper-cased/mixed-cased" IRST LM containing
<s> elements. The -corpus switch should point to your cased data. John
Burger gives a nice general overview of the recasing process:
http://www.mail-archive.com/[email protected]/msg00696.html
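
As a rough sketch, reusing the paths from your command quoted below, the call
might look like this. The corpus name is an assumption on my part: whatever
you pass to -corpus has to be cased text, not lowercased. The script only
prints the command so you can check it before running it.

```shell
#!/bin/sh
# Sketch only: all paths are copied from the command quoted below
# and may need adjusting on your machine.
SCRIPTS=/home/admin1/mose/moses-scripts/scripts-20120409-0748
BUILD_LM=mose/bin/irstlm/bin/build-lm.sh    # IRSTLM build script
CORPUS=worked/corpus/news-commentary.tok.en # assumed to be cased text

# -lm=IRSTLM and -build-lm take the place of -ngram-count.
CMD="$SCRIPTS/recaser/train-recaser.perl \
 -train-script $SCRIPTS/training/train-model.perl \
 -lm=IRSTLM -build-lm=$BUILD_LM \
 -corpus $CORPUS -dir /home/admin1/worked/recaser \
 -scripts-root-dir $SCRIPTS"

echo "$CMD"     # dry run: inspect the assembled command
# eval "$CMD"   # uncomment to actually start the training
```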

Of course, you might want to only evaluate lowercased data - that's up to
your approach. Then there is no need to train a recasing model.
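
For illustration, lowercasing both the reference and the system output before
scoring could look roughly like this. The file names and the sample sentences
are made up, and plain tr is only safe for ASCII text; for other languages a
Unicode-aware lowercasing script is the better choice.

```shell
#!/bin/sh
# Hypothetical sample files; replace them with your reference and system output.
printf 'The House is Small .\n' > ref.en
printf 'The house is small .\n' > output.en

# Lowercase both sides so that case differences do not count as errors.
tr '[:upper:]' '[:lower:]' < ref.en > ref.lc.en
tr '[:upper:]' '[:lower:]' < output.en > output.lc.en

# Then score the lowercased files, e.g. with multi-bleu.perl from the Moses scripts:
# multi-bleu.perl ref.lc.en < output.lc.en
```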

Hope this helps.

Best,
Daniel

-----Original Message-----
From: Yared Mekuria [mailto:[email protected]]
Sent: 27 April 2012 07:52
To: [email protected]
Subject: To ask for steps for Evaluation of MT system when IRSTLM used.

Hello Daniel,
I am at the evaluation stage of my MT system, and I don't understand how
evaluation is performed when an IRSTLM language model is used.
I used the following command to train the recaser:

/home/admin1/mose/moses-scripts/scripts-20120409-0748/recaser/train-recaser.perl
-train-script
/home/admin1/mose/moses-scripts/scripts-20120409-0748/training/train-model.perl
-ngram-count mose/bin/irstlm/bin/build-lm.sh
-corpus worked/corpus/news-commentary.tok.en
-dir /home/admin1/worked/recaser
-scripts-root-dir
/home/admin1/mose/moses-scripts/scripts-20120409-0748

and then I got this error:

ERROR: Language model file not found or empty:
/home/admin1/worked/recaser/cased.irstlm.gz at
/home/admin1/mose/moses-scripts/scripts-20120409-0748/training/train-model.perl
line 324.

I don't have cased data. Is it necessary to use cased data for
evaluation when an IRST LM is used?

Please advise me on this.

Best regards,
Yared


_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
