Re: [Moses-support] Training scripts

Hieu Hoang Wed, 02 Feb 2011 19:38:01 -0800

hi nakul

you should follow this tutorial and learn a bit more about the various stages of the SMT processing and how to run the scripts:

    http://www.statmt.org/moses_steps.html
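
For the cleaning step in particular: as the log lines quoted below show ("processing 200EnglishSens.en & .hi"), clean-corpus-n.perl takes one corpus stem plus the two language extensions and reads stem.l1 and stem.l2 together, so it is run once per language pair rather than once per language, and the two files must be line-aligned. The "Can't open ''" message in the second run is consistent with one of 200HindiSens.hi / 200HindiSens.en not being found, though that is an inference from the log and not something confirmed in this thread. A minimal Python sketch of a check to run beforehand; check_pair is a hypothetical helper, not part of Moses, and it ignores gzipped input:

```python
import os


def check_pair(stem, l1, l2):
    """Return a list of problems found for the files stem.l1 and stem.l2.

    Hypothetical helper for illustration only; not a Moses script.
    """
    problems = []
    paths = [f"{stem}.{l1}", f"{stem}.{l2}"]

    # Both sides of the parallel corpus must exist under the same stem.
    for path in paths:
        if not os.path.isfile(path):
            problems.append(f"missing file: {path}")
    if problems:
        return problems

    # A parallel corpus has one sentence per line, aligned across both files,
    # so the line counts must match.
    counts = []
    for path in paths:
        with open(path, "rb") as handle:
            counts.append(sum(1 for _ in handle))
    if counts[0] != counts[1]:
        problems.append(
            f"line counts differ: {paths[0]} has {counts[0]}, "
            f"{paths[1]} has {counts[1]}"
        )
    return problems
```

With the file names from the quoted message, this would be called as check_pair("200EnglishSens", "en", "hi"); an empty list means both files are present with matching line counts.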


On 03/02/2011 10:21, nakul sharma wrote:

Hi all,
i have installed the latest version of moses from sourceforge.net.
i am just clarifying: do we need to place the corpus of both the languages (both source and target) as input for clean-corpus-n.perl? i executed the script for both these languages and got the following messages:-
For Source:-

./clean-corpus-n.perl 200EnglishSens en hi 200EnglishSens.clean 1 50
clean-corpus.perl: processing 200EnglishSens.en & .hi to 200EnglishSens.clean, cutoff 1-50
Input sentences: 203  Output sentences:  187

For Target :-

./clean-corpus-n.perl 200HindiSens hi en 200HindiSens.clean 1 50
clean-corpus.perl: processing 200HindiSens.hi & .en to 200HindiSens.clean, cutoff 1-50
Use of uninitialized value $opn in open at ./clean-corpus-n.perl line 46.
Use of uninitialized value $opn in concatenation (.) or string at ./clean-corpus-n.perl line 46.
Can't open '' at ./clean-corpus-n.perl line 46
So the problem again seems to be with the target language. How can I solve this problem of broken UTF, as was pointed out by Tom?
--
Thanks & Regards,
nakul.
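
On the broken UTF question in the message above: one way to locate invalid byte sequences is to decode each line strictly and note the lines that fail. A minimal Python sketch, assuming the corpus files are meant to be UTF-8; find_invalid_utf8 is a hypothetical helper, not a Moses script:

```python
def find_invalid_utf8(path):
    """Return the 1-based line numbers in `path` that are not valid UTF-8.

    Hypothetical helper for illustration only; not a Moses script.
    """
    bad = []
    # Read raw bytes so that decoding errors surface per line instead of
    # aborting the whole read. Valid UTF-8 multi-byte sequences never contain
    # the newline byte, so splitting on newlines does not cut them apart.
    with open(path, "rb") as handle:
        for number, raw in enumerate(handle, start=1):
            try:
                raw.decode("utf-8")
            except UnicodeDecodeError:
                bad.append(number)
    return bad
```

Any line numbers returned point at sentences to repair or drop in both language files, so that the two sides stay aligned.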


_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
