hi nakul

you should follow this tutorial and learn a bit more about the various stages of the smt processing and how to run the scripts:
    http://www.statmt.org/moses_steps.html

On 03/02/2011 10:21, nakul sharma wrote:

Hi all,



i have installed latest version of moses from sourceforge.net <http://sourceforge.net>.

i am just clarifying, do we need to place the corpus of both the languages (both source and target) as input for clean-corpus-n.perl ? i executed script for both these lang and got following messages:-

For Source:-

./clean-corpus-n.perl 200EnglishSens en hi 200EnglishSens.clean 1 50
clean-corpus.perl: processing 200EnglishSens.en & .hi to 200EnglishSens.clean, cutoff 1-50

Input sentences: 203  Output sentences:  187

For Target :-

./clean-corpus-n.perl 200HindiSens hi en 200HindiSens.clean 1 50
clean-corpus.perl: processing 200HindiSens.hi & .en to 200HindiSens.clean, cutoff 1-50
Use of uninitialized value $opn in open at ./clean-corpus-n.perl line 46.
Use of uninitialized value $opn in concatenation (.) or string at ./clean-corpus-n.perl line 46.
Can't open '' at ./clean-corpus-n.perl line 46

So the problem is again seems to be with the target lang. How to solve this problem of broken UTF as it was pointed out Tom.

--
Thanks & Regards,
nakul.


_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to