I have just finished truecasing arabic-english corpora fromthe multiUN
parallel text
corpora and now I was at the cleaning step but I had an error when typing
this:
~/mosesdecoder/scripts/training/clean-corpus-n.perl
~/corpus/multiUN.ar-en.true ar en ~/corpus/multiUN.ar-en.clean 1 80

The result was:
clean-corpus.perl: processing /home/tjr/corpus/multiUN.ar-en.true.ar & .en
to /home/tjr/corpus/multiUN.ar-en.clean, cutoff 1-80
Input sentences: 0  Output sentences:  0

Why would this be? Can it be because of the 'ar' symbol as it is undefined
in the Moses system or sth else?
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to