I have just finished truecasing arabic-english corpora fromthe multiUN parallel text corpora and now I was at the cleaning step but I had an error when typing this: ~/mosesdecoder/scripts/training/clean-corpus-n.perl ~/corpus/multiUN.ar-en.true ar en ~/corpus/multiUN.ar-en.clean 1 80
The result was: clean-corpus.perl: processing /home/tjr/corpus/multiUN.ar-en.true.ar & .en to /home/tjr/corpus/multiUN.ar-en.clean, cutoff 1-80 Input sentences: 0 Output sentences: 0 Why would this be? Can it be because of the 'ar' symbol as it is undefined in the Moses system or sth else?
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
