Hello everyone, I try to run the script for my two tokenizer.perl development file. I'm having a problem when running, but I do not understand why. A message appears:
/home/Bureau/moses/moses/scripts/tokenizer$ ./tokenizer.perl -l fr < /home/Bureau/work/test-fr.fr > /home/Bureau/work/input.tok Tokenizer Version 1.0 Language: fr WARNING: No known abbreviations for language 'fr', attempting fall-back to English version... utf8 "\xE9" does not map to Unicode at ./tokenizer.perl line 47, <STDIN> line 1. Malformed UTF-8 character (fatal) at ./tokenizer.perl line 67, <STDIN> line 1. Thank you very much. Sincerely Cyrine
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
