Hi all I'm looking at the non-breaking prefix files for the tokenizer in the directory /scripts/tokenizer/nonbreaking_prefixes
i don't quite know what languages the 2-letter file suffixes stand for. I can hazzard a guest but it's prob be better to be sure and write it down somewhere. Can anybody enlighten me? Specifically ca el is pl ro sk sl su
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
