Hi everybody, when I try to tokenize my Sinhala dataset, I get this warning message: "WARNING: No known abbreviations for language 'si', attempting fall-back to English version..."
Some of the letters in my text have also changed slightly in the output. Is there any way to tokenize Sinhala data with tokenizer.perl? I'm looking forward to your help. Thanks in advance! Tharaka
_______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
