krinsang opened a new issue, #732: URL: https://github.com/apache/lucenenet/issues/732
I've noticed that the ICUTokenizer for Thai will not generate the same tokens as the Java variant. I've tested the latest beta version of the .NET project against the Apache Lucene v4.8.0. Even making sure that either implementations use the same `.brk` files did not yield consistent results. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...@lucenenet.apache.org.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org