Hi Please vote on RTC for making org.apache.lucene.analysis.standard.StandardAnalyzer as the default OOTB analyzer. This analyzer is capable of handling surrogate characters unlike the current default analyzer. Please see [1]
[1] - https://issues.apache.org/jira/browse/OAK-3276 Regards Satya Deep
