I am having some issue regarding "start" and "End" position of token. Here is the CharFilterFactory.
<charFilter class="solr.PatternReplaceCharFilterFactory" pattern="</? *ce(bold|sup|inf|hsp|vsp|italic)[^>]*>" replacement="X"/> Then the input data is <ce:sup loc=\"post\">1</ce:sup> In the Analysis page, textraw_bytesstartendpositionLengthtypeposition 1[31]21311word1 Should the "end" position "22"? It breaks the Highlighting... HTMLStripCharFilterFactory is working properly Any help? Jae