I am having an issue with the "start" and "end" offsets of a token.
Here is the char filter configuration:

<charFilter class="solr.PatternReplaceCharFilterFactory" pattern="&lt;/? *ce(bold|sup|inf|hsp|vsp|italic)[^>]*>" replacement="X"/>


The input data is:

<ce:sup loc=\"post\">1</ce:sup>

The Analysis page shows:

text  raw_bytes  start  end  positionLength  type  position
1     [31]       21     31   1               word  1

Shouldn't the "end" offset be 22? The current value breaks highlighting.
HTMLStripCharFilterFactory works properly on the same input.
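
For reference, here is how I arrive at 22. This is only a minimal sketch in plain Python, not Solr code; it assumes the backslashes in the input above are stored literally, which is consistent with the reported start of 21. The variable names are mine.

```python
# Raw input as shown above; the backslashes are assumed to be part of the text.
raw = r'<ce:sup loc=\"post\">1</ce:sup>'

# Offsets of the token "1" in the raw input, which is what highlighting needs.
start = raw.index(">1<") + 1   # first character after the opening tag
end = start + len("1")         # exclusive end of the one-character token

print(start, end, len(raw))
```

The reported end of 31 is the length of the whole input, so the end offset seems to point past the closing tag rather than just past the "1".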

Any help?


Jae
