[ 
https://issues.apache.org/jira/browse/LUCENE-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12677629#action_12677629
 ] 

Peter Wolanin commented on LUCENE-1500:
---------------------------------------

Koji - thanks - I was aware that not all worked with the mapping filter, but I 
was apparently misinformed since I was told that the 
"solr.HTMLStripWhitespaceTokenizerFactory" was also suitable for CharFilter.  
Indeed your e-mail thread linked from SOLR-822 describes exactly the problem I 
have:

bq. As you can see, if you use CharFilter, Token offsets could be incorrect 
because CharFilters may convert 1 char to 2 chars or the other way around.

In the thread you suggest that this API could be aded to lucene java?

> Highlighter throws StringIndexOutOfBoundsException
> --------------------------------------------------
>
>                 Key: LUCENE-1500
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1500
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: contrib/highlighter
>    Affects Versions: 2.4
>         Environment: Found this running the example code in Solr (latest 
> version).
>            Reporter: David Bowen
>            Assignee: Michael McCandless
>             Fix For: 2.4.1, 2.9
>
>         Attachments: LUCENE-1500.patch, patch.txt
>
>
> Using the canonical Solr example (ant run-example) I added this document 
> (using exampledocs/post.sh):
> <add><doc>
>   <field name="id">Test for Highlighting 
> StringIndexOutOfBoundsExcdption</field>
>   <field name="name">Some Name</field>
>   <field name="manu">Acme, Inc.</field>
>   <field name="features">Description of the features, mentioning various 
> things</field>
>   <field name="features">Features also is multivalued</field>
>   <field name="popularity">6</field>
>   <field name="inStock">true</field>
> </doc></add>
> and then the URL 
> http://localhost:8983/solr/select/?q=features&hl=true&hl.fl=features caused 
> the exception.
> I have a patch.  I don't know if it is completely correct, but it avoids this 
> exception.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to