[
https://issues.apache.org/jira/browse/SOLR-3110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13211692#comment-13211692
]
Koji Sekiguchi commented on SOLR-3110:
--------------------------------------
I paste the URL in the mail thread that describes the problem with concrete
data:
http://www.lucidimagination.com/search/document/20ffaea7ccebfafd#38a9bb5cec478ec6
> Search result comes up with truncated words at the start of highlighted
> fragment
> --------------------------------------------------------------------------------
>
> Key: SOLR-3110
> URL: https://issues.apache.org/jira/browse/SOLR-3110
> Project: Solr
> Issue Type: Bug
> Components: highlighter
> Affects Versions: 4.0
> Environment: java Tomcat Solaris
> Reporter: Shyam Bhaskaran
> Labels: FastVectorHighlighter, boundaryScanner, highlighting,
> solr
>
> It is being observed that words are getting truncated at the start of
> Highlighter fragment displayed.
> Following boundary scanner settings are introduced inside in the
> solrconfig.xml file
> <str name="hl.bs.chars">.,!? &\#9;&\#10;&\#13;</str>
> If I change the settings to
> <str name="hl.bs.chars">.,!?</str>
> then it is seen that this issue goes away but another issues comes up where
> the highlighted search fragment does not start from the beginning of the
> sentence.
> Below is the complete list of setting we are using for boundary scanner.
> <boundaryScanner name="simple"
> class="solr.highlight.SimpleBoundaryScanner" default="true">
> <lst name="defaults">
> <str name="hl.bs.maxScan">200</str>
> <str name="hl.bs.chars">.,!? &\#9;&\#10;&\#13;</str>
> </lst>
> </boundaryScanner>
> <boundaryScanner name="breakIterator"
> class="solr.highlight.BreakIteratorBoundaryScanner">
> <lst name="defaults">
> <str name="hl.bs.type">SENTENCE</str>
> <str name="hl.bs.language">en</str>
> <str name="hl.bs.country">US</str>
> </lst>
> </boundaryScanner>
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]