Grant Ingersoll created SOLR-13103:
--------------------------------------

             Summary: UnifiedHighlighter Separator-based BreakIterator should 
work with Strings, not just a single character
                 Key: SOLR-13103
                 URL: https://issues.apache.org/jira/browse/SOLR-13103
             Project: Solr
          Issue Type: Improvement
      Security Level: Public (Default Security Level. Issues are Public)
          Components: highlighter
            Reporter: Grant Ingersoll


For the `hl.bs.type` choice of SEPARATOR, it would be nice if we could support 
not just a single character, but a string.  In looking at the code, I see no 
reason Strings can't be supported other than a few signature changes on some 
constructors.

 

My use case: I have docs that I have section and page markers that make for 
conveniently-sized passages for highlighting, but there really isn't any clean 
way to mark those sections with a single character.  For instance, Tika will 
extract and mark pages with `<div class="page"><p/>....</div>`.  If I could 
pass in that `<div class="page">` tag as my separator, I could then just 
highlight within a page.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to