[ 
https://issues.apache.org/jira/browse/LUCENE-5697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14019350#comment-14019350
 ] 

Hoss Man commented on LUCENE-5697:
----------------------------------

1) Lucene 3.5 is pretty old.

2) At first glance, it sounds like the problems you are describing could simply 
be due to a disconnect between how your searches are executed vs how you are 
using the highlighter code.

w/o specific example code or a reproducible test case, there's really no way to 
tell if what you are describing is a genuine bug or a missunderstanding of the 
API.

3) there multiple highlighters available, and a *lot* of different ways to 
configure them, so even if there is a bug, w/o more specifics there really 
isn't enough info here to try and diagnose _where_ the bug is, let alone _what_ 
the bug is.

---

can you please provide some code (ideally a stand alone JUnit test using the 
lucene test-framework) demonstrating the problem?

> Preview issue
> -------------
>
>                 Key: LUCENE-5697
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5697
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: modules/highlighter
>         Environment: DocFetcher 1.1.11 on Win 7(64) pro
>            Reporter: Martin Schoenmakers
>
> In DocFetcher, which uses Lucene v3.5.0, we stumbled on a bug. The lead of 
> DocFetcher has investigated and found the problem seems to be in Lucene. I do 
> not know if this bug has been fixed in a later Lucene version.
> Issue: 
> We use "proximity search": search on multiple words in a directory with about 
> 300 PDF files.   
> E.g. search for "wordA wordB wordC"~50, i.e. three words within 50 words 
> distance of each other. The resulting documents are correct. But the 
> highligted text in the document is often missing. 
> If the words are in the SAME order as in the search AND on the SAME page, 
> then the higlight works correct. But if the order of the words is different 
> from the search (like "wordA wordC wordB" OR the words are not on the same 
> page, then that text is not highlighted. 
> As we use the proximity search on multiple words often, it severely degrades 
> the usability.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to