[ https://issues.apache.org/jira/browse/LUCENE-1559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12681347#action_12681347 ]
Uwe Schindler commented on LUCENE-1559: --------------------------------------- The problems with POI often come from the fact, that POI does not filter the outputted characters and sometimes even generates non Unicode conform char values (>0xd000). E.g. you sometimes have non-breaking-spaces instead of normal spaces or other things. Depending on the Lucene Analyzer you use, there may be problems. E.g., TIKA uses a filter that maps all incorrect characters coming from POI according to aloowed chars in XML (because it generates XHTML from the docs that can be indexed using TikaAnalyzer). I think, your problem is invalid plain text content coming from POI. > Highlighting not working in some instances even though indexsearcher returns > result. > ------------------------------------------------------------------------------------ > > Key: LUCENE-1559 > URL: https://issues.apache.org/jira/browse/LUCENE-1559 > Project: Lucene - Java > Issue Type: Bug > Affects Versions: 2.4 > Environment: Mac OS 1.5 > Eclipse 3.4 > Reporter: Amin Mohammed-Coleman > Attachments: AJiA CH 02.doc, HighLightingSummaryTest(2).java, > HighLightingSummaryTest.java > > > In some instances highlighting does not return a result. However when you > use a different term for teh same document you get results. > Please see attach testcase and template file. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e-mail: java-dev-h...@lucene.apache.org