The simplest solution is always the best - when storing the page, do not break up sentences. So a page will be all the sentences that occur on it. If a sentence starts on one page and finishes on the next it will be included in both pages in the index.
Hope this helps Mike www.ardentia.com the home of NetSearch -----Original Message----- From: Mile Rosu [mailto:[EMAIL PROTECTED] Sent: 11 July 2006 15:55 To: java-user@lucene.apache.org Subject: Searching for a phrase which spans on 2 pages Hello, I am working on an application similar to google books which allows searching on documents which represent a scanned page. Of course, one might search for a phrase starting at the end of one page and ending at the beginning of the next one. In this case I do not know how I might treat this. Both pages should be returned as hit results. Do you have any idea on how this situation might be handled? Thank you, Mile Rosu --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]