RE: Searching for a phrase which spans on 2 pages

2006-07-13 Thread Ramesh Salla
home of NetSearch
> -Original Message-
> From: Mile Rosu [mailto:[EMAIL PROTECTED]
> Sent: 11 July 2006 15:55
> To: java-user@lucene.apache.org
> Subject: Searching for a phrase which spans on 2 pages
>
> Hello,
>
> I am working on an application similar to

Re: Searching for a phrase which spans on 2 pages

2006-07-12 Thread Erick Erickson
Sweet!

RE: Searching for a phrase which spans on 2 pages

2006-07-12 Thread Mike Streeton
home of NetSearch
-Original Message-
From: Mile Rosu [mailto:[EMAIL PROTECTED]
Sent: 11 July 2006 15:55
To: java-user@lucene.apache.org
Subject: Searching for a phrase which spans on 2 pages

Hello,

I am working on an application similar to google books which allows searching on documents

Re: Searching for a phrase which spans on 2 pages

2006-07-12 Thread Mile Rosu
Hello Erick, I have been trying some scenarios on Google Books and apparently found a Google bug ... It looks like they use approach number 2, as this query illustrates. http://books.google.com/books?vid=ISBN1564968316&id=14Xx2T8tmMYC&pg=PA8&lpg=PA8&dq=%2B%22the+site+is+unburdened%22&sig=QR

Re: Searching for a phrase which spans on 2 pages

2006-07-11 Thread Erick Erickson
I can think of several approaches, but the experts will no doubt show me up...
1> Index the entire book as a single document. Also, index the beginning and ending offset of each page in separate "documents". Assuming you can find the offset in the big doc of each matching phrase, you can also fin
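
The offset bookkeeping behind approach 1 can be sketched roughly as follows. This is only an illustration in plain Python, not Lucene code: the function names (`build_page_starts`, `page_of`, `find_phrase_pages`) and the sample page text are made up for the example, pages are assumed to be joined with a single space, and a naive substring search stands in for the real phrase query.

```python
from bisect import bisect_right


def build_page_starts(pages):
    """Join the pages into one text and record where each page starts."""
    starts = []
    offset = 0
    for text in pages:
        starts.append(offset)
        offset += len(text) + 1  # +1 for the space that joins two pages
    return " ".join(pages), starts


def page_of(starts, offset):
    """Return the 0-based index of the page containing the given offset."""
    return bisect_right(starts, offset) - 1


def find_phrase_pages(pages, phrase):
    """Return (first_page, last_page) for every occurrence of the phrase."""
    full_text, starts = build_page_starts(pages)
    hits = []
    pos = full_text.find(phrase)
    while pos != -1:
        first = page_of(starts, pos)
        last = page_of(starts, pos + len(phrase) - 1)
        hits.append((first, last))
        pos = full_text.find(phrase, pos + 1)
    return hits


# Hypothetical two-page sample; the phrase straddles the page break.
pages = ["... and the site is", "unburdened by heavy graphics ..."]
print(find_phrase_pages(pages, "the site is unburdened"))
```

A hit whose first and last page differ is one that spans a page boundary, so both pages would need to be shown or highlighted.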

Searching for a phrase which spans on 2 pages

2006-07-11 Thread Mile Rosu
Hello, I am working on an application similar to Google Books which allows searching on documents that each represent a scanned page. Of course, one might search for a phrase starting at the end of one page and ending at the beginning of the next one. In this case I do not know how I might treat