It would appear that, despite the mention of 2.0.0 in the Nutch 0.9 changelist, Nutch actually uses Lucene 2.1.0, which writes a slightly different index format.
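For anyone wanting to check this on their own index, here is a rough sketch (not part of Nutch or Zend_Search_Lucene) that inspects an index directory. It relies on my understanding that Lucene 2.1 began writing generation-numbered `segments_N` files plus `segments.gen`, where earlier releases wrote a single `segments` file. The function name `describe_index_dir` is made up for illustration, and `data/index` is just the path from the error message quoted below. It also lists any files the current user cannot read, since that is what the exception complains about.

```python
import os
import re


def describe_index_dir(path):
    """Summarise which segments files exist in a Lucene index directory.

    Assumption: a 'segments_N' file (N in base 36) indicates an index
    written by Lucene 2.1 or later; a bare 'segments' file indicates an
    older layout. This is a heuristic, not an official format check.
    """
    names = sorted(os.listdir(path))

    # Generation-numbered commit files, e.g. segments_1, segments_a
    generations = [n for n in names if re.fullmatch(r"segments_[0-9a-z]+", n)]
    has_legacy = "segments" in names

    # Files the current process cannot open for reading
    unreadable = [
        n for n in names if not os.access(os.path.join(path, n), os.R_OK)
    ]

    if generations:
        layout = "2.1+ (segments_N)"
    elif has_legacy:
        layout = "pre-2.1 (segments)"
    else:
        layout = "unknown"

    return {"layout": layout, "generations": generations, "unreadable": unreadable}


if __name__ == "__main__":
    index_dir = "data/index"  # path taken from the error message in this thread
    if os.path.isdir(index_dir):
        print(describe_index_dir(index_dir))
```

If the layout comes back as the `segments_N` variant, that would be consistent with the index having been written by a Lucene newer than 2.0, though it does not by itself explain the "not readable" message.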
On Wed, May 7, 2008 at 11:20 PM, Jani Hartikainen <[EMAIL PROTECTED]> wrote:
> Permissions are OK - it's a windows box so should have no probs there.
>
> I will look into this again tomorrow and I can create the issue then.
>
> ---- (this was sent directly to my mailbox I think)
>
> Check the permissions on the index directories/files.
>
> --
> Eric Marden
>
> On Wed, 07 May 2008 22:56:09 +0300, Alexander Veremyev <[EMAIL PROTECTED]> wrote:
> > Hi Jani,
> >
> > Could you create JIRA issue for this and attach to it your index? Or
> > send it to me for testing?
> >
> > With best regards,
> > Alexander Veremyev.
> >
> > ________________________________
> >
> > From: Jani [mailto:[EMAIL PROTECTED]
> > Sent: Wednesday, May 07, 2008 4:07 PM
> > To: [email protected]
> > Subject: [fw-general] Zend_Search_Lucene and Apache Nutch
> >
> > I've been trying to figure out whether it's possible to use
> > Zend_Search_Lucene in combination with Apache Nutch, which has a
> > crawler and it can parse out a lot of formats like HTML, PDF etc.
> > so it would be perfect for my case.
> >
> > The docs say Zend_Search_Lucene supports Lucene index formats 1.9 to
> > 2.0, and according to the change list for the latest Nutch version
> > (0.9), Nutch uses Lucene 2.0.0, but for some reason I haven't been
> > able to get ZSL to open the indexes.
> >
> > When trying to open() the index, ZSL fails with Fatal error: Uncaught
> > exception 'Zend_Search_Lucene_Exception' with message 'File
> > 'data/index/_0.cfs' is not readable.'
> >
> > Anyone got any insight to this matter? Or perhaps a separate crawler
> > solution to suggest?
