Nutch generating a site-map

2007-08-02 Thread Robert Young
Hi, We're using Nutch to provide search on our internal website and we would like to be able to build a site-map of the pages Nutch has crawled. Has this kind of thing been implemented before? If not, where should I look to start writing a plugin which could handle this? Thanks Rob

Bad version number in .class file when injecting

2007-07-25 Thread Robert Young
I am getting an UnsupportedClassVersionError when trying to inject, I'm not sure how I've caused this to start (it only started yesterday). I have built both the nutch source and my webapp on the machine they're being run with the same java version as tomcat is running under. As far as I can see

Writing ScoringFilter plugins

2007-07-25 Thread Robert Young
Hi, Is there any documentation on writing different types of plugin, specifically a ScoringFilter but all really. I have written a couple of plugins and continually come across problems where I haven't filled in a method that I'm not interested in when in fact I should be doing some default

Re: Bad version number in .class file when injecting

2007-07-25 Thread Robert Young
It was dues to a plugin not being re-built properly. On 7/25/07, Robert Young [EMAIL PROTECTED] wrote: I am getting an UnsupportedClassVersionError when trying to inject, I'm not sure how I've caused this to start (it only started yesterday). I have built both the nutch source and my webapp

unable to load class for id: 36 during generate

2007-07-23 Thread Robert Young
Hi, I'm doing some fairly gutsy playing around with Nutch to get a multi-index app working nicely and I'm starting to get the following error. WARN crawl.MapWritable: Unable to load meta data entry, ignoring.. : java.io.IOException: unable to load class for id: 36 Which, although only being

Multiple nutch configurations within a single tomcat context

2007-07-18 Thread Robert Young
We are building search for multiple sites, we would like to have multiple indexes (one for each site) but we don't want multiple contexts running as the only difference between them will be their configuration. We would like to be able to manage multiple nutch instances from one tomcat context,

Injecting single URLs to an index

2007-07-09 Thread Robert Young
I asked a similar question last week but I don't think I explained myself properly. I have created a nutch / lucene index using the normal crawl, merge, dedup process. The problem I am having is that this whole process takes a long time, I would like to be able to inject single urls and have them

Stemming with Nutch

2007-06-28 Thread Robert Young
Hi, Are the Nutch Stemming modifications available as a patch? I can't seem to find anything on issue.apache.org Thanks Rob

Case insensitive searching

2007-06-26 Thread Robert Young
Hi, Is it possible to run a case insensitive search via Nutch? Thanks Rob

OR searches possible?

2007-06-22 Thread Robert Young
Hi, Is it possible to put together OR searches through a Nutch plugin? I have read that Nutch does not support ORing search terms together[1], it only supports an implied AND between search terms. I was wondering if it is possible to get a little closer to the building of the Lucene query (and