Web Service on Nutch

2010-04-25 Thread Kim Theng Chong
Hi All, I would like to build web service based on Nutch (I had followed the wiki tutorial on Run Nutch in Eclipse1.0 and plug in tutorial so I had done some customization on Nutch crawl and Nutch search, and I would like to build these as web service. However, when I was trying to build it

How to do faceting on data indexed by Nutch

2010-04-25 Thread KK
Hi All, I might be repeating this question asked by someone else but googling didn't help tracking any such mail responses. I'm pretty much aware of Solr/Lucene and its basic architecture. I've done hit highlighting in Lucene, has idea on faceting support by Solr but never tried it actually. I

Re: How to do faceting on data indexed by Nutch

2010-04-25 Thread Andrzej Bialecki
On 2010-04-25 15:03, KK wrote: Hi All, I might be repeating this question asked by someone else but googling didn't help tracking any such mail responses. I'm pretty much aware of Solr/Lucene and its basic architecture. I've done hit highlighting in Lucene, has idea on faceting support by

Separate Nutch(crawl) and Lucene (index/search)

2010-04-25 Thread sb101h
I have a requirement where I want to index and search file system contents (my local server contents), and at the same time crawl a select set of web-sites on the same search query. I have search for my local file system implemented through Lucene. I would like to have Nutch just crawl the

[VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-25 Thread Mattmann, Chris A (388J)
Hi Folks, I have posted an updated candidate for the Apache Nutch 1.1 release. The source code is at: http://people.apache.org/~mattmann/apache-nutch-1.1/rc2/ The major difference between this release and rc #1 is the application of NUTCH-812 - Crawl.java incorrectly uses the Generator API