Bruce,
I recently wrestled with this same issue.
Nutch is good if you need something crawled (e.g. an Apache web server
or a file system).
Lucene is good if you need to index something that can't be crawled
(e.g. a database).
While there are exceptions to the above, I would stick to that as a
general rule of thumb when deciding whether to use Lucene or Nutch in a
project. Of course, an understanding of Lucene will probably help
with Nutch.
IMO.
Phillip
Insurance Squared Inc. wrote:
Hi Bruce,
This list is not only very active - it's full of people constantly
giving helpful, instructive answers. If you've got questions, this is
the place.
I would say, based on my experience, that Nutch is a) excellent and b)
not for the faint of heart when it comes to Java - you'll probably need
someone who knows what they're doing even to get it installed.
g.
bruce wrote:
anybody running lucene/nutch that i can talk to, to exchange information,
ideas.. i'm considering using lucene/nutch for a project, but i have zero
java experience. i'm around the cali/bay area.
the guy who was going to provide the java experience oversold his
expertise.. so i might have to bite the bullet on this one.
thanks
-bruce