Distributed search on Nutch

Boris Lau Tue, 25 Mar 2008 06:47:05 -0700

Hi everyone,

Wonder if anybody have tips on how to split up the crawl data for
distributed search using "bin/nutch server"?


I am facing problem is setting up distributed search in Nutch 0.9
using hadoop.  I follow the documentation on
http://wiki.apache.org/nutch/NutchHadoopTutorial, the search server
works okay but how would I manage to split up my crawl data between
different nodes (currently experimenting with about 3) to utilise the
parallel search when my current index is merged into one segment?

Thanks for the help in advance!
boris

p.s. would anyone have more references on how to setup distributed
serach in Nutch? the above link is one of the only resources I can
find.  Thanks!

Distributed search on Nutch

Reply via email to