Hi everyone, Wonder if anybody have tips on how to split up the crawl data for distributed search using "bin/nutch server"?
I am facing problem is setting up distributed search in Nutch 0.9 using hadoop. I follow the documentation on http://wiki.apache.org/nutch/NutchHadoopTutorial, the search server works okay but how would I manage to split up my crawl data between different nodes (currently experimenting with about 3) to utilise the parallel search when my current index is merged into one segment? Thanks for the help in advance! boris p.s. would anyone have more references on how to setup distributed serach in Nutch? the above link is one of the only resources I can find. Thanks!
