Hi Thank you for Katta
But are there any built-in Nutch functionality which can do this stuff. What I am looking forward is to make distributed search as I am planning to build an index of quite big size and so it will be not possible to keep it on one server. What are best practices for doing this? Alex 2008/8/3 Marcus Herou <[EMAIL PROTECTED]> > Hi. > > I think you should look at Katta: http://katta.wiki.sourceforge.net/. But > essentially I think it is quite easy to create a map reduce job which uses > an arbitrary lucene index as input and writes the output to the local > filesystem. > > Kindly > > //Marcus > > > On Sat, Aug 2, 2008 at 10:51 AM, Alexander Aristov < > [EMAIL PROTECTED]> wrote: > > > Hi > > > > I am planning to implement distributed searching using Nutch and hence I > > will need a few machines with built indexes. > > > > First I want to build index running it on a Hadoop cluster. One big index > > at > > once. > > > > Is it possible to split up the index later (and all necessary segments) > to > > upload them to different servers. Also can I set size of a part of each > > index? > > > > Also I use merge segemnts and merge index routines. Are they necessary If > I > > want to have split index? > > > > > > -- > > Best Regards > > Alexander Aristov > > > > > > -- > Marcus Herou CTO and co-founder Tailsweep AB > +46702561312 > [EMAIL PROTECTED] > http://www.tailsweep.com/ > http://blogg.tailsweep.com/ > -- Best Regards Alexander Aristov
