Hi again, I am still getting the hang of this.
I have a few questions that hopefully someone can help me out with.

Is there a way to get a page count?

From what I can gather, the best way to get multiple machines crawling and indexing is to build crawlers that hold around 20,000,000 pages. Has anyone completed a multiple-computer setup that works and is fast when searching? Is it possible, with enough computers, to ever hit a billion pages with Nutch?

From all the testing I have done, the limitation isn't the crawling, it is the searching. If I wanted to put together a unit capable of handling big loads of pages, is that possible at this stage?

If there were ever a way to link the Nutch databases, it might be kind of fun for some of the bigger-bandwidth guys to really get aggressive and see what kind of terrain they can cover.

Regards,
J

_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
