Hi again,

I am still getting the hang of this.

I have a few questions that I hope someone can help me with.

Is there a way to get a page count?

From what I can gather, the best way to get multiple machines crawling and
indexing is to build crawlers that each hold around 20,000,000 pages.  Has
anyone completed a multi-machine setup that works and is fast when searching?
Is it possible, with enough computers, to ever reach a billion pages with Nutch?
From all the testing I have done, the limitation isn't the crawling, it is
the searching.  If I wanted to put together a system capable of handling very
large numbers of pages, is that possible at this stage?

If there were ever a way to link the Nutch databases, it might be kind of fun
for some of the bigger-bandwidth guys to really get aggressive and see how much
terrain they can cover.

Regards,

J



_______________________________________________
Nutch-general mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-general