On Wed, Jun 11, 2008 at 10:11 AM, <[EMAIL PROTECTED]> wrote:
> That's not realistic with Nutch, which was really designed for larger and
> longer "fetch jobs" (more URLs).
>
On that subject: is there a good rule of thumb for the smallest fetch job that makes sense to run with Nutch? We're running some bigger crawls, but we also have a standing list of about 5000 blog feeds that we plan to have Nutch refetch frequently.

Thanks!

-- 
Chris Anderson
http://jchris.mfdz.com
