On Fri, 2006-12-08 at 11:01 +0100, Andrzej Bialecki wrote: > Ad 1. > > I suspect that it's sorting the reduce output now ... in 0.8.x this > operation has poor performance, especially when run on a single server. > So, I advise patience, and giving as much CPU and RAM as possible. For > the future, it's also much much better to run the fetcher in non-parsing > mode and run "nutch parse" afterwards as a separate step.
Okay, I'll give it a while and see what happens. Is it possible to get any information on what's going on? I'm running 0.8 pretty much out-of-the-box on a single server. I've seen people mentioning phases of Hadoop - can it tell me what's going on? Thanks -Rob
