Hi Martin,
Havve you checked that all mappers are working while parsing job is running?
How many URLs are you trying to parse here?

On Friday, July 19, 2013, Martin Aesch <[email protected]> wrote:
> Dear nutchers,
>
> Having Nutch 2.2.1/HBase 0.90.6/Hadoop 1.1.2/6Mappers/6Reducers/Core
> i7-3770/32GB (no swap)/2x3TB
>
> When I parse (in mapper, 6 simultaneously running map-tasks), this is
> very slow. Max load is ~1.5, max iowait is 5%, max CPU per task is only
> 30%, max CPU for hmaster is about 30%. iotop in consequence also shows
> low numbers.
>
> Since parsing is a CPU-intensive job and all IO-stuff is on very low
> level, I wonder why parsing does not work faster und with full CPU
> usage. It really takes a long time to finish. Where might be the
> bottleneck?
>
> Thanks for any advice,
> Martin
>
>
>
>
>

-- 
*Lewis*

Reply via email to