[
https://issues.apache.org/jira/browse/NUTCH-1147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13971471#comment-13971471
]
Julien Nioche commented on NUTCH-1147:
--------------------------------------
Good idea not to force it to 1 but what about relying on the standard
'mapred.reduce.tasks' param which can either be set globally or specified on
the command line (see crawl script) instead of implementing a special param for
it?
> WebGraph nodeDumper uses only 1 reducer
> ---------------------------------------
>
> Key: NUTCH-1147
> URL: https://issues.apache.org/jira/browse/NUTCH-1147
> Project: Nutch
> Issue Type: Improvement
> Reporter: Markus Jelsma
> Assignee: Markus Jelsma
> Priority: Trivial
> Fix For: 1.9
>
> Attachments: NUTCH-1147-1.5-1.patch
>
>
> The noderDumper is restricted to only one reducer, making it slow and
> producing too large files.
--
This message was sent by Atlassian JIRA
(v6.2#6252)