[
https://issues.apache.org/jira/browse/NUTCH-1503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13529846#comment-13529846
]
Lewis John McGibbney commented on NUTCH-1503:
---------------------------------------------
Seb, thank you for reviewing.
- I will add time limit properties as per your suggestions.
- I was also thinking the same about fetcher.threads.per.host.by.ip. I will
duly drop this property from nutch-default.xml. Do you think this should be
removed from Java code as well?
- I will add fetcher.queue.use.host.settings
Btw, the wiki entry is excellent. This is a key piece of documentation and very
well structured. Thank you for taking the time to do this.
New patch coming up
> Configuration properties not in sync between FetcherReducer and
> nutch-default.xml
> ---------------------------------------------------------------------------------
>
> Key: NUTCH-1503
> URL: https://issues.apache.org/jira/browse/NUTCH-1503
> Project: Nutch
> Issue Type: Bug
> Components: fetcher
> Affects Versions: 2.1
> Reporter: Lewis John McGibbney
> Assignee: Lewis John McGibbney
> Priority: Minor
> Fix For: 2.2
>
> Attachments: NUTCH-1503.patch
>
>
> FetcherReducer.java
> Bug: Following properties appear in FetcherReducer but not in
> nutch-default.xml
> {code}
> 290 useHostSettings =
> conf.getBoolean("fetcher.queue.use.host.settings", false);
> 300 this.timelimit = conf.getLong("fetcher.timelimit", -1);
> 450 this.byIP = conf.getBoolean("fetcher.threads.per.host.by.ip", true);
> 698 timelimit = context.getConfiguration().getLong("fetcher.timelimit",
> -1);
> {code}
> Therefore they cannot be used properly in code execution and must be updated,
> removed and/or added to nutch-default.xml.
> Patch coming up just now.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira