Bonjour Yves,

Did you see https://issues.apache.org/jira/browse/NUTCH-770? It has been
committed to the trunk back in December.

HTH

Julien
-- 
DigitalPebble Ltd
http://www.digitalpebble.com

On 9 March 2010 17:26, Yves Petinot <y...@snooth.com> wrote:

> I was wondering if the current release of Nutch provides any support for
> slow servers ? The issue has been previously described in the following JIRA
> entry:
>
>
> https://issues.apache.org/jira/browse/NUTCH-629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12588746#action_12588746
>
> While being able to incorporate server latency information in the
> generation of fetch lists is nice to have, I was wondering if any
> configuration parameter is available to enforce a timeout on the effective
> fetch duration for a single URL ? In my current setup, I'm observing that
> over 50% of the time needed to complete a fetch task is due to a handful of
> slow hosts.
>
> Has anyone on the list been able to optimize their crawls to minimize the
> impact of slow hosts ?
>
> -yp
>

Reply via email to