Kragen,

Thanks, I like your solution. I hope the other Nutch developers will
consider merging this change into the main branch.
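
For anyone following along, here is a minimal standalone sketch of the
conversion the patch quoted below performs: read the delay as fractional
seconds and turn it into whole milliseconds. The class and method names
are hypothetical and it does not touch NutchConf; it only illustrates the
arithmetic. On the question about the literal: "1.0f" is a Java float
literal and is equivalent to "(float)1.0".

```java
class ServerDelayExample {

    // Hypothetical helper, not part of Nutch: converts a delay given in
    // (possibly fractional) seconds into whole milliseconds. A missing
    // value falls back to defaultSeconds.
    static long delayMillis(String seconds, float defaultSeconds) {
        float value = (seconds == null)
                ? defaultSeconds
                : Float.parseFloat(seconds.trim());
        // The cast truncates toward zero, as in the quoted patch.
        return (long) (value * 1000);
    }

    public static void main(String[] args) {
        System.out.println(delayMillis("0.25", 1.0f)); // quarter-second delay
        System.out.println(delayMillis(null, 1.0f));   // falls back to 1 second
    }
}
```

With a setting of 0.25 the fetcher would wait a quarter of a second
between hits on the same host instead of a full second.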

Vince


On Fri, 03 Dec 2004 09:35:45 -0800, Kragen Sitaker
<[EMAIL PROTECTED]> wrote:
> On Fri, 2004-12-03 at 00:34 -0700, Vince Taluskie wrote:
> 
> 
> > What about allowing fetcher.server.delay to be in millisecs?  This would
> > also help out our situation where a single host serves up our intranet,
> > but we've got 100,000 documents.   The limitation of only servicing 1
> > hit/second is making our crawls take about 2 days and we'd like to go to
> > nightly. When I've tried to run with delay set to '0' I get a hang
> > in the nutch crawl command where no further output is logged and
> > the java process associated with the nutch crawl command just spins on
> > cpu.
> 
> We had very similar problems at CommerceNet.  I solved them by parsing
> fetcher.server.delay as a float.  As it turns out, the SERVER_DELAY
> constant that the config gets read into is actually already in
> milliseconds.
> 
> Index: src/plugin/protocol-http/src/java/net/nutch/protocol/http/Http.java
> ===================================================================
> --- src/plugin/protocol-http/src/java/net/nutch/protocol/http/Http.java (revision 4)
> +++ src/plugin/protocol-http/src/java/net/nutch/protocol/http/Http.java (revision 5)
> @@ -45,7 +45,7 @@
>    static String AGENT_STRING = getAgentString();
> 
>    static long SERVER_DELAY =
> -    NutchConf.getInt("fetcher.server.delay", 1) * 1000;
> +    (long)(NutchConf.getFloat("fetcher.server.delay", (float)1.0) * 1000);
> 
>    static {
>      LOG.info("http.proxy.host = " + PROXY_HOST);
> 
> HTH.  Please pardon my pidgin Java --- I don't know Java.  I think at
> least I should have written "(float)1.0" as "1.0f" --- any Java
> programmers who can comment?
> 
> -------------------------------------------------------
> 
> 
> SF email is sponsored by - The IT Product Guide
> Read honest & candid reviews on hundreds of IT Products from real users.
> Discover which products truly live up to the hype. Start reading now.
> http://productguide.itmanagersjournal.com/
> _______________________________________________
> Nutch-developers mailing list
> [EMAIL PROTECTED]
> https://lists.sourceforge.net/lists/listinfo/nutch-developers
>

