Hi Dogacan,

    I haven't yet checked it with the NUTCH-474 patch. Thanks for that update!

   What I was trying to say was that even if fetcher2 works perfectly, it
can only give you a significant performance boost when the number of threads
per task is much smaller than the number of hosts that will be fetched by
that task.
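
   To make the threads-versus-hosts point concrete, here is a toy
simulation. It is only a sketch and not Nutch code: the function names, the
1-second fetch time, the 5-second per-host delay and the scheduling rules
are all assumptions chosen for illustration. "blocking" stands in for the
old fetcher (a thread takes the next url in list order and waits if that
host is busy); "queued" stands in for fetcher2 (a free thread takes a url
from whichever host becomes available soonest).

```python
import heapq

def blocking_time(urls, threads, fetch=1.0, delay=5.0):
    """Toy model of a blocking fetcher (hypothetical, not Nutch code).

    urls is a list of host ids in fetch order. A thread takes the next
    url and, if that host is still in its politeness delay, sits idle
    until the host is ready again.
    """
    thread_free = [0.0] * threads      # min-heap of times threads become free
    host_ready = {}                    # host -> earliest allowed start time
    finish = 0.0
    for host in urls:
        t = heapq.heappop(thread_free)
        start = max(t, host_ready.get(host, 0.0))
        end = start + fetch
        host_ready[host] = end + delay
        heapq.heappush(thread_free, end)
        finish = max(finish, end)
    return finish

def queued_time(urls, threads, fetch=1.0, delay=5.0):
    """Toy model of a per-host-queue fetcher (hypothetical, not Nutch code).

    A free thread picks the host that still has urls pending and becomes
    ready soonest, instead of being tied to the list order.
    """
    pending = {}                       # host -> number of urls left
    for host in urls:
        pending[host] = pending.get(host, 0) + 1
    host_ready = {host: 0.0 for host in pending}
    thread_free = [0.0] * threads
    finish = 0.0
    for _ in range(len(urls)):
        t = heapq.heappop(thread_free)
        host = min((h for h in pending if pending[h] > 0),
                   key=lambda h: host_ready[h])
        start = max(t, host_ready[host])
        end = start + fetch
        host_ready[host] = end + delay
        pending[host] -= 1
        heapq.heappush(thread_free, end)
        finish = max(finish, end)
    return finish

# 4 hosts x 3 urls, grouped by host
many_hosts = [0, 0, 0, 1, 1, 1, 2, 2, 2, 3, 3, 3]
# 2 hosts x 3 urls, grouped by host
few_hosts = [0, 0, 0, 1, 1, 1]
```

   In this model, with 2 threads and 4 hosts, blocking_time(many_hosts, 2)
comes out at 34.0 against 14.0 for queued_time(many_hosts, 2). With 4
threads and only 2 hosts, both functions return 13.0 for few_hosts, because
the per-host delay is the bottleneck and no scheduling can hide it.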

  Do you have any numbers about the number of hosts from which you are
fetching?

-vishal.

-----Original Message-----
From: Dogacan Güney [mailto:[EMAIL PROTECTED] 
Sent: Thursday, May 24, 2007 4:46 PM
To: [EMAIL PROTECTED]; [EMAIL PROTECTED]
Subject: Re: [Nutch-general] Fetcher2 slowness?

Hi Vishal,

On 5/23/07, Vishal Shah <[EMAIL PROTECTED]> wrote:
> Hi Dogacan,
>
>    Fetcher2 gives a better performance when the number of hosts per task
> is more than the number of threads that the task can use. In this case,
> fetcher might block on some hosts, whereas fetcher2 will use that idle
> time in crawling some other host.
>
>    It could be that the number of hosts per task is not significantly
> higher than the number of threads per task. In that case, ideally you
> should see a similar performance from fetcher2 and fetcher (assuming same
> url list and network bandwidth).
>
>   Also, as Andrzej suggested - it would be good to have some more
> debugging info.

Have you tested Fetcher2 after NUTCH-474? There were a couple of bugs
in Fetcher2 that made it work just like Fetcher (because lib-http
still blocked threads, making Fetcher2's queue logic useless).

Looking at the code, I can't see any other bugs, but I am still
testing. Perhaps I will find a couple more (or perhaps I will find out
that something in my conf is broken).

>
> Regards,
>
> -vishal.
>
> -----Original Message-----
> From: Dogacan Güney [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, May 23, 2007 8:21 PM
> To: [EMAIL PROTECTED]
> Subject: Re: [Nutch-general] Fetcher2 slowness?
>
> On 5/23/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
> > So what was Fetcher2's performance like when its number of threads was
> > the same as that of Fetcher?
>
> It is still slower. I tried giving Fetcher2 more threads; it is still
> worse than Fetcher, but a bit better than Fetcher2 with fewer threads
> (Fetcher finished in 1 hour, Fetcher2 in about 2.5). Though I have
> performed other tests where their performance is similar (and I have no
> idea why). I am trying to find the cause of the problem, but so far I
> have had no luck.
>
> >
> > Otis
> >
>
> [snip]
>
> --
> Dogacan Güney
>
>


-- 
Dogacan Güney


