Thank you very much for your prompt reply. I see what you mean.
Regards,
Gal.

-----Original Message-----
From: Andrzej Bialecki [mailto:[EMAIL PROTECTED]
Sent: Tuesday, April 11, 2006 12:17 PM
To: [email protected]
Subject: Re: Small dev question

Gal Nitzan wrote:
> Hi Andrzej,
>
> I have two questions regarding ParseOutputFormat.java:
>
> 1. On line 102 a String[] is used. Do you think it might be better to use a
> ListArray? It will save a few cycles down the road -- it would save you from
> using "validCount" and would save you the "if" on line 121. I can make a patch
> if you think I'm correct on this.
>

I doubt it would save anything, and even if it did, the savings would be
negligible. Creating a new entry in ListArray and hooking it up to the list
has some cost, too.

> 2. If I understand the functionality correctly, on line 87 a new CrawlDatum is
> created for the fetched page. The interval is set to 0.0. Could you please
> explain why it is set to 0.0?
>

That's only a special additional CrawlDatum, which serves as a signature
container. You see, if we don't parse at the same time as we fetch, then we
can't put the signature in the same CrawlDatum (see the logic in
Fetcher.FetcherThread.output()), so we need another instance to pick up the
signature when running updatedb.

--
Best regards,
Andrzej Bialecki <><
___. ___ ___ ___ _ _ __________________________________
[__ || __|__/|__||\/| Information Retrieval, Semantic Web
___|||__|| \| || | Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
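For readers without the source at hand, the trade-off discussed in question 1 can be sketched roughly as follows. This is not the actual ParseOutputFormat code: the class name, the isValid filter, and the trimming "if" are assumptions made for illustration, and "ListArray" is taken to mean java.util.ArrayList. Both variants do one pass and one store per kept entry, which is consistent with the reply that any saving would be negligible.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class LinkCollectionSketch {

    // Hypothetical filter standing in for whatever check drops a link.
    static boolean isValid(String url) {
        return url != null && !url.isEmpty();
    }

    // Array style: preallocate, track a counter, trim at the end
    // if some entries were skipped (assumed purpose of the "if").
    static String[] withArray(String[] candidates) {
        String[] kept = new String[candidates.length];
        int validCount = 0;
        for (String url : candidates) {
            if (isValid(url)) {
                kept[validCount++] = url;
            }
        }
        if (validCount < kept.length) {
            kept = Arrays.copyOf(kept, validCount);
        }
        return kept;
    }

    // List style: no counter and no trim, but each add() still
    // performs a bounds check and a store into a backing array.
    static List<String> withList(String[] candidates) {
        List<String> kept = new ArrayList<>(candidates.length);
        for (String url : candidates) {
            if (isValid(url)) {
                kept.add(url);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        String[] candidates = {"http://a.example/", "", "http://b.example/", null};
        System.out.println(Arrays.toString(withArray(candidates)));
        System.out.println(withList(candidates));
    }
}
```

Both methods keep the same two URLs from the sample input; the list version is shorter to read, which may be a better argument for it than cycle counts.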
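The "signature container" idea from question 2 might look something like the sketch below. Everything here is a stand-in: Datum, Status, and merge are hypothetical names, not Nutch's CrawlDatum or its updatedb logic. The point is only that a second record, whose interval is irrelevant and so left at 0.0, can carry a signature that is folded into the real record later.

```java
import java.util.Arrays;
import java.util.List;

public class SignatureCarrierSketch {

    enum Status { FETCHED, SIGNATURE }

    // Hypothetical stand-in for a crawl record; not Nutch's CrawlDatum.
    static final class Datum {
        final Status status;
        final float interval;
        final String signature; // null until a parse has produced one

        Datum(Status status, float interval, String signature) {
            this.status = status;
            this.interval = interval;
            this.signature = signature;
        }
    }

    // Merge all records seen for one URL, as an update step might:
    // the fetch record supplies status and interval, the carrier
    // record supplies only the signature.
    static Datum merge(List<Datum> recordsForUrl) {
        Datum fetched = null;
        String signature = null;
        for (Datum d : recordsForUrl) {
            if (d.status == Status.SIGNATURE) {
                signature = d.signature;
            } else {
                fetched = d;
            }
        }
        if (fetched == null) {
            return null; // nothing to attach the signature to
        }
        String merged = (signature != null) ? signature : fetched.signature;
        return new Datum(fetched.status, fetched.interval, merged);
    }

    public static void main(String[] args) {
        Datum fromFetch = new Datum(Status.FETCHED, 30.0f, null);
        Datum fromParse = new Datum(Status.SIGNATURE, 0.0f, "abc123");
        Datum result = merge(Arrays.asList(fromFetch, fromParse));
        System.out.println(result.status + " " + result.interval + " " + result.signature);
    }
}
```

In this sketch the carrier's 0.0 interval never reaches the merged record, which is why its value does not matter.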
