On Tue, 2004-04-20 at 17:45, Gilles Detillieux wrote:
> Hi, Chris and other developers.  The problem with this fix is that
> exclude_urls and bad_querystr can no longer be used in server blocks or
> URL blocks, as they'll only be parsed once regardless of how they're used.
> That's OK if you don't use them in blocks, but for the distributed code,
> we need to find a more generalized solution.  

 Right. Having just found the block documentation, I can indeed see this
as a useful feature, and probably something that I would use if the
performance hit wasn't so bad.

 One thing I could think of that could help performance quite
considerably is to have an array of type *HtRegexList that could contain
the parsed excludes list/badquery lists, etc. per block. Or perhaps a
struct that contains all parsed config attributes per block and have an
array of pointers to it. This way the config could be loaded and still
only need to be parsed once. The only downside I could see is that this
would mean htdig would have a slightly larger memory footprint, but I
don't really see that as a big problem. We're probably talking about a
couple k more, by today's standards, even a couple meg more wouldn't be
a big deal.


> 3) We may also need to determine if the repeated calls to config->Find()
> at each URL are having an impact on performance as well.  E.g. what is
> the performance cost of doing thousands of calls like this one?
> 
>      tmpList.Create(config->Find(&aUrl, "exclude_urls"), " \t");

 Easy thing to test. I'll give it a try later this week if I can,
perhaps tomorrow, and report back.

Cheers,

Chris

-- 
Christopher Murtagh
Enterprise Systems Administrator
ISR / Web Communications Group 
McGill University
Montreal, Quebec
Canada

Tel.: (514) 398-3122
Fax:  (514) 398-2017


-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
ht://Dig Developer mailing list:
[EMAIL PROTECTED]
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to