According to Dan Langille:
> There are a number of variables in that equation, but I thought I'd share 
> my situation.  I have a dual Pentium box with 64MB of ram.
> 
> freebsddiary.org 
> 
>   website   #pages    time
> 
> 1 diary     500-ish   0:02:30
> 2 archive   13900     1:50:00-ish
> 3 archive   13900     1:40:00-ish (using local_urls)
> 
> All digs are run on the webserver.  If they were run remotely, I'd expect 
> a much bigger difference between example 2 and example 3.

Don't you mean the other way around?  If they were run remotely, the
local_urls wouldn't have much effect.  Even locally, though, it may
be that it's falling back to HTTP an awful lot.  Note that only a very
small set of file suffixes is handled by local_urls (see the attrs.html
documentation for local_urls).  Files without suffixes (or extensions)
don't get handled locally, because htdig can't be sure what content-type
they are.
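
To make the fallback concrete, here is a rough Python sketch of the kind
of decision described above. It is not htdig's actual code: the function
name, the suffix table, and the example paths are all assumptions made up
for illustration, and the real suffix list is whatever attrs.html
documents for your version.

```python
from pathlib import PurePosixPath

# Hypothetical suffix table, for illustration only; check attrs.html
# for the suffixes your htdig version really handles.
LOCAL_SUFFIXES = {
    ".html": "text/html",
    ".htm": "text/html",
    ".txt": "text/plain",
}


def resolve_locally(url, url_prefix, local_dir):
    """Return (file_path, content_type) if the URL can be read from disk.

    Returns None when the caller would have to fall back to HTTP:
    either the URL is outside the mapped prefix, or its suffix is
    missing or unknown, so the content-type cannot be guessed.
    """
    if not url.startswith(url_prefix):
        return None
    rest = url[len(url_prefix):]
    suffix = PurePosixPath(rest).suffix.lower()
    content_type = LOCAL_SUFFIXES.get(suffix)
    if content_type is None:
        return None
    return (local_dir + rest, content_type)


if __name__ == "__main__":
    prefix = "http://www.example.com/"
    docroot = "/usr/local/www/data/"
    for u in (prefix + "archive/msg00001.html",
              prefix + "archive/",
              prefix + "cgi-bin/view"):
        print(u, "->", resolve_locally(u, prefix, docroot))
```

If most of the archive's URLs look like the last two (directory URLs or
names without an extension), nearly every fetch still goes over HTTP,
which would explain a small gap between examples 2 and 3.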

> The diary site gets reindexed as new pages are added.  The frequency 
> varies from twice a week to once a month.  It's no big deal to reindex in 
> full each time.
> 
> The other site contains a mailing list archive.  From time to time, we 
> get 80 new pages a day.  It's not practical to reindex that every day.  
> I'm going to look at incremental digs.  Sometime soon...

Incremental digs are a useful way of dealing with small updates to
large archives like this.  They can be done with htdig -m in the new
snapshots, or by digging a separate database and merging it into the
main one.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html