According to Jose Julian Buda:
> Hi , my question is about the re-index process
> i working on a web server that every days is updated
> with 2000 new web page coming in, i mean ,i have a
> directory that every days come in 2000 html files and
> then , every 120 minutes with crontab i run "rundig"
> and it make the index database .it work fine , but
> take much time to make the database .
>  the "rundig" create every time it run the database
> from de begin or just make an "update" with the new
> files arriving?
> i need to make just un update , because of....the
> resources...
> is it possible ?

Sure.  Just don't use the standard "rundig" script.  It's designed for
rebuilding the databases from scratch, so it uses "htdig -i".  Instead,
you should run htdig without -i, followed by htmerge.  You can put together
your own script to do this, and run it from cron.

If you want to be able to search while the database is updated, you should
try the ftp://htdig.sourceforge.net/pub/htdig/contrib/scripts/rundig.sh
script, which maintains two copies of the database so one can be updated
while the older one is still being searched.

Even with this, though, you may find that with a site growing at the rate
yours is, you'll probably reach a point where 2 hours isn't enough time
even for an update run.  At that point, you may need to decrease the
frequency with which you run the script from cron, or add some sort of
lockout to keep the script from starting up if an earlier instance of it
is still running.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to