According to Oliver Harvey:
> ...I'm trying to merge any new pages on our site into our live htdig
> database.
>
> ...this *seems* to work ok.
>
> ...has anyone had any problems doing similar/got any comments ?? -
> all appreciated.
>
> I'm doing this:
> - building a large htdig database to start off
> - then, each day, indexing *new pages only* to form a new database
>   in a separate directory.
> - then, each day, trying to merge this new database into the big
>   live one (with an htmerge -m)
>
> this is great because as long as I can work out which pages are new
> on our site every day, I only need *one* copy of our big
> database (~22G) at any one time - (no .work files !).

Well, if it seems to work with a database that large, I'd guess your OS
handles files larger than 2GB without difficulty.  That's one obstacle
many users run into.
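
If you want to see which of your database files are actually past that
limit, a quick check along these lines would do it. This is only a rough
sketch: the function name and the directory argument are made up for
illustration, and the threshold assumes the usual signed 32-bit file
offset (2^31 - 1 bytes).

```python
import os

# Largest size addressable with a signed 32-bit file offset (assumed limit).
LIMIT_32BIT = 2**31 - 1


def files_over_limit(directory, limit=LIMIT_32BIT):
    """Return sorted (name, size) pairs for regular files larger than `limit`."""
    found = []
    for entry in os.scandir(directory):
        if entry.is_file():
            size = entry.stat().st_size
            if size > limit:
                found.append((entry.name, size))
    return sorted(found)
```

Point it at whatever directory holds your database files; an empty list
means nothing there has crossed the limit yet.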

If you're running version 3.1.5, I highly recommend you apply the
ftp://ftp.ccsf.org/htdig-patches/3.1.5/words-db.cc-rundig.0 patch for htmerge,
to fix some serious bugs in the code (also read the notes in the patch).
You could also try the latest 3.1.6 development snapshot on the htdig.org
web site.

Lastly, though, I'd say you're taking chances with updating only one copy
of a large database like this.  If anything goes wrong while updating, you
could lose the whole thing and need to reindex from scratch.  Also, without
a .work copy, I think searches will fail during part of the htmerge process.
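
To illustrate the safer pattern, here is a minimal sketch of a staged
update: copy the live database directory, run the merge against the copy,
and only swap the copy into place if the merge succeeds. It is a generic
illustration, not ht://Dig's own .work mechanism. The function name is
hypothetical, and `merge` stands for whatever you use to run the merge on a
directory (a wrapper around your htmerge call, say). Note that it needs
room for a second copy of the database while it runs.

```python
import os
import shutil
import tempfile
from typing import Callable


def staged_update(live_dir: str, merge: Callable[[str], None]) -> None:
    """Run `merge` on a copy of `live_dir`, then swap the copy into place."""
    live_dir = os.path.abspath(live_dir)
    parent = os.path.dirname(live_dir)
    # Stage next to the live directory so the renames stay on one filesystem.
    work_dir = tempfile.mkdtemp(prefix="work-", dir=parent)
    staged = os.path.join(work_dir, "db")
    old = os.path.join(work_dir, "old")
    try:
        shutil.copytree(live_dir, staged)
        merge(staged)  # if this raises, the live copy has not been touched
        os.rename(live_dir, old)  # move the live copy out of the way
        try:
            os.rename(staged, live_dir)  # promote the merged copy
        except OSError:
            os.rename(old, live_dir)  # roll back to the previous live copy
            raise
    finally:
        # Removes the staging area, including the old copy after a good swap.
        shutil.rmtree(work_dir, ignore_errors=True)
```

There is still a brief moment between the two renames when the live
directory does not exist, so searches could fail for an instant, but a
failed merge no longer costs you the whole database.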

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930
