Geoff, see responses below.
-----Original Message-----
From: Geoff Hutchison [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, June 13, 2001 3:50 PM
To: Augeri, Jim (NM75)
Cc: '[EMAIL PROTECTED]'
Subject: Re: [htdig] Novice questions...
On Wed, 13 Jun 2001, Augeri, Jim (NM75) wrote:
> before that time. The ".../rundig" script is launched (in "out-of-the-box"
> configuration) as a cron job at 0130 each day (hey, I said I was new at
> this!).
> 1. Have the other documents gone away? Emphatically, NO. They are all
> still in the same places they were on days one and two.
>
> 2. Has the htdig.conf file changed? No again.
So my first question is whether you've looked at the databases
themselves--do they still look like they're the same size as when you
first ran it?
---Given that there are only 3 days of activity thus far... But in any case,
---yes, they all seem to be around the same size. That is as reported by our
---backup system, which also runs every day.
---db.docdb =~ 12.5MB
---db.docs.index =~ 400KB (2nd day @ 392KB)
---db.urls =~ 2.16MB
---db.wordlist =~ 21.2MB
---db.words.db =~ 20.4MB
---(is this all of them?)
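For comparing the database sizes from one day to the next without going through the backup reports, something like the following might do. It is only a rough sketch: the function names, the "db." prefix filter, and the 50% shrink threshold are assumptions for illustration, not anything ht://Dig provides.

```python
import os


def db_sizes(db_dir, prefix="db."):
    """Return {file name: size in bytes} for regular files starting with prefix."""
    sizes = {}
    for name in sorted(os.listdir(db_dir)):
        path = os.path.join(db_dir, name)
        if name.startswith(prefix) and os.path.isfile(path):
            sizes[name] = os.path.getsize(path)
    return sizes


def shrunk(before, after, fraction=0.5):
    """List files that vanished or fell below `fraction` of their earlier size.

    The 0.5 default is an arbitrary illustrative threshold.
    """
    return [name for name, old in before.items()
            if after.get(name, 0) < old * fraction]
```

Saving the result of db_sizes() after each run and passing two days' results to shrunk() would flag a database that was truncated or removed overnight.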
More specifically, depending on how many pages you index and your setup,
you may not want to run a cron job every day--if it takes more than 24
hours at some point, then the next day it will start up and clobber
everything!
---The "rundig" only takes about 10 minutes to run at 0130 in the morning.
---That's why I felt secure running it as a cron job daily.
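If a run ever did grow past 24 hours, one way to keep the next cron job from starting on top of it is a lock file. The sketch below is an illustration only, assuming a Unix system with Python available; run_exclusive and the lock file path are hypothetical names, and nothing like this ships with rundig as far as this thread shows.

```python
import fcntl
import subprocess


def run_exclusive(lock_path, command):
    """Run command only if no other process holds the lock on lock_path.

    Returns the command's exit code, or None if the run was skipped
    because a previous run still holds the lock.
    """
    with open(lock_path, "w") as lock_file:
        try:
            # Non-blocking exclusive lock: fails at once if already held.
            fcntl.flock(lock_file, fcntl.LOCK_EX | fcntl.LOCK_NB)
        except BlockingIOError:
            return None
        # The lock is released when lock_file is closed on leaving the block.
        return subprocess.call(command)
```

The cron entry would then call a small script that invokes run_exclusive() with a lock file of your choosing and the full path to your rundig script, so a second start simply exits instead of clobbering the databases.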
Beyond that, I'd consider how much free disk space you have--the htmerge
phase can take quite a bit when sorting the wordlist, especially if you
have a big database. If the sorting runs out of space, strange things can
happen.
---The entire /apache tree, which includes the htdig workspace, is on a
---Solstice DiskSuite device with 17GB of total space, with 10GB of it still
---free (37% utilization).
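The free-space concern could also be checked automatically before each run. This is a minimal sketch, assuming Python 3 is available; free_space_ok is a hypothetical helper, and the threshold is left to the caller because the thread does not say how much working space the htmerge sort actually needs.

```python
import shutil


def free_space_ok(path, min_free_bytes):
    """Return True if the filesystem holding path has at least min_free_bytes free."""
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
    return usage.free >= min_free_bytes
```

For example, free_space_ok("/apache", 2 * 1024**3) would ask for 2GB of headroom on the /apache tree mentioned above; the 2GB figure is an arbitrary placeholder, not a measured requirement.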
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/