Actually, I just tried that link in the FAQ, and it's
not working.


Anyway, I've attached the rundig.sh script to this
email.
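
In case it helps, the gist of the script is: build the new index into
.work files with -a, then put them in place once the dig is done, so
searches keep using the old db in the meantime. Below is a rough sketch
of just that last step, done as a copy followed by a rename.
install_atomically and the scratch directory are made up for
illustration (they are not part of ht://Dig or the attached script), and
it assumes the temporary copy and the destination sit on the same
filesystem.

```shell
#!/bin/sh
# Sketch only: publish a freshly built index file without a window in which
# a search could see a missing or half-copied database.
# install_atomically is a hypothetical helper, not part of ht://Dig.

install_atomically() {
    src=$1
    dest=$2
    tmp="$dest.new.$$"
    # Copy next to the destination so the final rename stays on one filesystem.
    cp "$src" "$tmp" || { rm -f "$tmp"; return 1; }
    # On the same filesystem, mv is a rename, which replaces the
    # destination in a single step.
    mv -f "$tmp" "$dest"
}

# Demo in a scratch directory standing in for the real db directory.
DEMO_DB=$(mktemp -d)
echo "old index" > "$DEMO_DB/db.docdb"
echo "new index" > "$DEMO_DB/db.docdb.work"

install_atomically "$DEMO_DB/db.docdb.work" "$DEMO_DB/db.docdb"
cat "$DEMO_DB/db.docdb"    # prints: new index
```

In the real script the two paths would be $BASEDIR/db/db.docdb.work and
$BASEDIR/db/db.docdb. The .work file is left alone, since the next
update dig still needs it.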


--- "Anthony E." <[EMAIL PROTECTED]> wrote:
> this is in the FAQ
> http://htdig.sourceforge.net/FAQ.html#q4.21
> 
> --- Daniel Escobar <[EMAIL PROTECTED]> wrote:
> > Hello,
> > 
> > What do you guys do for indexing? I know how to index, but the
> > problem is that when you reindex the db gets deleted, meaning that
> > you cannot search while rundig is getting executed.
> > 
> > I would greatly appreciate any help I can get on this issue.
> > 
> > FYI:  I have to index about 80 websites, meaning that the indexing
> > takes about 4 hours to run.
> > 
> > Thanks,
> > 
> > -Daniel
> > 
> > --
> > Daniel Escobar
> > Software Engineer
> > Cox Radio Interactive
> > 678.860.2749 (cellular)
> > 404.979.7860 (office)
> > 
> > 
> > 
> > 
> > _______________________________________________
> > htdig-general mailing list
> > <[EMAIL PROTECTED]>
> > To unsubscribe, send a message to
> > <[EMAIL PROTECTED]> with a
> > subject of unsubscribe
> > FAQ: http://htdig.sourceforge.net/FAQ.html
> 
> 
> __________________________________________________
> Do You Yahoo!?
> Yahoo! Auctions - buy the things you want at great
> prices
> http://auctions.yahoo.com/
> 


#!/bin/sh

# rundig.sh
# a script to drive ht://Dig updates
# Copyright (c) 1998 Colin Viebrock <[EMAIL PROTECTED]>
# Copyright (c) 1998-1999 Geoff Hutchison <[EMAIL PROTECTED]>

if [ "$1" = "-v" ]; then
    verbose="-v"
fi

# This is the directory where htdig lives
BASEDIR=/usr/local/htdig

# This is the db dir
DBDIR=$BASEDIR/db/

# This is the name of a temporary report file
REPORT=$BASEDIR/htdig.report

# This is who gets the report
REPORT_DEST="[EMAIL PROTECTED]"
export REPORT_DEST

# This is the subject line of the report
SUBJECT="cron: htdig report for domain"

# This is the name of the conf file to use
CONF=htdig.conf

# This is the directory htdig will use for temporary sort files
TMPDIR=/tmp
export TMPDIR

# This is the PATH used by this script. Change it if wc, grep, or mail
#  cannot be found.
PATH=/usr/local/bin:/usr/bin:/bin

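##### Dig phase
# -a makes htdig build into alternate .work files, so searches keep using
# the live database while the dig runs; -s prints statistics when done.
STARTTIME=`date`
echo "Start time: $STARTTIME"
echo "rundig: Start time:   $STARTTIME" > "$REPORT"
$BASEDIR/bin/htdig $verbose -s -a -c "$BASEDIR/conf/$CONF" >> "$REPORT"
TIME=`date`
echo "Done Digging: $TIME"
echo "rundig: Done Digging: $TIME" >> "$REPORT"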

##### Merge Phase
$BASEDIR/bin/htmerge $verbose -s -a -c "$BASEDIR/conf/$CONF" >> "$REPORT"
TIME=`date`
echo "Done Merging: $TIME"
echo "rundig: Done Merging: $TIME" >> "$REPORT"

##### Cleanup Phase
# To enable htnotify or the soundex search, uncomment the following lines
# $BASEDIR/bin/htnotify $verbose >>$REPORT
# $BASEDIR/bin/htfuzzy $verbose soundex

# Move the new files into place. db.wordlist is only needed for update digs,
# and since we always use -a, we just leave it as db.wordlist.work.
# mv $DBDIR/db.wordlist.work $DBDIR/db.wordlist
# The next update dig needs db.docdb.work and searching needs db.docdb,
# so copy this one rather than move it.
cp "$DBDIR/db.docdb.work" "$DBDIR/db.docdb"
# These are generated by htmerge, so we don't want to keep copies of them.
mv "$DBDIR/db.docs.index.work" "$DBDIR/db.docs.index"
mv "$DBDIR/db.words.db.work" "$DBDIR/db.words.db"

END=`date`
echo "End time: $END"
echo "rundig: End time:     $END" >> "$REPORT"
echo

# Grab the important statistics from the report file
# All lines begin with htdig:, htmerge: or rundig:
grep -F "htdig:" "$REPORT"
echo
grep -F "htmerge:" "$REPORT"
echo
grep -F "rundig:" "$REPORT"
echo

# Read from stdin so wc prints only the count, not the file name as well
WC=`wc -l < "$REPORT"`
echo "Total lines in $REPORT:" $WC

# Send out the report ...
mail -s "$SUBJECT - $STARTTIME" "$REPORT_DEST" < "$REPORT"

# ... and clean up
rm "$REPORT"
