At 8:16 PM -0500 11/30/00, David Gewirtz wrote:
>So, first question: is it possible to REMOVE a site and it's
>associated URLs from a database without reinitializing?
There is no easy way of removing a URL (much less a site) in 3.1
without reindexing. That said, read on.
>One thought was to index each site at a time and check it out. But
>that'll take forever. Another thought was to index all the sites,
>but if one seems crappy, remove it from the start_url set, do an
>htdig -i, and clean out the database. But that'll require us to
>bring down the database for a re-index time and once the server goes
>live, that's not really acceptable.
You don't need to run with htdig -i. Personally, I usually do all my
runs with -a so that updates don't affect the "live" DB and so that I
have a backup in case things go south. (For an example, see the
rundig.sh script, e.g. <http://www.htdig.org/files/contrib/scripts/>.)
If you're using htdig -a, then you can just remove the .work
databases before starting and it clearly will be reindexing from
scratch.
(N.B. The -i flag just means that htdig deletes the databases before
going and the -a flag just appends a .work file before reading
anything in or starting.)
Does this sound like a slightly better solution?
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/
------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives: <http://www.htdig.org/mail/menu.html>
FAQ: <http://www.htdig.org/FAQ.html>