On Mon, 16 Jul 2001, Nico De Ranter wrote:

> I'm using htdig 3.1.5 to index our intranet servers. Unfortunately
> it takes about 2 days for 1 indexation run to complete :-(. Is there
> any way I can selectively update part of the database without having
> to recreate everything from scratch?

Well, there are a few things. First off, if one of the servers you're
indexing happens to be on the same machine, you probably want to use
local_urls to retrieve it from the filesystem (less network traffic,
faster indexing, etc.):

<http://www.htdig.org/attrs.html#local_urls>
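
As a rough sketch, assuming the server is http://intranet.example.com/
and its document root is /var/www/html/ (both made up for illustration),
the entry in your htdig config file would look something like this:

```
# Hypothetical host and document root; substitute your own.
# Format: URL prefix = local path prefix
local_urls: http://intranet.example.com/=/var/www/html/
```

URLs starting with that prefix are then read from the matching path on
disk instead of being fetched over HTTP.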

Secondly, if you're doing all your indexing from scratch, you're
definitely not doing it as fast as you could be. If there are already
databases, htdig will check the URLs quickly and only download and reindex
changed or new documents. For an example of how to do this, try my version
of the rundig script:

<http://www.htdig.org/files/contrib/scripts/rundig.sh>
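
The difference between the two kinds of run comes down to htdig's -i
flag, which erases the existing databases before digging. Here is a
minimal sketch of both, which is not the contents of the script linked
above. The function names and the RUN dry-run hook are my own inventions
for illustration:

```shell
#!/bin/sh
# Sketch only. RUN is a hypothetical dry-run hook: set RUN=echo to print
# the commands instead of executing them.
RUN=${RUN:-}

# Update run: no -i, so htdig reuses the existing databases and only
# fetches documents that are new or have changed.
update_index() {
    $RUN htdig -c "$1" || return 1
    $RUN htmerge -c "$1"
}

# Full run: -i tells htdig to discard the old databases and start over.
full_index() {
    $RUN htdig -i -c "$1" || return 1
    $RUN htmerge -c "$1"
}
```

You would call it as "update_index /path/to/htdig.conf" for the regular
runs and keep full_index for the occasional rebuild.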

Beyond that, you can also use htmerge's -m flag to merge another database
into your main one. So if there are only a few URLs you'd like to add, you
can index those into a separate database and merge them in:

<http://www.htdig.org/htmerge.html>
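
Roughly, that could look like the sketch below. It assumes a second
config file (hypothetical) with its own database_dir and a start_url
listing only the new URLs. The function name and the RUN dry-run hook are
made up for illustration, and the intermediate htmerge pass on the small
database is my assumption about the usual sequence, so check it against
the htmerge page above:

```shell
#!/bin/sh
# Sketch only. RUN is a hypothetical dry-run hook: set RUN=echo to print
# the commands instead of executing them.
RUN=${RUN:-}

# $1: main config file
# $2: config file for the small extra database (hypothetical; it needs
#     its own database_dir and a start_url with just the new URLs)
merge_extra() {
    main=$1
    extra=$2
    # Build the small database from scratch.
    $RUN htdig -i -c "$extra" || return 1
    # Assumption: the extra database gets its own htmerge pass first.
    $RUN htmerge -c "$extra" || return 1
    # Merge the extra database into the main one.
    $RUN htmerge -c "$main" -m "$extra"
}
```

Called as "merge_extra main.conf extra.conf", this only digs the handful
of new URLs, so it should finish far sooner than a full run.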

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

