On Mon, Jul 16, 2001 at 10:58:30AM -0400, Geoff Hutchison wrote:
> On Mon, 16 Jul 2001, Nico De Ranter wrote:
>
> > I'm using htdig 3.1.5 to index our intranet servers. Unfortunately
> > it takes about 2 days for 1 indexation run to complete :-(. Is there
> > any way I can selectively update part of the database without having
> > to recreate everything from scratch?
>
> Well there are a few things. First off, if one of the servers you're
> indexing happens to be on the same machine, you probably want to use
> local_urls to retrieve it from the filesystem (less network traffic,
> faster indexing, etc.):
>
> <http://www.htdig.org/attrs.html#local_urls>
>
> Secondly, if you're doing all your indexing from scratch, you're
> definitely not doing it as fast as you could be--if there are already
> databases, htdig will check the URLs quickly and only download and reindex
> changed or new documents. For an example of how to do this, try my version
> of the rundig script:
>
> <http://www.htdig.org/files/contrib/scripts/rundig.sh>
>
> Beyond that, you can also use the -m flag from htmerge to merge in another
> database. So if there are only a few URLs you'd like to add, you can index
> those and merge them in:
>
> <http://www.htdig.org/htmerge.html>
That's exactly what I was looking for. Thanks!
Nico
>
> --
> -Geoff Hutchison
> Williams Students Online
> http://wso.williams.edu/
>
---------------------------------------------------------
"It has been said that there are only two businesses that
refer to customers as users: illegal drug trade and
the computer industry."
---------------------------------------------------------
Nico De Ranter
Sony Service Center (SDCE/NEE-B)
Sint Stevens Woluwestraat 55 (Rue de Woluwe-Saint-Etienne)
1130 Brussel (Bruxelles), Belgium, Europe, Earth
Telephone: +32 2 724 86 41 Telefax: +32 2 726 26 86
e-mail: [EMAIL PROTECTED]
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html