Steve Bankowitz's bits of Mon, 12 Nov 2001 translated to:

>How do I add a new link to be indexed without indexing the whole 10000
>other files?
>
>If I create a brand new `/var/tmp/sort.txt` file with:
>
>  http://www.mydomain.com/item.rs?ID=10001
>
>The resultant database file will overwrite the other 10000 entries.
>How do I merge them together as one database?  I tried using `htmerge
>-m`, but I don't think that is what I want.  (Or is it?)  I also tried
>using `htdig -a` in my `rundig` script, but all that did was just made a
>backup of the database files first.

Assuming valid dates of last modification are returned by these
URL's, it should only be necessary to add the new URL. In this
case, htdig should be run *without* the -i option. This should
result in new pages being indexed and added to the database and
modified pages being reindexed.

If the URL's do not return valid dates of last modification, you
need to run htdig/htmerge on the new URL's only, and then run
htmerge on the main database using the -m option to merge in the
new databases.

Btw, the -a does not 'backup' database files. It is used to
specify that alternate work files (.work) be used rather than
modifying the original databases in place. This allows an easy
way to perform maintenance while the original databases are in
active use.


Jim


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to