At 1:08 PM -0500 9/6/99, Frank Martini wrote:
>If the start.url document is appended to while htdig is digging, then will
>htdig see the changes?
It will see the changes on the next run. Once htdig has indexed a
document during a run, it won't come back to it in that run.
>Can htmerge be run while htdig is running? If not, then how do I get
>htmerge to stop htdig and restart it where it left off?
Do *not* run htmerge at the same time as htdig! If you want
stop-and-restart, I'd either check out the -l option:
http://www.htdig.org/htdig.html
or use server_max_docs and/or max_hopcount to index a small set
of documents, then expand the list later.
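As a rough sketch of the second approach, a config fragment for a small
first pass might look like the one below. The numbers are made up for
illustration, not recommendations; raise them on later runs to widen
the crawl.

```
# htdig.conf fragment (values are illustrative assumptions)
# Follow links at most 2 hops away from the start URLs.
max_hopcount:    2
# Index at most 100 documents from any one server.
server_max_docs: 100
```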
>If the start.url is constantly being appended to, when will htdig go back
>and check previously-indexed sites?
The next time you run it.
>It seems that htdig remembers all sites it has indexed (even once they
>are removed from start.url). If that's the case, then how do I get it to
>stop indexing a site? And then how do I get it to remove that site's
>information from the dbs?
Removing a URL that still exists from the db is not really possible at
the moment. If the document at a URL is deleted, it will be removed
from the db automatically.
>I suppose I can kludge together my own solutions to these issues (like
>creating a start.url with only one entry, running htdig then htmerge (or
The start_url attribute can contain multiple URLs. Heck, you could
write up a file of URLs and include that as the start_url.
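Something along these lines should work, assuming your version supports
including a file's contents by wrapping the filename in backquotes. The
URLs and the path are placeholders, so substitute your own, and use one
form or the other rather than both.

```
# htdig.conf fragment (URLs and path are placeholders)
# Option 1: list several URLs directly, separated by whitespace.
start_url: http://www.example.com/ http://www.example.org/

# Option 2: read the URLs from a file, named inside backquotes.
# start_url: `/path/to/start.url`
```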
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/