At 1:08 PM -0500 9/6/99, Frank Martini wrote:
>If the start.url document is append to while htdig is digging, then will
>htdig see the changes?

It will see the changes after the next run. Once it's indexed a 
document on a run, it won't come back.

>Can htmerge be run while htdig is running? If not, then how do I get
>htmerge to stop htdig and restart it where it left off.

Do *not* run htmerge at the same time as htdig! If you want to do 
stop-and-restart, I'd either check out the -l option: 
http://www.htdig.org/htdig.html

or, I'd use server_max_docs and/or max_hopcount to index a small set 
of documents, then expand the list later.

>If the start.url is constantly being appended to, when will htdig go back
>and check previously-indexed sites?

The next time you run it.

>It seems that htdig remembers all sites it has indexed (even once they
>are removed from start.url). If that's the case, then how do I get it to
>stop indexing a site. And then how do I get it to remove that site's
>information from the dbs?

Removing a URL that exists from the db is not realy possible at the 
moment. If a URL is deleted, it will be removed from the db 
automatically.

>I suppose I can kludge together my own solutions to these issues (like
>creating a start.url with only one entry, running htdig then htmerge (or

The start_url attribute can contain multiple URLs. Heck, you could 
write up a file of URLs and include that as the start_url.

-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word unsubscribe in
the SUBJECT of the message.

Reply via email to