At 7:49 PM -0700 10/30/01, Jim Cole wrote:
>If you can't fix the site (or have it fixed), you might want to
>try using the exclude_urls attribute to limit what htdig tries
>to index.

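Jim is (once again) right on the money. Beyond exclude_urls, you can
also use max_hop_count to cap how many links deep htdig will follow
from the start URL. That cuts off an infinite loop, but it will still
leave the problem of duplicate URLs: the same page can be indexed
under several different URLs before the limit is reached.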

Normally when I index a website for the first time, I'll use 
max_hop_count to make sure there aren't any weird loops and then go 
from there.
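
As a rough sketch, the relevant lines of an htdig.conf for that first
pass might look something like the fragment below. The start URL, the
excluded patterns, and the hop limit are made-up values for
illustration, not taken from the site in question, so adjust them to
whatever is actually looping.

```
# Hypothetical first-pass settings; all values are placeholders.
start_url:      http://www.example.com/

# Skip any URL containing one of these substrings
# ("/calendar/" stands in for whatever part of the site loops).
exclude_urls:   /cgi-bin/ .cgi /calendar/

# Stop following links more than this many hops from start_url.
max_hop_count:  4
```

Once the index looks sane, max_hop_count can be raised or removed and
exclude_urls tightened to cover just the problem URLs.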

-- 
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
