At 7:49 PM -0700 10/30/01, Jim Cole wrote:
>If you can't fix the site (or have it fixed), you might want to
>try using the exclude_urls attribute to limit what htdig tries
>to index.

Jim is (once again) right on the money. Beyond exclude_urls, you can also use max_hop_count to limit infinite loops, though this will still leave the problem of duplicate URLs. Normally when I index a website for the first time, I'll use max_hop_count to make sure there aren't any weird loops and then go from there.

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
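
For reference, the two attributes discussed above might be set in the htdig configuration file roughly as follows. This is only a sketch: the /calendar/ pattern and the hop count of 4 are made-up values for illustration, not settings taken from this thread, so substitute whatever matches the looping part of your own site.

```
# Skip any URL containing one of these substrings.
# /cgi-bin/ and .cgi are common choices; /calendar/ is a hypothetical
# example of a section that generates endless links.
exclude_urls:   /cgi-bin/ .cgi /calendar/

# Follow links no more than this many hops away from start_url.
# The value 4 is an assumption chosen for a cautious first run.
max_hop_count:  4
```

A low max_hop_count keeps a first indexing run from wandering into a loop, and it can be raised once exclude_urls covers the trouble spots. As noted above, it does not remove duplicate URLs by itself.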

