Certainly, a limit_urls_to value of /foo/bar/ would restrict your 
indexing to those parts of your servers. But there are two caveats 
here:

First, how common is "/foo/bar" in URLs on other servers? If a URL 
from another server matches that path, it will be indexed too. 
Second, unless you only want to index the index.html pages, don't 
include index.html in the pattern--a URL must match one of the 
limit_urls_to patterns *exactly*, index.html included. So if you 
used the pattern you mentioned, then:

http://www.foo.com/path/foo/bar/title.html

would not be indexed--it doesn't match the pattern.
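
To make the two caveats concrete, here is a rough Python sketch. It 
is not ht://Dig code: it assumes each limit_urls_to pattern is tested 
as a plain substring of the URL, with multiple patterns ORed 
together. The function name is_allowed and the other.example.org host 
are made up for illustration, and the patterns are written without 
the leading "*".

```python
def is_allowed(url, patterns):
    """Return True if the URL contains at least one pattern as a literal substring.

    Assumption: this models limit_urls_to as a simple substring test,
    with a logical OR across the listed patterns.
    """
    return any(pattern in url for pattern in patterns)


urls = [
    "http://www.foo.com/path/foo/bar/index.html",
    "http://www.foo.com/path/foo/bar/title.html",
    "http://other.example.org/foo/bar/page.html",  # hypothetical outside server
]

broad = ["/foo/bar/"]                               # path only
narrow = ["/foo/bar/index.html"]                    # path plus index.html
host_scoped = ["http://www.foo.com/path/foo/bar/"]  # host plus path

# Caveat 1: a path-only pattern also admits the outside server.
broad_hits = [u for u in urls if is_allowed(u, broad)]

# Caveat 2: adding index.html drops title.html on your own server.
narrow_hits = [u for u in urls if is_allowed(u, narrow)]

# Including the host keeps both local pages and excludes the outside server.
host_hits = [u for u in urls if is_allowed(u, host_scoped)]

for label, hits in (("broad", broad_hits), ("narrow", narrow_hits), ("host", host_hits)):
    print(label, hits)
```

Under that assumption, the path-only pattern lets the outside server 
through, the index.html pattern loses title.html, and a pattern that 
starts with your own server name avoids both problems.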
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/


At 5:08 PM -0400 7/19/01, Downey, Michael {10-6~Indianapolis} wrote:
>Geoff,
>
>Thanks for your help. In light of the logical OR, would it seem that
>
>    */foo/bar/index.html
>
>would work as well? My problem is I want to avoid spidering other 
>servers that these index.html pages link to. Again, I appreicate 
>your assistance. Thanks!
>
>Best Regards,
>Michael Downey
