>I see this as doing exactly what it's supposed to do.
Yes, on re-reading, I agreee that it does do exactly what the manual
documentation says. I was misled by this comment in the htdig.conf file:
# This makes sure that we don't spider the web
local_urls_only: true
In fact, local_urls_only has nothing to do with spidering the web -
rampant spider prevention is the job of start_url and limit_urls_to.
No matter what you set local_urls_only to be, htdig won't spider outside
the limits set by start_url and limit_urls_to. So the comment is wrong,
and it misled me into thinking that a "local" URL was one on the local
machine even via HTTP, which is not true.
I offer the better comment:
# This means we won't try HTTP if we can't get the file out of the
# local file system. (Setting this to true will disable all CGI.)
local_urls_only: true
Now that I know that it is safe to turn this to "true" without causing
URLs outside my machine to be spidered, I'm happy.
The term "local" is perhaps unfortunate to distinguish "in the file
system" from "via the HTTP server", since both are "local" with respect
to items on "remote" servers, and I think we are most used to thinking
in of local/remote as being "my server" / "external server", not "in
the file system" / "via an HTTP server". I was certainly confused by
the term, and the htdig.conf file writer apparently was too!
--
-IAN! Ian! D. Allen Ottawa, Ontario, Canada [EMAIL PROTECTED]
Home Page on the Ottawa FreeNet: http://www.ncf.ca/~aa610/
College professor at: http://www.algonquincollege.com/~alleni/
Board Member, TeleCommunities CANADA http://www.tc.ca/
-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html