Gary Hargrave wrote:
wget does not seem to handle relative links in web pages of the form http:page3.htmlAccording to my understanding of rfc1808 this is a valid URL. When recursively retrieving html pages wget ignores these links with out displaying an error or warning.
Well, I am sure it is wrong URL, but took some time till I pinpoint it
in RFC1808. Otherwise it would be very difficult to code URL parser.
What you actually try to convince us is that you can omit the
net-location (i.e. usually comes in the middle) and still be able to
tell the location. Then how do you interpret http:program.com ?
Is it a site program in TLD com, or a .com (DOS executable) file served
who knows why via http?
So one of the places this is discussed in RFC1808 is:
4. Resolving Relative URLs
...
Step 2b): If the embedded URL starts with a scheme name, it is
interpreted as an *absolute* URL and we are done.
BTW, did you try to click in your browser on that link?
Kalin.
--
||///_ o *****************************
||//,_/> WWW: http://ThinRope.net/
|||\ <" mobile: +81 (90) 6265-0856
|||\\ ' NetPager: [EMAIL PROTECTED]
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
