On Wed, 17 Mar 1999, Geoff Hutchison wrote:
>>start_url: http://www.suse.com/Mailinglists/suse-informix
>>
>>doesn't see this subdir. The index file there, however, contains
>>all the links...
>
>My best suggestion is to run htdig and add '-vvv' to your command line.
>This will generate a pile of data, but it should give you each HTTP header
>as well as some explanation of why it rejects links.
>
>I ran it through the URL test code I've been using as I add support for
>multiple services. It parsed it OK, so any problem is occurring earlier in
>parsing--for example the HTML parser may decide those href tags aren't any
>good. The debugging output will tell us more.
>
>-Geoff
>
I may be wrong, but AFAIK the colon is a reserved character in a URL.
It is normally used to embed a username and password in FTP URLs
(e.g. "ftp://user:password@host/directory").
If used in another context, colons must be encoded (as "%3A").
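
To illustrate the two uses of the colon described above, here is a minimal
sketch in Python. The choice of language and the file name "msg:0001.html"
are assumptions made purely for illustration; neither comes from the site
in question or from htdig itself.

```python
from urllib.parse import quote, urlsplit

# The FTP form quoted above: the colon separates username and password.
parts = urlsplit("ftp://user:password@host/directory")
print(parts.username, parts.password, parts.hostname, parts.path)

# Hypothetical path segment containing a colon (not taken from the thread).
segment = "msg:0001.html"

# Percent-encode the segment; safe="" means no reserved character is left as-is.
encoded = quote(segment, safe="")
print(encoded)
```

If the links in the index file contain bare colons, comparing them with
their encoded form like this may help narrow down whether the parser is
rejecting them for that reason.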
Regards,
Torsten
--
InWise - Wirtschaftlich-Wissenschaftlicher Internet Service GmbH
Waldhofstraße 14 Tel: +49-4101-403605
D-25474 Ellerbek Fax: +49-4101-403606
E-Mail: [EMAIL PROTECTED] Internet: http://www.inwise.de