On Mit, 17 M�r 1999, Geoff Hutchison wrote:
>>start_url:    http://www.suse.com/Mailinglists/suse-informix
>>
>>doesn't see this subdir. The index file there however contians
>>all the links...
>
>My best suggestion is to run htdig and add '-vvv' to your command line.
>This will generate a pile of data, but it should give you each HTTP header
>as well as some explanation of why it rejects links.
>
>I ran it through the URL test code I've been using as I add support for
>multiple services. It parsed it OK, so any problem is occurring earlier in
>parsing--for example the HTML parser may decide those href tags aren't any
>good. The debugging output will tell us more.
>
>-Geoff
>

I may be wrong, but AFAIK the colon is a special character in an URL
which is normally used to include username/password into FTP URLs
(i.e. "ftp://user:password@host/directory").
If used in another context, colons must be encoded.

regs,
  Torsten


--
InWise - Wirtschaftlich-Wissenschaftlicher Internet Service GmbH
Waldhofstra�e 14                            Tel: +49-4101-403605
D-25474 Ellerbek                            Fax: +49-4101-403606
E-Mail: [EMAIL PROTECTED]            Internet: http://www.inwise.de

------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.

Reply via email to