According to Aidan S Jones:
> Thanks for the response...
>
> I'm running 3.1.5.
> I added the index.html file to my path - no luck in resolving my difficulty.
> When running ./htdig -vvv, I recv. the following results:
...
> 0:0:0:http://nvasun4.raleigh.basf-corp.com/mfg/search/specifications/: Retrieval
> command for http://nvasun4.raleigh.basf-corp.com/mfg/search/specifications/: GE
> T /mfg/search/specifications/ HTTP/1.0
> User-Agent: htdig/3.1.5 ([EMAIL PROTECTED])
> Host: nvasun4.raleigh.basf-corp.com
>
> Header line: HTTP/1.1 200 ok
> Header line: Server: Netscape-Enterprise/3.6 SP2
> Header line: Date: Tue, 25 Sep 2001 23:32:28 GMT
> Header line: Content-Language: en
> Header line: Connection: close
> Header line:
> returnStatus = 0
> Read 8124 from document
> Read a total of 8124 bytes
> "" not a recognized type. Assuming text
> size = 8124
> pick: nvasun4.raleigh.basf-corp.com, # servers = 1
>
> The only line that does not make sense to me is: "" not a recognized type.
> Assuming text size = 8124.
Well, it seems your web server didn't return a Content-Type header for
this URL, so htdig is working with an empty string for the type. This
seems like a server bug to me. Section 7.2.1 of the HTTP 1.0 spec states:
Any HTTP/1.0 message containing an entity body should include a
Content-Type header field defining the media type of that body. If
and only if the media type is not given by a Content-Type header, as
is the case for Simple-Response messages, the recipient may attempt
to guess the media type via inspection of its content and/or the
name extension(s) of the URL used to identify the resource. If the
media type remains unknown, the recipient should treat it as type
"application/octet-stream".
Simple-Response messages have no headers at all. My understanding is
that if the server gives any headers at all, the Content-Type header
is mandatory.
See http://www.w3.org/Protocols/HTTP/1.0/spec.html#BodyType
Because htdig is assuming text/plain, rather than text/html, it's not
finding any HTML links in the document fetched for this directory.
If you can fix/upgrade/replace your web server, that should be your
first course of action. If you can't do that, you may need to patch
htdig/Document.cc to handle this situation and assume text/html in
this case.
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html