According to Ansfried Vergauwe:
> Htdig seems to have problems with DWF files, a file format specially for
> internet, which contains text.
> Why does htdig refuses to scan these files on our intranet ?
htdig checks the Content-Type header that the web server returns. It has
internal parsers only for text/html and text/plain, plus it attempts to
parse other text/* files as text/plain. My guess is the server is tagging
the DWF files as application/(something), which htdig will ignore unless
you define an external converter to handle this type.
See http://www.htdig.org/attrs.html#external_parsers
and http://www.htdig.org/FAQ.html#q4.8 through 4.9
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html