According to Ansfried Vergauwe:
> Htdig seems to have problems with DWF files, a file format specially for
> internet, which contains text.
> Why does  htdig refuses to scan these files on our intranet ?

htdig checks the Content-Type header that the web server returns.  It has
internal parsers only for text/html and text/plain, plus it attempts to
parse other text/* files as text/plain.  My guess is the server is tagging
the DWF files as application/(something), which htdig will ignore unless
you define an external converter to handle this type.

See http://www.htdig.org/attrs.html#external_parsers
and http://www.htdig.org/FAQ.html#q4.8  through  4.9

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to