I have htdig working on a RedHat7.1 box, together with doc2html and it's
associated converters. This all works to plan when I tell htdig to crawl the
website using http://127.0.0.1 or similar, but if I use
file:///var/www/html, then the DOC PDF XLS files do not index. I presume
that the problem is that htdig does not determine a MIME type when used this
way, and all of the scripts try to treat it as text/plain.
Is there any way around this, as I would prefer using file:/// rather than
http://127.0.0.1 to take a load of the web server.
Thanks for any advice
Marc
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html