Reviewing the output from -vvvv I see this line:
Header line: Content-Type: text/plain
The .doc parser is triggered by a content-type of application/msword.
Using Firefox with the web developer extensions installed (or using
telnet and manually sending an HTTP request) you can see what
content-type headers Apache is really sending for Word documents. If it
is not sending application/msword (and it appears that it is not), then
the doc2html.pl script will not be triggered...
Let us know if that is the problem. Note that depending on your version
of Apache, how to configure it to send the right header varies... check
the documentation to find out how to do it.
Good luck!
Ted
On Jan 3, 2006, at 7:58 PM, brian read wrote:
Ted
Thanks for the suggestions...
Make sure the path to doc2html.pl is right and that htdig has
permission to launch it.
Tried that - widened the premissions to 0777 just in case.
Make sure you have a carriage return at the end of the .conf file. I
recall finding that if I didn't have the carriage return, some things
failed (but that might just have been me not understanding what was
_really_ going on...)
That looks ok
Make sure that doc2html.pl is also properly configured. If I'm not
mistaken, there are a few variables that need to be set in order for
it to work properly (but considering you've run it manually, I would
assume it is).
Agreed
Make sure that the max_doc_size (or something like that) is
significantly larger than your largest document (like twice the
size). This will definitely keep htdig from indexing the documents if
it is too small and looking at your config file, it appears to be set
at the default of 100000. I've got mine set to 24 MB since we have
people completely unaware of what file size is all about adding
things to the web site.
I set this to 24000000, but still no luck.
--
Cheers
Brian
http://www.abandonmicrosoft.co.uk
-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems? Stop! Download the new AJAX search engine that makes
searching your log files as easy as surfing the web. DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
ht://Dig general mailing list: <htdig-general@lists.sourceforge.net>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general