Reviewing the output from -vvvv I see this line:

Header line: Content-Type: text/plain

The .doc parser is triggered by a content-type of application/msword.

Using Firefox with the web developer extensions installed (or using telnet and manually sending an HTTP request) you can see what content-type headers Apache is really sending for Word documents. If it is not sending application/msword (and it appears that it is not), then the doc2html.pl script will not be triggered...

Let us know if that is the problem. Note that depending on your version of Apache, how to configure it to send the right header varies... check the documentation to find out how to do it.

Good luck!

Ted

On Jan 3, 2006, at 7:58 PM, brian read wrote:

Ted

Thanks for the suggestions...

Make sure the path to doc2html.pl is right and that htdig has permission to launch it.

Tried that - widened the premissions to 0777 just in case.

Make sure you have a carriage return at the end of the .conf file. I recall finding that if I didn't have the carriage return, some things failed (but that might just have been me not understanding what was _really_ going on...)

That looks ok

Make sure that doc2html.pl is also properly configured. If I'm not mistaken, there are a few variables that need to be set in order for it to work properly (but considering you've run it manually, I would assume it is).

Agreed

Make sure that the max_doc_size (or something like that) is significantly larger than your largest document (like twice the size). This will definitely keep htdig from indexing the documents if it is too small and looking at your config file, it appears to be set at the default of 100000. I've got mine set to 24 MB since we have people completely unaware of what file size is all about adding things to the web site.

I set this to 24000000, but still no luck.


--
Cheers

Brian

http://www.abandonmicrosoft.co.uk



-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
ht://Dig general mailing list: <htdig-general@lists.sourceforge.net>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to