Hello all,

I'm having a problem indexing pdf files.  The htdig phase seems to work
fine, no errors are produced, but when the htmerge phase is run, this error
always shows up:

Deleted, no excerpt: 17/http://svr-newlix/products/technical/faq.pdf

I'm not really sure how to go about fixing this problem.  Here's what I have
in my configuration file:

external_parsers: application/msword->text/html
/usr/local/htdig/bin/conv_doc.pl \
               application/postscript->text/html
/usr/local/htdig/bin/conv_doc.pl \
               application/pdf->text/html /usr/local/htdig/bin/conv_doc.pl

I was trying to use the parse_doc.pl script instead of the conv_doc.pl
script for a little while, but I kept getting many errors about acroread not
showing up, and how the pdf files could not be repaired.

Any help with how to fix this would be greatly appreciated.

Thanks,

-matt


------------------------
Matthew R. MacIntyre
Webmaster, Newlix Corporation
http://www.newlix.com
Tel: 613.225.0516
Fax: 613.225.5625


------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.

Reply via email to