According to Luna, John: > I want to try out a new linux box we have running Apache and Htdig. > I am using is Caldera OpenLinux 3.1. > > I believe htdig needs a parser to index PDF files. ... > external_parsers: application/msword /usr/local/bin/parse_doc.pl \ > application/postscript /usr/local/bin/parse_doc.pl \ > application/pdf /usr/local/bin/parse_doc.pl ... > The web site will not be using .doc, .exl or other > formats, just .html amf Adobe .pdf. > I then re-ran rundig. > > While rundig is running, I get the message > > PDF PARSING IS NOT SUPPORTED FOR SECURITY REASONS.
I've never seen this message before in htdig, pdftotext or parse_doc.pl, so one of these must have bee modified to give this message. I suggest you look into where this message was added and why. If the concern was about potential security problems in external parser support in 3.1.5 and older, you may want to upgrade to the 3.1.6 snapshot in http://www.htdig.org/files/snapshots/, which does away with the popen() call that makes some people overly jittery. I also strongly recommend using an external converter like conv_doc.pl or doc2html.pl, rather than parse_doc.pl. (See http://www.htdig.org/FAQ.html#q4.9) -- Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

