I want to try out a new Linux box we have running Apache and htdig. I am using Caldera OpenLinux 3.1.
I believe htdig needs a parser to index PDF files. While logged in as root, I inserted the following into /etc/htdig/htdig.conf:

    external_parsers: application/msword /usr/local/bin/parse_doc.pl \
                      application/postscript /usr/local/bin/parse_doc.pl \
                      application/pdf /usr/local/bin/parse_doc.pl

and modified the following in parse_doc.pl:

    # $CATDOC = "/usr/local/bin/catdoc";
    # $CATXLS = "/usr/local/bin/xlHtml";
    # $CATWP = "/bin/true";
    # $CATRTF = "/bin/true";
    $CATPS = "/usr/bin/ps2ascii";
    $CATPDF = "/usr/bin/pdftotext";

The web site will not be using .doc, .xls or other formats, just .html and Adobe .pdf.

I then re-ran rundig. While rundig is running, I get the message

    PDF PARSING IS NOT SUPPORTED FOR SECURITY REASONS.

It then continues and works fine. Have I overlooked the permissions?

-J

