I want to try out a new linux box we have running Apache and Htdig. 
I am using is Caldera OpenLinux 3.1. 

I believe htdig needs a parser to index PDF files.

While logged on as root-

I have inserted:

external_parsers: application/msword /usr/local/bin/parse_doc.pl \
application/postscript /usr/local/bin/parse_doc.pl \
application/pdf /usr/local/bin/parse_doc.pl

into /etc/htdig/htdig.conf.

Add modified the following to parse_doc_pl

# $CATDOC = "/usr/local/bin/catdoc";
# $CATXLS = "/usr/local/bin/xlHtml";
# $CATWP = "/bin/true";
# $CATRTF = "/bin/true";
$CATPS = "/usr/bin/ps2ascii";
$CATPDF = "/usr/bin/pdftotext";

The web site will not be using .doc, .exl or other
formats, just .html amf Adobe .pdf.
I then re-ran rundig.

While rundig is running, I get the message

PDF PARSING IS NOT SUPPORTED FOR SECURITY REASONS.

It then continues on and works fine.
Have I overlooked the permissions?

-J


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to