> If you can have them on the same line, that's fine too, but you may have
> more lines; for example:
>
> external_parsers: \
>     application/rtf->text/html /usr/local/bin/doc2html.pl \
>     text/rtf->text/html /usr/local/bin/doc2html.pl \
>     application/pdf->text/html /usr/local/bin/doc2html.pl \
>
> You may want to install doc2html.pl:
>
> http://www.htdig.org/FAQ.html#q4.9
>
> > Now I'm getting a different error. It's finding the pdftotext command but
> > outputting the help text you get if you don't give it any arguments. Evidently
> > it isn't being passed the file to be converted from pdf to text format. The
> > documentation on external_parsers in the Configuration file format --
> > Attributes Web page doesn't seem to deal with passing arguments that refer to
> > the pages being indexed. Yet even if it's passing the file to be converted the
> > following arguments would vary from one converter to another and so there must
> > be some way to indicate them in the htdig.conf file. The documentation says
> > you can include arguments if you quote the whole command string but how do I
> > indicate the file to be converted and where should the output of that command
> > go? The documentation also says, "Unless it is an external converter, which
> > will output a document of a different content-type, then its output must follow
> > the format described here." I'm guessing that my case here is one of the
> > external converters and the output doesn't have to conform to that format.
> > The documentation also says, "If the second type is user-defined, then it's up
> > to the converter script to put out a "Content-Type: type" header followed by a
> > blank line, to indicate to htdig what type it should expect for the output,
> > much like what a CGI script would do." Is this a user-defined second type?
> > I'm guessing that it isn't since it's plain text?

Is it necessary to install doc2html.pl and call it in the htdig.conf rather
than calling pdftotext directly? There have been examples of htdig.conf files
with direct reference to pdftotext. There must be some way of passing
arguments to the external parser in htdig.conf.

TIA.
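To make the question concrete, here is a minimal sketch of the kind of wrapper
I imagine would be needed if pdftotext can't be called directly. It rests on
assumptions I have not confirmed: that htdig passes the name of the file to be
converted as the first argument (any further arguments are ignored here), and
that the converted text is expected on standard output. The script name
pdf2text_wrapper.py and the function names are my own invention; the only part
I'm sure of is that pdftotext writes to standard output when "-" is given as
the output file.

```python
#!/usr/bin/env python3
"""Hypothetical wrapper (pdf2text_wrapper.py) around pdftotext for htdig."""
import subprocess
import sys


def build_command(infile):
    # "pdftotext <input> -" sends the extracted text to standard output.
    return ["pdftotext", infile, "-"]


def main(argv):
    # Assumption: htdig supplies the file to convert as the first argument.
    # Any extra arguments it may pass are deliberately ignored in this sketch.
    if len(argv) < 2:
        sys.stderr.write("usage: pdf2text_wrapper.py infile [other args]\n")
        return 1
    # Run pdftotext and hand its exit status back to the caller.
    return subprocess.call(build_command(argv[1]))


# When installed as a script, the file would end with:
#     sys.exit(main(sys.argv))
```

If those assumptions hold, I would expect the htdig.conf entry to look
something like "application/pdf->text/plain /usr/local/bin/pdf2text_wrapper.py",
where the path is only an example.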
Regards,
Douglas
========
Douglas Kline
[EMAIL PROTECTED]

