According to Geoff Hutchison:
> At 12:40 PM +0100 9/6/00, Klaus Gr�ger wrote:
> >external_parsers: application/pdf /usr/share/htdig/parse_doc.pl
>
> You may get better results with the new conv_doc.pl converter, but
> this should work.
>
> >htdig -i -v 5 -c /etc/htdig/htdig.conf produces the following output:
>
> N.B. The -v 5 flag does not correspond to -vvvvv.
The bare "5" argument will cause htdig to stop scanning for options,
so it won't see the -c option. If /etc/htdig/htdig.conf is not your
compiled-in default config file name, then it won't be reading that
file. Changing the -v 5 to -vvvvv would be the first thing to try.
If /etc/htdig/htdig.conf is your default, then I assume you're running
one of the RPM distributions of htdig. Which one did you grab, and on
which platform are you running it? Did you make sure, by checking the
README.RPMS.txt on the web site, that you installed the right one?
> >New server: hobbes.noc.abacom.net, 80
> >0:0:0:http://hobbes.noc.abacom.net/: --+++-+------- size = 4300
> >1:1:1:http://hobbes.noc.abacom.net/rasa.pdf: PDF::parse: cannot find
> >pdf parser /usr/local/bin/acroread
> > size = 207983
>
> Did you check your config file to make sure it doesn't have
> pdf_parser: set in it somewhere?
external_parsers should override pdf_parser, so setting the latter
shouldn't be a problem. Rather, it would seem htdig is not seeing the
external_parsers setting, either because it's not reading the config
file you think it is, or because of some sort of formatting error in
your config file.
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives: <http://www.htdig.org/mail/menu.html>
FAQ: <http://www.htdig.org/FAQ.html>