According to Chad Phillips:
> I installed a snapshot of 2.4 last month and am having problems indexing
> pdf files.  Running rundig -v I get this error on pdf's:
> 40:32:2:http://grahamcenter.aafp.org/PreBuilt/20001002a.pdf:  not Parsable
> 
> In my htdig.conf file I have:
> pdf_parser: /usr/local/Acrobat4/bin/acroread -toPostScript -pairs
> 
> When I index the site with 3.15 it works fine.  Any ideas?

Yes, 3.1.5 supports pdf_parser, but the 3.2 betas don't.  The reason?
pdf_parser is obsolete.  See http://www.htdig.org/FAQ.html#q4.9
as well as questions 5.2 and 1.13.  Acrobat 4 is quite unreliable for
this purpose, so we recommend an external converter based on xpdf's
pdftotext.

If you really want to stick to Acrobat, you may have better luck with
version 3, but if you want to use it with 3.2.0b4, you'll need to get
a copy of acroconv.pl from http://www.htdig.org/files/contrib/parsers/

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to