Re: AW: [htdig] from a newbie-running htdig and pdf parser

Gilles Detillieux Fri, 11 Jan 2002 11:46:22 -0800

According to Geoff Hutchison:
> On Fri, 11 Jan 2002, Stratmann, T-Systems CSM, MD wrote:
> > pdf_parser: /usr/bin/pdftotext -
> 
> No. The pdf_parser attribute is depreciated--it initially was designed to
> use Adobe Acrobat to translate PDF files to PS and parse the result. The
> current approach is much more flexible and reliable.
> 
> > I think I need a variable in the command to specify the pdf-file.
> > I couldn't get the right information from the FAQ.
> 
> Try <http://www.htdig.org/FAQ.html#q4.9>


Which is exactly the FAQ entry to which I referred John Luna at the start
of this thread.  That entry details the proper way of using pdftotext
with htdig, with links to the relevant sources for scripts and the
xpdf package.

Also, the documentation for pdf_parser in attrs.html clearly states
that: "only Adobe's acroread program has been tested as a pdf_parser."
The FAQ and other documentation will only help if you actually follow
the advice it gives instead of going off on your own untested tangents.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Re: AW: [htdig] from a newbie-running htdig and pdf parser

Reply via email to