According to Joe R. Jah:
> > According to Ute Wehner@home:
> > > WElcher pdf-Parser steht in letzter Nachricht
> 
>  ~= Which is the latest pdf-Parser?

Thanks, Joe.  The latest external converter for PDFs is doc2html.pl
in http://www.htdig.org/files/contrib/parsers/.  There's a slightly
updated version of doc2html in the contrib/doc2html subdirectory of the
latest 3.2.0b4 snapshots (in http://www.htdig.org/files/snapshots/),
but the only difference is the addition of a line to decode hex-encoded
characters in the file names so they don't show up as hex encoded in
the title (if the file name is used in the title).
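For anyone wondering what that decoding step amounts to, here is a rough sketch. It is written in Python for illustration only and is not the actual line from doc2html.pl (which is a Perl script); the function name title_from_path and the sample path are made up for the example.

```python
from urllib.parse import unquote


def title_from_path(path: str) -> str:
    """Build a readable title from the last component of a URL path.

    Hypothetical helper: takes the file name after the final '/' and
    turns %XX hex escapes back into the characters they stand for.
    """
    name = path.rsplit("/", 1)[-1]  # keep only the file name
    return unquote(name)            # e.g. "%20" becomes a space


if __name__ == "__main__":
    print(title_from_path("/docs/Annual%20Report%202001.pdf"))
```

Without the decoding step, a title taken from that file name would show up with the "%20" escapes left in it.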

Don't be fooled by the more recent modification date on parsepdf.pl.
It may be more recent, but it's older technology.  External parsers
are rendered more or less obsolete by external converter support
introduced in 3.1.4.  parsepdf.pl does have a nice enhancement for
supporting links to specific pages of PDFs, but it requires other
programs on your web server to support this.
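
As a rough sketch of what using the external converter support looks like, an entry along these lines goes in htdig.conf. The install path is only an assumption; point it at wherever you put doc2html.pl, and check the script's own comments for any setup it needs.

```
# htdig.conf fragment (3.1.4 or later)
# The "->text/html" part marks this as an external converter: htdig
# runs the script on each PDF and then indexes the HTML it produces.
# The path below is an assumption; adjust it for your system.
external_parsers: application/pdf->text/html /usr/local/bin/doc2html.pl
```

An old-style external parser entry has no "->" part, just the content type followed by the program, and the program has to do the parsing itself and hand htdig its results directly.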

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
