Doug:

My understanding is that the command line magic that you need to pass to
pdftotext directly is a bit too cumbersome to fit into htdig.conf.  Also,
from a memory standpoint, my experience with this was it was better for the
pdftotext program to write to a text/html temp file and then have htdig read
off of this vs. an "in-line" conversion and passthrough which caused
repeated failures for me.
If you would just like to try something different than doc2html.ph, I can
recommend a script which was written by Stefan Nehlsen (I think, I believe
that Martin Allert gave me a copy which he may well have made a lot of
changes to) called parsepdf.pl which I have found effective.  I think that
it is in the contributed works section although I should probably make sure
that is true.
Good Luck,
Tim



-------------------------------------------------------
This SF.Net email is sponsored by: SourceForge.net Broadband
Sign-up now for SourceForge Broadband and get the fastest
6.0/768 connection for only $19.95/mo for the first 3 months!
http://ads.osdn.com/?ad_id=2562&alloc_id=6184&op=click
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to