> Doug: > > My understanding is that the command line magic that you need to pass to > pdftotext directly is a bit too cumbersome to fit into htdig.conf. Also, > from a memory standpoint, my experience with this was it was better for the > pdftotext program to write to a text/html temp file and then have htdig read > off of this vs. an "in-line" conversion and passthrough which caused > repeated failures for me. > If you would just like to try something different than doc2html.ph, I can > recommend a script which was written by Stefan Nehlsen (I think, I believe > that Martin Allert gave me a copy which he may well have made a lot of > changes to) called parsepdf.pl which I have found effective. I think that > it is in the contributed works section although I should probably make sure > that is true. > Good Luck, > Tim
Thanks for the information, Tim. I wasn't trying to avoid doc2html.pl per se. Rather I wanted to avoid using any additional software. I want to be able to make the ht-Dig installation as compact and self-contained as possible. The pdftotext executable I'm using now depends on the locations of dynamic libraries. I plan to re-compile it later with static libraries to eliminate that dependency and then build it into the same directories with the ht-Dig software so that it will be easier to move. If it's necessary to use ancillary software like doc2html.pl or prohibitively difficult not to, then doc2html.pl is as good as anything else, at least so far as I know. Doug ======== Douglas Kline [EMAIL PROTECTED] ------------------------------------------------------- This SF.Net email is sponsored by: SourceForge.net Broadband Sign-up now for SourceForge Broadband and get the fastest 6.0/768 connection for only $19.95/mo for the first 3 months! http://ads.osdn.com/?ad_id=2562&alloc_id=6184&op=click _______________________________________________ ht://Dig general mailing list: <[EMAIL PROTECTED]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

