The doc2html.pl parser works on the principle of extracting the document as raw text, then converting that text into HTML. No HTML document is created on the hard disk, The HTML output is fed straight into htdig without being stored physically.
I experienced some teething problems with doc2html.pl, which came down to some file ownership permissions, and possibly filepaths in the related pdftotext.pl and pdf2html.pl files (or whatever they're called). Make sure that these related files are also correct, as htdig may be using doc2html.pl to parse the pdf documents, but the actual xPDF translator isn't being called correctly, so htdig doesn't understand the content that it is receiving. Hope this helps. Rupert -- Outgoing mail is certified Virus Free. Checked by AVG Anti-Virus (http://www.grisoft.com). Version: 7.0.230 / Virus Database: 262.8.2 - Release Date: 15/04/2004 ------------------------------------------------------- This SF.Net email is sponsored by: IBM Linux Tutorials Free Linux tutorial presented by Daniel Robbins, President and CEO of GenToo technologies. Learn everything from fundamentals to system administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click _______________________________________________ ht://Dig general mailing list: <[EMAIL PROTECTED]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

