RE: [htdig] please clue me in! doc2html.pl

Rupert Jones Fri, 16 Apr 2004 02:14:34 -0700

The doc2html.pl parser works on the principle of extracting the document as
raw text, then converting that text into HTML. No HTML document is created
on the hard disk, The HTML output is fed straight into htdig without being
stored physically.


I experienced some teething problems with doc2html.pl, which came down to
some file ownership permissions, and possibly filepaths in the related
pdftotext.pl and pdf2html.pl files (or whatever they're called). Make sure
that these related files are also correct, as htdig may be using doc2html.pl
to parse the pdf documents, but the actual xPDF translator isn't being
called correctly, so htdig doesn't understand the content that it is
receiving.

Hope this helps.

Rupert

-- 
Outgoing mail is certified Virus Free.
Checked by AVG Anti-Virus (http://www.grisoft.com).
Version: 7.0.230 / Virus Database: 262.8.2 - Release Date: 15/04/2004
 



-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

RE: [htdig] please clue me in! doc2html.pl

Reply via email to