----- Original Message -----
From: "Steve Yeazel" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Thursday, April 15, 2004 9:57 PM
Subject: [htdig] please clue me in! doc2html.pl
......

>
> I've confirmed that pdf2html.pl and pdftotext both work from the command
> line.
>
> doc2html.pl just spits out garbage in between the html tags when I try to
> convert a pdf on the command line with it.

doc2html.pl should be calling pdf2html.pl and so should produce the same
output as pdf2html.pl.
If you are trying:

    doc2html.pl  filename.pdf  application/pdf

and you don't get good output, then you need to check how you have configured
doc2html.pl.
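
One way to check this is to run both scripts on the same file and compare
what they print. The following is only a rough sketch: the function name
compare_converters is made up for illustration, the script paths in the
example call are placeholders, and it assumes pdf2html.pl accepts the same
filename and MIME type arguments as doc2html.pl.

```shell
#!/bin/sh
# Sketch: run two converter commands on the same input and report whether
# their output is identical. Paths and argument lists are assumptions;
# adjust them to match your own installation.

compare_converters() {
    # $1 = first converter, $2 = second converter
    # $3 = input file,      $4 = MIME type
    out_a=$(mktemp) || return 2
    out_b=$(mktemp) || return 2

    # Capture standard output of each converter; discard error messages.
    "$1" "$3" "$4" > "$out_a" 2>/dev/null
    "$2" "$3" "$4" > "$out_b" 2>/dev/null

    if cmp -s "$out_a" "$out_b"; then
        echo "same output"
        status=0
    else
        echo "output differs"
        # Show only the first few differing lines.
        diff "$out_a" "$out_b" | head -n 20
        status=1
    fi

    rm -f "$out_a" "$out_b"
    return "$status"
}

# Example call (placeholder paths):
# compare_converters ./doc2html.pl ./pdf2html.pl filename.pdf application/pdf
```

If the two outputs differ, the problem is most likely in how doc2html.pl has
been configured rather than in pdf2html.pl or pdftotext.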

>
> I have the following line in htdig.conf:
> external_parsers: application/pdf->text/html
> /path/to/convertor/htdig/scripts/doc2html.pl
>
> I've also tried to call pdf2html.pl directly in the conf file to no avail.
> Any ideas???  Am I missing some config. somewhere?  I'm not getting any
> errors in the doc2html log file so I don't know where to look...

If pdf2html.pl works OK on the command line then you should find that

    external_parsers: application/pdf->text/html /path/to/convertor/htdig/scripts/pdf2html.pl

works.  Did you get anything in the doc2html log file?  If not, then you
haven't configured doc2html.pl to produce one.

David Adams
University of Southampton



_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general
