----- Original Message -----
From: "Steve Yeazel" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Thursday, April 15, 2004 9:57 PM
Subject: [htdig] please clue me in! doc2html.pl
......
>
> I've confirmed that pdf2html.pl and pdftotext both work from the command
> line.
>
> doc2html.pl just spits out garbage in between the html tags when I try to
> convert a pdf on the command line with it.
doc2html.pl should be calling pdf2html.pl, and so should produce the same
output as pdf2html.pl.
If you are trying:
    doc2html.pl filename.pdf application/pdf
and you don't get good output, then you need to check how you have
configured doc2html.pl.
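One way to check this is to run both scripts on the same file and compare
what they print. The following is only a rough sketch: the function name
compare_converters is made up for illustration, and it assumes that both
scripts live in the same directory and that pdf2html.pl accepts the file
name as its only argument.

```shell
#!/bin/sh
# Hypothetical helper (not part of ht://Dig or the doc2html package).
# Usage: compare_converters /path/to/scripts some.pdf
compare_converters() {
    scripts_dir=$1
    pdf=$2

    # Output of the PDF converter called directly (assumed invocation).
    direct=$("$scripts_dir/pdf2html.pl" "$pdf" 2>/dev/null)
    # Output of the wrapper, called with file name and MIME type.
    wrapped=$("$scripts_dir/doc2html.pl" "$pdf" application/pdf 2>/dev/null)

    if [ -z "$direct" ]; then
        echo "pdf2html.pl produced no output"
    elif [ "$direct" = "$wrapped" ]; then
        echo "same output"
    else
        echo "output differs: check the doc2html.pl configuration"
    fi
}
```

If the two outputs differ, the problem is most likely in how doc2html.pl
has been set up rather than in pdf2html.pl or pdftotext.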
>
> I have the following line in htdig.conf:
> external_parsers: application/pdf->text/html
> /path/to/convertor/htdig/scripts/doc2html.pl
>
> I've also tried to call pdf2html.pl directly in the conf file to no avail.
> Any ideas??? Am I missing some config. somewhere? I'm not getting any
> errors in the doc2html log file so I don't know where to look...
If pdf2html.pl works OK on the command line then you should find that
external_parsers: application/pdf->text/html
/path/to/convertor/htdig/scripts/pdf2html.pl
works. Did you get anything in the doc2html log file? If not, then you
haven't configured doc2html.pl to produce one.
David Adams
University of Southampton