Most external parsers used for PDF files are based on the pdftotext utility which is bundled with xpdf, and as the name implies, this outputs plain text.

The pdf2html.pl Perl script that comes with doc2html wraps HTML code around this, but nothing fancier than that. It would certainly be possible to add code to this which scanned the text for www.some.site and http://other.site strings and made them into links. However, this fall a lot short of what you are asking for.

There is a PDF to HTML utility bundled in with xpdf. I havn't looked at the latest version, does anyone know if it does what Robert wants?

David Adams
Corporate Information Services
Information Systems Services
University of Southampton

----- Original Message ----- From: "Robert Isaac" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Wednesday, December 01, 2004 5:08 PM
Subject: [htdig] pdf files



I use htdig 3.1.6 on a Cobalt RaQ550. I know that documents linked from html pages are scanned by htdig, but what about a pdf file that has hyperlinks to other pdf files. Will these files be scanned?

Thanks

Bob

Robert Isaac
Director & Internet Manager, Volvo Owners Club
All email messages are virus scanned before being sent
PLEASE INCLUDE ALL PREVIOUS MESSAGE TEXT WITH REPLY

Club web site: www.volvoclub.org.uk

Also visit: www.trisaac.com for
John Wayne Collectors Plates
Roil Products
Neways International





-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now. http://productguide.itmanagersjournal.com/
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general





-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now. http://productguide.itmanagersjournal.com/
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to