The latest version of Xpdf (V3.04) contains a command line tool pdftohtml.  If 
your linux system does not have the latest version there are precompiled binary 
downloads and source available from the authors at:

http://www.foolabs.com/xpdf/download.html

There are also precompiled binaries for the command line tools only (not the 
viewer) for Windows and Mac systems.  I found that the pdftotext tool with the 
"-table" option does quite well in extracting text from the PoOP PDF, including 
the instruction tables like Appendix B.

I have not yet tried the pdftohtml tool as I did not yet feel the need.

HTH

Peter

-----Original Message-----
From: IBM Mainframe Assembler List [mailto:[email protected]] On 
Behalf Of Martin Packer
Sent: Sunday, November 16, 2014 4:49 AM
To: [email protected]
Subject: Re: Redesigning the Principles of Operation Manual

PDF to HTML probably isn't impossible. But consider:

It would probably be crappy HTML at best.
The copyright owner of the material.
The difficulty of keeping the result up to date.

Now, does anyone know of tooling to turn PDF into (even bad) HTML?

Cheers, Martin
--

This message and any attachments are intended only for the use of the addressee 
and may contain information that is privileged and confidential. If the reader 
of the message is not the intended recipient or an authorized representative of 
the intended recipient, you are hereby notified that any dissemination of this 
communication is strictly prohibited. If you have received this communication 
in error, please notify us immediately by e-mail and delete the message and any 
attachments from your system.

Reply via email to