The latest version of Xpdf (V3.04) contains a command line tool pdftohtml. If your linux system does not have the latest version there are precompiled binary downloads and source available from the authors at:
http://www.foolabs.com/xpdf/download.html There are also precompiled binaries for the command line tools only (not the viewer) for Windows and Mac systems. I found that the pdftotext tool with the "-table" option does quite well in extracting text from the PoOP PDF, including the instruction tables like Appendix B. I have not yet tried the pdftohtml tool as I did not yet feel the need. HTH Peter -----Original Message----- From: IBM Mainframe Assembler List [mailto:[email protected]] On Behalf Of Martin Packer Sent: Sunday, November 16, 2014 4:49 AM To: [email protected] Subject: Re: Redesigning the Principles of Operation Manual PDF to HTML probably isn't impossible. But consider: It would probably be crappy HTML at best. The copyright owner of the material. The difficulty of keeping the result up to date. Now, does anyone know of tooling to turn PDF into (even bad) HTML? Cheers, Martin -- This message and any attachments are intended only for the use of the addressee and may contain information that is privileged and confidential. If the reader of the message is not the intended recipient or an authorized representative of the intended recipient, you are hereby notified that any dissemination of this communication is strictly prohibited. If you have received this communication in error, please notify us immediately by e-mail and delete the message and any attachments from your system.
