What application did you use to save the pdf as a .html file.

Dave
At 02:51 PM 7/29/2004, you wrote:

Has anyone successfully converted the 9/11 commission report to Plucker format? I tried http://plkr.org/pdf2pl.pl and get "Zero Sized Reply" errors. Also, the JakeWalk.de pdaConverter doesn't seem to work either.

I was able to download the original .pdf file, save it as a .html file, and "plucked" that. I also saved it as a Microsoft Word .doc file, and used Word to convert it to MS-HTML (certainly not valid HTML) and tried that as well. It is readible, though some things like tables and the Table of Contents aren't linkable items, they are present, and readible. The final file is 2,040,569 bytes with inline images. With external images (tap to see full-size image in Plucker), the doc is 4,173,460 bytes using zlib + 16bpp images.


If anyone is _really_ interested in this, I'm sure I can take some time and make the TOC linkable and clean up the other bits.. but that'll take some time to do (my schedule is very busy of late).

The reason the pdf2pl.pl fails, is because the original upstream .pdf file is password-protected, and prohibits content extraction with the tools I'm using. Removing the password allows it to work properly.

Also, when you remove the password from it (by trashing the portion of the header that includes it), you can then open it in Adobe's desktop product and select "File -> Reduce document size" to get it down to a 3.3MB pdf file. From there, you can manipulate it with various methods as necessary.


d. _______________________________________________ plucker-list mailing list [EMAIL PROTECTED] http://lists.rubberchicken.org/mailman/listinfo/plucker-list


_______________________________________________
plucker-list mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list

Reply via email to