OK, I've Googled this one till my brain hurts and got nothing... time to seek the higher wisdom.

I get large PDF files from publishers to index, which I do by running them through a few bash scripts and then working with the printed output. I have found a way to do everything via bash, but lately the file sizes are getting bigger and bigger (the latest was over 500 MB!) and it takes forever to open and print these -- not to mention paging through them if I need to find something.

The images are of no use to me, so an easy way to compress the files would be to eliminate the images, but as far as I can tell there is no simple way to remove all the images at once from a PDF file, while keeping the text and page layout. Have I missed something obvious, or is this really the case? If so, [insert profane expression of incredulity here]!
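For what it's worth, the one candidate I've seen mentioned but haven't been able to verify is Ghostscript's -dFILTERIMAGE switch (added around version 9.23, if I have that right), which is supposed to drop all raster images while keeping the text and layout. A sketch of what that would look like as a bash helper (the function name and file names are just placeholders):

```shell
#!/usr/bin/env bash
# Sketch only: drop every raster image from a PDF, keeping text/layout.
# Assumes a Ghostscript new enough to support -dFILTERIMAGE (9.23+).
strip_images() {
    local in="$1" out="$2"
    gs -q -dNOPAUSE -dBATCH -dSAFER \
       -sDEVICE=pdfwrite \
       -dFILTERIMAGE \
       -sOutputFile="$out" "$in"
}
```

Usage would be something like: strip_images big.pdf text-only.pdf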

The second-best option is to reduce the quality of the images to a bare minimum, but so far the only way I can find to do this is to use a Windows system, open the file in Adobe Acrobat, go to the Print dialog, change the settings and print the whole thing to another PDF file with minimal image quality. It's a pain and it takes forever.

Any ideas? There are various suggestions on the web about using Ghostscript, ImageMagick, ps2ps and so on, but everything I've tried so far seems to make the resulting file larger instead of smaller.
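To be concrete, the sort of Ghostscript invocation the web suggests is roughly the following (sketched as a bash helper; the function name and file names are placeholders). As I understand it, -dPDFSETTINGS=/screen is the most aggressive built-in preset, downsampling images to around 72 dpi -- and leaving that flag out may be why some recipes produce a *larger* file:

```shell
#!/usr/bin/env bash
# Sketch only: recompress a PDF with heavily downsampled images via
# Ghostscript's pdfwrite device. /screen is the lowest-quality preset.
shrink_pdf() {
    local in="$1" out="$2"
    gs -q -dNOPAUSE -dBATCH -dSAFER \
       -sDEVICE=pdfwrite \
       -dCompatibilityLevel=1.4 \
       -dPDFSETTINGS=/screen \
       -sOutputFile="$out" "$in"
}
```

Usage would be something like: shrink_pdf big.pdf small.pdf -- but on my files this kind of thing has so far gone the wrong way, hence the question.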

I'm doing this quite often, so a bash script would be useful. I can also probably make sense of Python, but anything beyond that might be a stretch.

Thanks in advance,

Jon.
--
SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/
Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html
