undertruck wrote:
> Hi,
>
> I want to retrieve following information from a PDF file using Zend_pdf
> module. I was wondering if it's possible at all. I didn't see any such
> references in api documentation.
>
> 0. Total no of pages with individual page sizes
> 1. PDF print size and bounding box dimensions
> 2. Colorspace (RGB/CMYK) used in PDF
> 3. To confirm that font is either embedded or outlined
> 4. Total no of objects and their list along with their types
> 5. For all the image objects :
>    a) height and width in pixels
>    b) DPI stored in header
>    c) colorspace
>    d) bit depth
>    e) x and y position in PDF
>    f) Is image grayscaled ?
> 6. List of all ICC profiles like dot gains
> 7. Save pages in jpg (highest resolution image object should visible as
> its original size)
> 8. Save pages as thumbnails
> 9. If crop box is present, get its x-y location in PDF and size
>
> I've tried (though not extensively) fpdf, tcpdf, imagemagick (convert and
> identify tools) but none of them give me complete information. For some
> reasons, I couldn't use Imagick PHP class on my host. So Zend_pdf seems the
> last option and I don't see anything useful regarding 'reading' PDFs. pdflib
> is also not an option as I'm looking for free solutions.
>
> Any help on this would be appreciated. Thanks.

I'm not sure it covers every item you list, but this looks like a job for pdftk:
http://www.accesspdf.com/pdftk/

--
Bruno Friedmann
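
For item 0 (page count and page sizes), one possible approach is to run `pdftk file.pdf dump_data` and parse its `Key: value` output. The sketch below is in Python rather than PHP and rests on assumptions: the `PageMediaNumber` and `PageMediaDimensions` fields appear only in later pdftk releases, so an older build may report just `NumberOfPages`. The helper names `dump_data` and `parse_dump_data` are made up for this illustration and are not part of pdftk.

```python
import subprocess


def dump_data(pdf_path):
    """Run `pdftk <file> dump_data` and return its text output.

    Assumes the pdftk binary is installed and on PATH.
    """
    result = subprocess.run(
        ["pdftk", pdf_path, "dump_data"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def parse_dump_data(text):
    """Extract the page count and per-page sizes (in points) from dump_data text.

    Assumption: page sizes are reported as PageMediaNumber /
    PageMediaDimensions pairs, which older pdftk versions do not emit.
    """
    info = {"pages": None, "page_sizes": {}}
    current_page = None
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue  # marker lines such as "PageMediaBegin" carry no value
        key, value = key.strip(), value.strip()
        if key == "NumberOfPages":
            info["pages"] = int(value)
        elif key == "PageMediaNumber":
            current_page = int(value)
        elif key == "PageMediaDimensions" and current_page is not None:
            width, height = (float(v) for v in value.split())
            info["page_sizes"][current_page] = (width, height)
    return info
```

Calling `parse_dump_data(dump_data("input.pdf"))` would then give a dictionary with the page count and a page-number-to-size mapping. This only addresses item 0; the image, colorspace, and rendering items on the list would need other tools.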
