image in pdf question

Manfred Pock Mon, 17 Aug 2015 00:08:21 -0700

The Pdfboxversion is the 2.0 trunk Version.

For performance reason we render Pdf's with one picture over the wholepage (scanned pdf's) at our own. (about 2 sec faster)The other pdf's we will render it with pdfbox. We check differentattributes from the page-resoureces (ShadingNames, ExtGSNames,PatternNames, PropetiesNames, ColorSpaceNames) and the Count and Size ofthe Picture (larger then the Mediabox). But we don't the check thefontnames from the resources because we have ocr (unvisible) text on thepdf-page to search in the page.

Now we have an pdf where is a an size-filled background-image and sometext overlayed. We detect this page as scanned page and so we justrender the picture.

Would there be a better solution to check/detect that an pdf-page is anscanned pdf-page with no attitional text?


regarts, Manfred

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

image in pdf question

Reply via email to