On 19/07/2011 14:58, Bernhard Haslinger wrote:
> Is there a way to check with the iText library if an existing PDF has an OCR
> layer or not?

iText can parse PDFs into plain text, provided that the text doesn't consist of images.

- If you use iText to parse your PDFs and there's no text, then the PDF doesn't have an OCR layer.
- If you use iText to parse your PDFs and most pages have text, then it probably has an OCR layer.

Hope this helps.
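The two rules above can be sketched roughly as follows. This is only an illustration, not an iText feature: the class OcrLayerHeuristic, its Verdict enum, and the reading of "most pages" as "more than half" are all assumptions made for the example. The decision logic is kept free of iText so it compiles on its own; the comment shows how the per-page strings could be collected with iText 5's PdfReader and PdfTextExtractor. Note that extracted text alone cannot tell an OCR layer apart from text in a PDF that was never scanned, hence "probably".

```java
import java.util.List;

// Hypothetical helper (not part of iText): applies the "no text" / "most pages
// have text" rules to strings that were already extracted page by page.
//
// With iText 5 the per-page strings could be collected like this (not compiled here):
//   PdfReader reader = new PdfReader("scan.pdf");
//   for (int i = 1; i <= reader.getNumberOfPages(); i++)
//       pageTexts.add(PdfTextExtractor.getTextFromPage(reader, i));
//   reader.close();
final class OcrLayerHeuristic {

    enum Verdict { NO_TEXT_LAYER, PROBABLY_TEXT_LAYER, INCONCLUSIVE }

    // Counts pages whose extracted text has at least one non-whitespace character.
    static int pagesWithText(List<String> pageTexts) {
        int count = 0;
        for (String text : pageTexts) {
            if (text != null && !text.trim().isEmpty()) {
                count++;
            }
        }
        return count;
    }

    static Verdict classify(List<String> pageTexts) {
        int withText = pagesWithText(pageTexts);
        if (withText == 0) {
            // Nothing could be extracted: the pages are images only.
            return Verdict.NO_TEXT_LAYER;
        }
        // Assumption: "most pages" is taken to mean more than half of them.
        if (2 * withText > pageTexts.size()) {
            return Verdict.PROBABLY_TEXT_LAYER;
        }
        // Only a few pages carry text; the rules above do not cover this case.
        return Verdict.INCONCLUSIVE;
    }
}
```

Whitespace-only results are treated as "no text" here, since an empty page would otherwise count as having a text layer.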
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions