Re: [iText-questions] How to check if a PDF is OCR recognized

1T3XT BVBA Tue, 19 Jul 2011 06:25:16 -0700

On 19/07/2011 14:58, Bernhard Haslinger wrote:
> Is there a way to check with the iText library whether an existing PDF has
> an OCR layer or not?
iText can extract plain text from a PDF, provided that the text is stored 
as real text and not merely as an image of text.
- If you parse your PDF with iText and no text comes out, the PDF 
doesn't have an OCR layer.
- If you parse your PDF with iText and most pages yield text, it 
probably has an OCR layer.
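
The two rules above could be sketched roughly as follows. This is only an illustration, not an iText feature: the class OcrLayerCheck, its methods, and the "more than half of the pages" threshold are assumptions made for this example. It works on a list of per-page strings, which in iText 5.x you would typically obtain by calling PdfTextExtractor.getTextFromPage(reader, i) for each page of a PdfReader. Note that a text layer may also come from a PDF that was created digitally, so the result is a hint rather than proof of OCR.

```java
import java.util.Arrays;
import java.util.List;

// Sketch only: class name, method names, and the 0.5 threshold are
// illustrative assumptions and are not part of the iText API.
class OcrLayerCheck {

    enum Verdict { NO_TEXT_LAYER, PROBABLY_HAS_TEXT_LAYER, INCONCLUSIVE }

    // Fraction of pages whose extracted text has any non-whitespace character.
    static double fractionWithText(List<String> pageTexts) {
        if (pageTexts.isEmpty()) {
            return 0.0;
        }
        int withText = 0;
        for (String text : pageTexts) {
            if (text != null && !text.trim().isEmpty()) {
                withText++;
            }
        }
        return (double) withText / pageTexts.size();
    }

    // No text at all: no text layer. Text on most pages: probably a text layer.
    static Verdict classify(List<String> pageTexts) {
        double fraction = fractionWithText(pageTexts);
        if (fraction == 0.0) {
            return Verdict.NO_TEXT_LAYER;
        }
        if (fraction > 0.5) {
            return Verdict.PROBABLY_HAS_TEXT_LAYER;
        }
        return Verdict.INCONCLUSIVE;
    }

    public static void main(String[] args) {
        // Stand-in for text extracted page by page from a real PDF.
        List<String> pages = Arrays.asList("Invoice 2011-07-19", "", "Total: 42 EUR");
        System.out.println(classify(pages)); // 2 of 3 pages contain text
    }
}
```

The INCONCLUSIVE case covers documents where only a few pages yield text, for example a scan with a typed cover page; how to treat those is a judgment call for your application.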
Hope this helps.


