[tesseract-ocr] Re: Checking if Searchable or Image Only PDF

Tom Morris Tue, 10 May 2016 08:58:46 -0700

On Tuesday, May 10, 2016 at 7:34:58 AM UTC-4, Robert Williams wrote:
>
>
> Within code - is it possible to check if a PDF is already "searchable"? 
>
> We get documents from a third party and want to search for keywords - 
> don't want to be running an OCR over files that are already searchable.
>


Sure, but it doesn't have anything to do with OCR. If you can't figure it 
out from the documentation for whatever PDF toolkit you're using, you 
should ask in their support forum.

Tom 

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/215e3aec-c597-4a91-b61f-d6df3c59f4da%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[tesseract-ocr] Re: Checking if Searchable or Image Only PDF

Reply via email to