Re: [EXTERNAL] How to set the page segmentation for TIKA python

Chris Mattmann Wed, 13 Nov 2019 20:32:42 -0800

Hi Aswathi,


Please check with [email protected].

 

Cheers,

Chris


From: Aswathi Nambiar <[email protected]>
Date: Wednesday, November 13, 2019 at 7:39 AM
To: "Mattmann, Chris A (US 1760)" <[email protected]>
Subject: [EXTERNAL] How to set the page segmentation for TIKA python

 

Hi Chris, 

 

I am using Apache Tika OCR from Python and trying to extract text from an image
with parser.from_file. According to the documentation, the default page
segmentation mode (psm) is 1. How do I change the psm to 6 from Python?

I couldn’t find any documentation for this. I can only find it for Java, at the
link below.

https://cwiki.apache.org/confluence/display/tika/TikaOCR

 

Regards,

Aswathi Nambiar
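
One possible approach to the question above, offered as a sketch and not as a
confirmed answer from this thread: tika-python's parser.from_file accepts a
headers argument that is forwarded to the Tika server, and the server maps
headers prefixed with X-Tika-OCR onto Tesseract OCR settings. The header name
X-Tika-OCRPageSegMode used here is an assumption based on that convention, and
the helper names ocr_headers and extract_text are hypothetical. Please verify
against your Tika server version.

```python
def ocr_headers(psm=6):
    """Build request headers asking the Tika server to use a given Tesseract psm.

    The header name is an assumption (X-Tika-OCR prefix + PageSegMode);
    check it against the Tika server version you run.
    """
    # Tesseract page segmentation modes range from 0 to 13.
    if not 0 <= int(psm) <= 13:
        raise ValueError("psm must be between 0 and 13")
    return {"X-Tika-OCRPageSegMode": str(psm)}


def extract_text(image_path, psm=6):
    """Run OCR on an image through tika-python with the requested psm."""
    # Imported lazily so ocr_headers works without tika-python installed.
    from tika import parser

    parsed = parser.from_file(image_path, headers=ocr_headers(psm))
    return parsed.get("content")
```

Calling extract_text("scan.png", psm=6) would then send the image to the Tika
server with the override attached. If the header is ignored by your server
version, a server-side tika-config file is the other place to look.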

 


