Hi Aswathi,

 

Please check with dev@tika.apache.org.
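
In the meantime, one approach that may be worth trying from Python is to pass
the OCR setting as a request header. This is only a minimal sketch under
assumptions that are not confirmed in this thread: that Tika server maps an
X-Tika-OCRPageSegMode header onto the pageSegMode option of the Tesseract
config described on the TikaOCR wiki page, and that parser.from_file accepts a
headers argument. The helper names build_ocr_headers and extract_text are
hypothetical.

```python
def build_ocr_headers(psm):
    """Build request headers asking Tika server for a Tesseract page segmentation mode.

    Assumption: the server translates X-Tika-OCRPageSegMode into the
    pageSegMode setting of its Tesseract OCR config.
    """
    psm = int(psm)
    # Tesseract documents page segmentation modes 0 through 13.
    if not 0 <= psm <= 13:
        raise ValueError("psm must be between 0 and 13, got %d" % psm)
    return {"X-Tika-OCRPageSegMode": str(psm)}


def extract_text(image_path, psm=6):
    """Send the image to Tika and return the extracted text (may be None)."""
    # Imported here so build_ocr_headers can be used without tika installed.
    from tika import parser

    parsed = parser.from_file(image_path, headers=build_ocr_headers(psm))
    return parsed.get("content")
```

Calling extract_text("scan.png", psm=6) would then request psm 6 for that one
file. Whether the header is honoured depends on the Tika server version, so the
dev list remains the place to confirm it.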

 

Cheers,

Chris

 

 

 

 

From: Aswathi Nambiar <aswathi.namb...@ihsmarkit.com>
Date: Wednesday, November 13, 2019 at 7:39 AM
To: "Mattmann, Chris A (US 1760)" <chris.a.mattm...@jpl.nasa.gov>
Subject: [EXTERNAL] How to set the page segmentation for TIKA python

 

Hi Chris, 

 

I am using Apache Tika OCR from Python and am trying to extract text from an 
image with parser.from_file. According to the documentation, the default page 
segmentation mode (psm) appears to be 1. How do I change the psm to 6 from Python? 

I couldn’t find any documentation for this. I could only find it for Java, at 
the link below. 

https://cwiki.apache.org/confluence/display/tika/TikaOCR

 

Regards,

Aswathi Nambiar

 



