[ 
https://issues.apache.org/jira/browse/TIKA-1543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tyler Palsulich closed TIKA-1543.
---------------------------------
    Resolution: Fixed

This isn't actually a problem. I just tested locally -- it works.

We have unit tests for the path, but it's difficult to test that extraction 
works with a non-standard path, since we don't know what the path is...

I think the problem is either:
    The path you set is not to the directory that contains the executable or 
    The path doesn't have a tessdata directory inside it.

You can see all of the Tesseract debugging messages by enabling {{debug}} level 
logging (put a 
[log4j.properties|https://github.com/apache/tika/blob/10298692cb27d1ad3732589930987e2fe2681ee8/tika-parsers/src/test/resources/log4j.properties]
 file on your classpath and set the output level to {{debug}}).

I'd be happy to help you debug further.

> TesseractOCRParser.setTesseractPath() doesn't work on Linux
> -----------------------------------------------------------
>
>                 Key: TIKA-1543
>                 URL: https://issues.apache.org/jira/browse/TIKA-1543
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.7
>            Reporter: Sean Zhao
>             Fix For: 1.8
>
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> After call setTesseractPath() to set the Tesseract path to a not-default 
> path, like /root/tesseract , call the TesseractOCRParser.parse(), nothing 
> will return.
> Not sure if this is related to TIKA-1421.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to