Matthew Caruana Galizia created TIKA-2167:
---------------------------------------------
Summary: Image processing causes OCR to fail
Key: TIKA-2167
URL: https://issues.apache.org/jira/browse/TIKA-2167
Project: Tika
Issue Type: Bug
Components: ocr
Affects Versions: 1.14
Environment: Mac OS X 10.11.6; Java 1.8.0_45; tesseract 3.04.01;
ImageMagick 6.9.6-2
Reporter: Matthew Caruana Galizia
Priority: Critical
Image processing before OCR is enabled by default in the OCR configuration
properties file. Unless this is disabled, running Tika on a simple TIFF image
(attached) with two clear words fails. When image processing is disabled, it
succeeds.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)