Ewan Mellor created TIKA-2584:
---------------------------------
Summary: Tika should have a way to pass arbitrary Tesseract options
Key: TIKA-2584
URL: https://issues.apache.org/jira/browse/TIKA-2584
Project: Tika
Issue Type: Improvement
Components: parser
Affects Versions: 1.17
Reporter: Ewan Mellor
Tesseract has a very large number of config options (use tesseract
--print-parameters to see them). There is no mechanism for TesseractOCRParser
/ TesseractOCRConfig to pass these to Tesseract, and so they cannot be
controlled by user code.
Tika should pass these through as opaque key-value pairs, so that user code can
set them as necessary.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)