[ https://issues.apache.org/jira/browse/TIKA-2190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tim Allison resolved TIKA-2190. ------------------------------- Resolution: Fixed Fix Version/s: 1.15 2.0 Thank you! > Add "preserve_interword_spaces" option of tesseract > --------------------------------------------------- > > Key: TIKA-2190 > URL: https://issues.apache.org/jira/browse/TIKA-2190 > Project: Tika > Issue Type: Improvement > Components: ocr > Reporter: Bipul Kumar > Assignee: Tim Allison > Fix For: 2.0, 1.15 > > > This option will preserve the spaces for TXT output type so that the layout > or context can be inferred while further parsing. > to enable :: -c preserve_interword_spaces=1 > to disable :: -c preserve_interword_spaces=0 or simply don't mention -- This message was sent by Atlassian JIRA (v6.3.4#6332)