Hi Tika Committers,

I was wondering if [1] has a chance of getting added. It brings the command 
line options on par with the Tika API for text extraction for the very common 
use case of getting „all text“ for indexing. The patch [2] has unit tests and 
is IMO very straightforward.

We rely on it at Adobe in our use of Tika via the cli, and currently have to 
maintain our own patch build, and it would be great to be able to use the 
standard releases again.

[1] https://issues.apache.org/jira/browse/TIKA-3044
[2] https://patch-diff.githubusercontent.com/raw/apache/tika/pull/312.patch

Thanks,
Alexander Klimetschek

Principal Scientist, Adobe
Apache OpenWhisk & Jackrabbit committer

Reply via email to