Hi Tika Committers, I was wondering if [1] has a chance of getting added. It brings the command line options on par with the Tika API for text extraction for the very common use case of getting „all text“ for indexing. The patch [2] has unit tests and is IMO very straightforward.
We rely on it at Adobe in our use of Tika via the cli, and currently have to maintain our own patch build, and it would be great to be able to use the standard releases again. [1] https://issues.apache.org/jira/browse/TIKA-3044 [2] https://patch-diff.githubusercontent.com/raw/apache/tika/pull/312.patch Thanks, Alexander Klimetschek Principal Scientist, Adobe Apache OpenWhisk & Jackrabbit committer
