Mike Thomsen created NIFI-9647:
----------------------------------
Summary: Add support for full text extraction of binary documents
supported by Apache Tika
Key: NIFI-9647
URL: https://issues.apache.org/jira/browse/NIFI-9647
Project: Apache NiFi
Issue Type: Improvement
Reporter: Mike Thomsen
Assignee: Mike Thomsen
This improvement will wrap Apache Tika using an updated version of Tim Spann's
ExtractTextProcessor processor. I contacted Tim via LinkedIn, and he agreed to
make it part of the NiFi code base going forward. In addition, this ticket adds
the include-media profile which makes it possible to easily add the NiFi media
bundle to a custom build of NiFi.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)