Mike Thomsen created NIFI-9647:
----------------------------------

             Summary: Add support for full text extraction of binary documents 
supported by Apache Tika
                 Key: NIFI-9647
                 URL: https://issues.apache.org/jira/browse/NIFI-9647
             Project: Apache NiFi
          Issue Type: Improvement
            Reporter: Mike Thomsen
            Assignee: Mike Thomsen


This improvement will wrap Apache Tika using an updated version of Tim Spann's 
ExtractTextProcessor processor. I contacted Tim via LinkedIn, and he agreed to 
make it part of the NiFi code base going forward. In addition, this ticket adds 
the include-media profile which makes it possible to easily add the NiFi media 
bundle to a custom build of NiFi.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to