[ 
https://issues.apache.org/jira/browse/NIFI-9647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

David Handermann updated NIFI-9647:
-----------------------------------
    Summary: Add ExtractDocumentText Processor  (was: Add support for full text 
extraction of binary documents supported by Apache Tika)

> Add ExtractDocumentText Processor
> ---------------------------------
>
>                 Key: NIFI-9647
>                 URL: https://issues.apache.org/jira/browse/NIFI-9647
>             Project: Apache NiFi
>          Issue Type: Improvement
>            Reporter: Mike Thomsen
>            Assignee: Mike Thomsen
>            Priority: Major
>          Time Spent: 2h 40m
>  Remaining Estimate: 0h
>
> This improvement will wrap Apache Tika using an updated version of Tim 
> Spann's ExtractTextProcessor processor. I contacted Tim via LinkedIn, and he 
> agreed to make it part of the NiFi code base going forward. In addition, this 
> ticket adds the include-media profile which makes it possible to easily add 
> the NiFi media bundle to a custom build of NiFi.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to