[jira] [Updated] (NIFI-7775) Exclude TesseractOCR Parser from ExtractMediaMetadata Processor

Timon Faerber (Jira) Sun, 30 Aug 2020 07:00:38 -0700


     [ 
https://issues.apache.org/jira/browse/NIFI-7775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Timon Faerber updated NIFI-7775:
--------------------------------
    Description: 
ExtractMediaMetadata uses Apache Tika to extract media metadata.

Tika's DefaultParser also includes TesseractOCRParser. It is not needed for 
this use case, and excluding it improves runtime.

TesseractOCRParser could be excluded with a TikaConfig file.

  was:
ExtractMediaMetadata uses Apache Tika to extract media metadata.

Tika's DefaultParser also includes TesseractOCRParser. It is not needed for 
this use case and costs some time for each FlowFile that goes through this 
processor.

TesseractOCRParser could be excluded with a TikaConfig file.


> Exclude TesseractOCR Parser from ExtractMediaMetadata Processor
> ---------------------------------------------------------------
>
>                 Key: NIFI-7775
>                 URL: https://issues.apache.org/jira/browse/NIFI-7775
>             Project: Apache NiFi
>          Issue Type: Improvement
>            Reporter: Timon Faerber
>            Priority: Minor
>             Fix For: 1.13.0
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> ExtractMediaMetadata uses Apache Tika to extract media metadata.
> Tika's DefaultParser also includes TesseractOCRParser. It is not needed for 
> this use case, and excluding it improves runtime.
>  
> TesseractOCRParser could be excluded with a TikaConfig file.
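
The issue does not show what the TikaConfig file would contain. As a minimal sketch, the snippet below embeds one possible tika-config.xml that keeps Tika's DefaultParser but excludes TesseractOCRParser, using the parser-exclude element from Tika's configuration format. The class name TikaConfigSketch and the helper excludedParserClass are hypothetical and exist only for illustration; the code uses the JDK's XML parser to check that the file is well-formed and names the intended class, and it does not reflect how the actual NiFi change was implemented.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

// Hypothetical helper class for illustration; not part of NiFi or Tika.
class TikaConfigSketch {

    // Candidate tika-config.xml contents: keep DefaultParser, minus the OCR parser.
    static final String TIKA_CONFIG_XML =
          "<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n"
        + "<properties>\n"
        + "  <parsers>\n"
        + "    <parser class=\"org.apache.tika.parser.DefaultParser\">\n"
        + "      <parser-exclude class=\"org.apache.tika.parser.ocr.TesseractOCRParser\"/>\n"
        + "    </parser>\n"
        + "  </parsers>\n"
        + "</properties>\n";

    // Parses the XML with the JDK parser and returns the class named
    // in the first <parser-exclude> element.
    static String excludedParserClass(String xml) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        Element exclude = (Element) doc.getElementsByTagName("parser-exclude").item(0);
        return exclude.getAttribute("class");
    }

    public static void main(String[] args) throws Exception {
        System.out.println("Excluded parser: " + excludedParserClass(TIKA_CONFIG_XML));
    }
}
```

With Tika on the classpath, the same XML could presumably be handed to Tika through the TikaConfig constructor that accepts an InputStream, and the resulting configuration passed to an AutoDetectParser. Whether ExtractMediaMetadata would load it this way is an assumption.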



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
