Sal created TIKA-3431:
-------------------------

             Summary: Using any setting other than AUTO for 
X-Tika-PDFOcrStrategy causes remarkable performance loss
                 Key: TIKA-3431
                 URL: https://issues.apache.org/jira/browse/TIKA-3431
             Project: Tika
          Issue Type: Bug
          Components: tika-server
    Affects Versions: 1.26
            Reporter: Sal


When processing PDF document to the local Tika server using PUT request to 
endpoint [http://localhost:9998/tika.]  If the PDFOcrStrategy is set to 
anything other than AUTO or NO_OCR, this causes extreme slowdown in processing 
of the PDF file.  

 

It doesn't matter if the PDF document has inline images or not, the slowdown 
happens regardless.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to