[
https://issues.apache.org/jira/browse/TIKA-3300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17286543#comment-17286543
]
Luís Filipe Nassif commented on TIKA-3300:
------------------------------------------
Hi [~tallison]!
I just tested on Windows with a small set of 7046 images, with ForkParser
enabled (48 processes):
OpenMP enabled: 1302s
OpenMP disabled: 510s
After that, I remembered that OpenMP support was disabled by default in
tesseract 4.1 (I'm using 4.0) because of differences like these:
https://github.com/tesseract-ocr/tesseract/releases/tag/4.1.0
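For anyone who wants to try a similar comparison without rebuilding tesseract, here is a minimal sketch of one possible approach. It assumes a tesseract binary built with OpenMP and launched as an external process. It caps OpenMP at one thread per process through the standard OMP_THREAD_LIMIT environment variable. This is an assumption for illustration only: the numbers above do not say how OpenMP was disabled, and the class and method names (TesseractLauncher, singleThreaded) are hypothetical, not Tika API.

```java
import java.util.List;

public class TesseractLauncher {

    /**
     * Hypothetical helper (not part of Tika): prepares a tesseract invocation
     * whose OpenMP runtime is limited to a single thread.
     */
    static ProcessBuilder singleThreaded(String imagePath, String outputBase) {
        // Basic tesseract CLI form: tesseract <image> <outputbase>
        ProcessBuilder pb = new ProcessBuilder(List.of("tesseract", imagePath, outputBase));
        // OMP_THREAD_LIMIT is a standard OpenMP environment variable; setting it
        // to 1 keeps each tesseract process from spawning extra worker threads.
        pb.environment().put("OMP_THREAD_LIMIT", "1");
        // Merge stderr into stdout so a caller only has one stream to drain.
        pb.redirectErrorStream(true);
        return pb;
    }

    public static void main(String[] args) throws Exception {
        if (args.length < 2) {
            System.err.println("usage: TesseractLauncher <image> <outputbase>");
            return;
        }
        // Requires tesseract on the PATH; inheritIO lets its output reach the console.
        Process p = singleThreaded(args[0], args[1]).inheritIO().start();
        System.exit(p.waitFor());
    }
}
```

With many worker processes already running in parallel (48 in the test above), limiting each tesseract process to one thread avoids oversubscribing the CPU cores, which may be what explains the 1302s vs. 510s gap.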
> Figure out if we can improve tesseract parallelization
> -------------------------------------------------------
>
> Key: TIKA-3300
> URL: https://issues.apache.org/jira/browse/TIKA-3300
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Major
>
> https://github.com/tesseract-ocr/tesseract/issues/2609
> https://twitter.com/jbaiter_/status/1360266497864704008?s=20
> Not sure if this affects us? h/t [~jbaiter]
--
This message was sent by Atlassian Jira
(v8.3.4#803005)