Re: [tesseract-ocr] Fasten Tesseract OCR

Zdenko Podobny Thu, 14 Dec 2023 11:41:11 -0800

A more effective approach to addressing the issue is to create a
test/example case. Advanced users can then evaluate and potentially offer
solutions


It would be helpful if you could provide details on how you obtain and
process the input images, as well as the OCR execution method (API,
wrapper, executable). Examining this could reveal opportunities for speed
improvements, particularly by minimizing IO operations.

It's worth noting that there have been reported problems with OpenMP on
Linux and Mac in the context of extensive OCR tasks, as outlined in these
GitHub issues: [1], [2].
Investigating these and other performance related) issues may offer
insights into potential optimizations.

[1]
https://github.com/tesseract-ocr/tesseract/issues/943#issuecomment-1798155553
[2] https://github.com/tesseract-ocr/tesseract/issues/3109

Zdenko




Zdenko


št 30. 11. 2023 o 14:57 vadansh kulshreshtha <
[email protected]> napísal(a):

> I am using an i3 quad-core CPU. My scenario is that I want to process 100
> images in 1 sec including the image processing and cropping images. I
> create an ROI crop it and do the image processing then OCR. But what
> happens is that sometimes the same ROI takes more than 1 sec but sometimes
> it does it in 150-200ms. Also, I use the best train file of Tesseract.
> Also, the size of my ROI is not more than the size of a word. eg. "super@145
> &4califragilisticexpialidocious".
>
> For image processing, I do the thresholding, and zooming of images if
> required.
>
> Please do suggest to me the ways to get a reliable OCR processing time and
> also ways to fasten the OCR.
>
> Thank you
>
> On Wednesday, 29 November 2023 at 20:04:25 UTC+5:30 zdenop wrote:
>
>> Your request is too general e.g.  reply could be "upgrade your
>> hardware"... ;-)
>>
>> Unless you provide details about your testing environment + process of
>> measuring speed and testing images, there is just one general advice: read
>> the docs and issue tracker (including closed issues), there are several
>> discussions (and hints) regarding speed.
>>
>> Zdenko
>>
>>
>> st 29. 11. 2023 o 13:53 vadansh kulshreshtha <
>> [email protected]> napísal(a):
>>
>>> Hello Everyone,
>>>
>>> I am using Tesseract OCR 5.2 and I want to speed up my OCR process so
>>> for that, could any help me with the same? It would be a great help for me.
>>> Also can anyone tell me all the parameters that affect the speed of OCR.
>>>
>>> Thank you
>>>
>>> --
>>>
>> You received this message because you are subscribed to the Google Groups
>>> "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to [email protected].
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/7ec9293f-798e-48e0-a742-c6ece2775165n%40googlegroups.com
>>> <https://groups.google.com/d/msgid/tesseract-ocr/7ec9293f-798e-48e0-a742-c6ece2775165n%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>>
>> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/e720c41d-16b0-4268-9a1e-a197db2dcc13n%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/e720c41d-16b0-4268-9a1e-a197db2dcc13n%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8wwX3YYP%2BQAMAE8R6Qbz4qnkXDToa1Gj-ebnzsLkHLnVQ%40mail.gmail.com.

Re: [tesseract-ocr] Fasten Tesseract OCR

Reply via email to