[tesseract-ocr] Re: Hardware optimisation

Tuan Ardouin Thu, 16 Apr 2020 03:08:03 -0700

I'm also interested in this question. Have you read this issue on the 
tesseract repository ?
https://github.com/tesseract-ocr/tesseract/issues/263

The 2 main things I can read on different issues are custom kernel 
configuration and single threading.

An example of custom kernel configuration : 
https://make-linux-fast-again.com/)
As for the single thread, the tesseract executable can use multithreading 
to speed up the OCR processing of a single page. The gain is not really 
large, it costs excessive CPU overhead, and so the suggested solution is to 
disable that, either at compile time (--disable-openmp) or at run time (
OMP_THREAD_LIMIT=1). You then have to run as much tesseract workers as your 
core total number.

That's my understanding but an expert point of view on the matter would be 
greatly appreciated.

On Wednesday, April 15, 2020 at 9:13:03 PM UTC+2, adamuk73 wrote:
>
> I'm interested to know how hardware-sensitive Tesseract 4 is to hardware. 
> For example is double number of cores going to double to processing speed? 
> Or dual cpu set ups? 
>
> Does Tesseract mostly run in RAM or does it use much hdd? 
>
> Any guidance would be appreciated 
> . I'm looking to run an instance of Tesseract on a x86 box running Centos. 
>
> Thanks in advance. 

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/affc7488-2102-41ac-97c3-e0a25c3089a0%40googlegroups.com.

[tesseract-ocr] Re: Hardware optimisation

Reply via email to