According to the FAQ, if I run tesseract on a multi-page image, it
should process the pages in parallel.
I am converting a 10-page TIF (in one file) into PDF, and looking at *top*,
it seems like tesseract never uses more than about 250% CPU (I have 16
cores / 32 threads on my machine).
Am I doing something wrong?
tesseract combined.tif out pdf
Tesseract Open Source OCR Engine v4.00.00alpha with Leptonica
OSD: Weak margin (6.98) for 914 blob text block, but using orientation
tesseract -v (from Debian Testing):
libgif 5.1.4 : libjpeg 6b (libjpeg-turbo 1.5.1) : libpng 1.6.28 : libtiff
4.0.8 : zlib 1.2.8 : libwebp 0.5.2 : libopenjp2 2.1.2
You received this message because you are subscribed to the Google Groups
To unsubscribe from this group and stop receiving emails from it, send an email
To post to this group, send email to email@example.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit
For more options, visit https://groups.google.com/d/optout.