[tesseract-ocr] OCR on number 0 usualy working but not always

Sofie Geens Wed, 01 Mar 2023 07:05:12 -0800

I want to read numbers with pytesseract and it does it with 100% accuracy 
until a certain point, from there it just doesn't read anything anymore. 
The total grid of numbers I want to read looks something like this: 
[image: test1.png]
I only want to know the red numbers (these are green when smaller then 10). 
To read the numbers, I take screenshots from only the number that looks 
something like this:
 [image: roodCijfer.png] or [image: groenCijfer.png]
Then I remove the background, and make the numbers bigger so that the 
number is about 30 pixels high. That looks like this: 
[image: tmp.png]
Then I try use the following line of code to read this:


*result = pytesseract.image_to_string(img, lang='eng', config='--psm 10 
--oem 3 -c tessedit_char_whitelist=0123456789')print(result)*

This works for all the red numbers and some of the green numbers. The ones 
on the bottom row work perfectly, even when the number becomes 0. When it 
reaches the top row however, red and green works, except for the 0. It 
always fails there. The picture it tries to read is the one shown before 
(white background, big black 0, I don't have an example of a 0 where it 
works fine). I have no clue why it doesn't work, I do the exact same 
preprocessing as with the bottom row, but don't get the same good results. 
What can I do to get this to work?


-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/c66504fc-c6e4-4d32-92e3-c88311585df9n%40googlegroups.com.

[tesseract-ocr] OCR on number 0 usualy working but not always

Reply via email to