[tesseract-ocr] Pytesseract used with captcha images unable to recognize characters with lines on top

2018-05-06 Thread Rutanshu Jhaveri
*script.py* cv2.imwrite(filename, imgOP) text = pytesseract.image_to_string(Image.open(filename)) print(text) Within the files that I have attached, for Output1.png while using pytesseract I get the following in the console *PGKQKf* Instead of getting *PGKQKT* And as for the the second

Re: [tesseract-ocr] Problem reading text in two columns

2018-05-06 Thread ShreeDevi Kumar
Which version of tesseract are you using? Which traineddata (from which repo) Try with --psm 6 if using tesseract 4 beta. It will recognise whole line, rather than column On Mon 7 May, 2018, 1:21 AM Brooks Johnson, wrote: > >

[tesseract-ocr] Problem reading text in two columns

2018-05-06 Thread Brooks Johnson
I was experimenting with an image of a receipt but there seems to be trouble reading the two columns. I'm including a sample image so you can see what I was

[tesseract-ocr] Re: tesseract performs wrong auto-correction sometimes : how to disable it?

2018-05-06 Thread Clark Knøsen
I experience the same with tesseract 4.0 installed with best traindata from repo # printf "deb https://notesalexp.org/tesseract-ocr/$(lsb_release -sc)/ $(lsb_release -sc) main\ndeb https://notesalexp.org/tesseract-ocr/tessdata_best/ stretch main\n" >> /etc/apt/sources.list onsdag den 25.