Consider the attached receipt.  

I am trying to get text from this image.  

I tried every page segmentation option I could:

➜  receipts  tesseract costco.jpg costco -psm 0

Tesseract Open Source OCR Engine v3.02.02 with Leptonica

Error during processing.

➜  receipts  tesseract costco.jpg costco -psm 1

Tesseract Open Source OCR Engine v3.02.02 with Leptonica

OSD: Weak margin (4.85) for 209 blob text block, but using orientation anyway: 0

➜  receipts  tesseract costco.jpg costco -psm 2

Tesseract Open Source OCR Engine v3.02.02 with Leptonica

➜  receipts  tesseract costco.jpg costco -psm 4

Tesseract Open Source OCR Engine v3.02.02 with Leptonica

set_count == gridheight():Error:Assert failed:in file colfind.cpp, line 648

[1]    46598 abort      tesseract costco.jpg costco -psm 4

➜  receipts  tesseract costco.jpg costco -psm 5

Tesseract Open Source OCR Engine v3.02.02 with Leptonica

➜  receipts  tesseract costco.jpg costco -psm 6

Tesseract Open Source OCR Engine v3.02.02 with Leptonica

➜  receipts  tesseract costco.jpg costco -psm 7

Tesseract Open Source OCR Engine v3.02.02 with Leptonica

➜  receipts  tesseract costco.jpg costco -psm 8

Tesseract Open Source OCR Engine v3.02.02 with Leptonica

➜  receipts  tesseract costco.jpg costco -psm 9

Tesseract Open Source OCR Engine v3.02.02 with Leptonica

➜  receipts  tesseract costco.jpg costco -psm 10

Tesseract Open Source OCR Engine v3.02.02 with Leptonica

➜  receipts  tesseract costco.jpg costco -psm 6 -l eng

Tesseract Open Source OCR Engine v3.02.02 with Leptonica

➜  receipts  tesseract -v                             
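
For anyone who wants to repeat these runs, here is a rough Python sketch of the same loop. It assumes Python 3 and a tesseract 3.x binary on the PATH (tesseract 4 and later spell the flag --psm instead of -psm). The function names and the per-mode output names are my own invention for illustration, not part of tesseract; the runs above all wrote to the same "costco" output base.

```python
import subprocess

# Page segmentation modes tried in the log above (mode 3 was not tried).
PSM_MODES = [0, 1, 2, 4, 5, 6, 7, 8, 9, 10]


def build_command(image, outbase, psm, lang=None):
    """Return the argv list for one tesseract 3.x run (single-dash -psm)."""
    cmd = ["tesseract", image, outbase, "-psm", str(psm)]
    if lang:
        # Optional language, as in the "-psm 6 -l eng" run above.
        cmd += ["-l", lang]
    return cmd


def run_all(image="costco.jpg", prefix="costco"):
    """Run every mode and print its exit code; requires tesseract on PATH."""
    for psm in PSM_MODES:
        # Hypothetical naming: one output base per mode so results
        # are not overwritten between runs.
        outbase = "%s_psm%d" % (prefix, psm)
        result = subprocess.run(
            build_command(image, outbase, psm),
            stdout=subprocess.PIPE,
            stderr=subprocess.PIPE,
        )
        print("psm %d -> exit code %d" % (psm, result.returncode))
```

Calling run_all() from the receipts directory should leave one text file per mode (tesseract appends .txt to the output base), which makes it easier to compare the results side by side.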


The option that gives me the most data is -psm 6, but the output is 
unreadable (see attached file).


How can I read this image?

Thanks


Attached file:

 l
"$2  "
nan: acnvg: ms {M
-mam mu an 1% A
gas wgszge? .e- ~
29 m a new’ ‘.33
W W WE :;-:4:
@333 R71 3'11» W4 gigs
  @533
mag &5.1,&....n N
[£8150 M1 PMH 51 ‘L29
 mm 8.28
2 G 1 D0
3% mm mm:
§§:§ angry 
325.. my 5; .
3o;s15 mm rflhm 3%;
335% Bin/§.u§'ik‘” u1uo—
SUIIMK II6.'!B
fl 7.25 1 mx RAVE 7.8!
gm. filfiéfigih
ms .m.. M
