Sorry, I forgot to specify that. Tesseract 3.04.01
I'm using the data from tesseract-ocr-eng.

On Sunday, May 6, 2018 at 11:16:39 PM UTC-5, shree wrote:
>
> Which version of tesseract are you using?
>
> Which traineddata (from which repo)
>
> Try with --psm 6 if using tesseract 4 beta. It will recognise whole line,
> rather than column
>
> On Mon 7 May, 2018, 1:21 AM Brooks Johnson, <[email protected]> wrote:
>
>>
>> <https://lh3.googleusercontent.com/-BFTqPnFWM6A/Wu9T9hZgG0I/AAAAAAAAXaM/dGe3ZIexEDsIdueXtCYFqmNg1-6vLbBXwCLcBGAs/s1600/receipt.png>
>> I was experimenting with an image of a receipt but there seems to be
>> trouble reading the two columns. I'm including a sample image so you can
>> see what I was working with. The output I get from running "tesseract
>> receipt.png out" is this:
>>
>>
>> CUL DAIRY
>> CHOBANI VOG
>>
>> PRODUCE
>>
>> HONEVURISP APPLES
>>
>> 0.93 lb 6 $2.29/ 1b
>> {are Weyght: 0.011b
>>
>> BANANAS
>>
>> 3.16 lb 9 $0,59/ lb
>> Tare Weight: 0.01m
>>
>> BALANCEDlE
>>
>> $2.13
>>
>> $1.86
>>
>> $9.88
>>
>>
>>
>> There are a few typos but the biggest concern is that the $5.89 is
>> nowhere to be found, but the prices that are below it manage to be
>> included. That first price is still missing after I processed the image
>> and even used a different image taken under different lighting. Am I doing
>> something wrong here?
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to [email protected].
>> To post to this group, send email to [email protected].
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/tesseract-ocr/fd5f8596-7f21-42d6-a7bb-0dcafa113a4a%40googlegroups.com.
>> For more options, visit https://groups.google.com/d/optout.
>>
>
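
For reference, here is a rough sketch of how the page segmentation suggestion quoted above might be tried from a script. Mode 6 tells Tesseract to assume a single uniform block of text. The helper names (`build_tesseract_cmd`, `run_ocr`) and the `major_version` parameter are made up for illustration. The one detail that matters for this thread is that Tesseract 3.x spells the option `-psm`, while 4.x spells it `--psm`. Whether this actually recovers the missing $5.89 is untested.

```python
import subprocess


def build_tesseract_cmd(image, out_base, psm=6, major_version=3):
    """Build the argument list for a tesseract run with a page segmentation mode.

    Hypothetical helper: major_version is supplied by the caller, it is not
    detected from the installed binary.
    """
    # Tesseract 3.x takes -psm (single dash); 4.x takes --psm.
    psm_flag = "-psm" if major_version < 4 else "--psm"
    return ["tesseract", image, out_base, psm_flag, str(psm)]


def run_ocr(image, out_base, psm=6, major_version=3):
    """Run tesseract; the recognized text is written to out_base + '.txt'."""
    # Assumes the tesseract binary is on PATH; raises if it exits non-zero.
    subprocess.run(build_tesseract_cmd(image, out_base, psm, major_version), check=True)
```

With 3.04.01 this amounts to running `tesseract receipt.png out -psm 6`; calling `run_ocr("receipt.png", "out")` does the same thing and leaves the result in `out.txt`.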

