I've uninstalled and reinstalled from the PPA and my results resemble yours. I used the tessdata_fast file for English - are these different from tessdata-ocr-eng that comes with Ubuntu?
On Wednesday, May 9, 2018 at 3:21:12 AM UTC-5, shree wrote: > > Please try by building the latest version of tesseract from github >> > > or install from links given in > https://github.com/tesseract-ocr/tesseract/wiki > > I get the following output using the default eng.traineddata from the > three repos - tessdata, tessdata_best, tessdata_fast, without any > pre-processing of image. > > # tesseract receipt.png - --psm 6 --tessdata-dir ./tessdata -c > preserve_interword_spaces=1 -c page_separator='' > > Warning. Invalid resolution 0 dpi. Using 70 instead. > CUL DAIRY > > CHOBANI Y0G $5.89 F > PRODUCE > > HONEYCRTSP APPLES > > 0.931b@ $2.29/ Ib $2.13 F > Tare Weight: 0.011b > > BANANAS > > 3.16 1b® $0.59/ Ib $1.86 F > Tare Weight: 0.011b > > BALANCE DUE $9.88 > > > # tesseract receipt.png - --psm 6 --tessdata-dir ./tessdata_best -c > preserve_interword_spaces=1 -c page_separator='' > > Warning. Invalid resolution 0 dpi. Using 70 instead. > CUL DAIRY > > CHOBANI Y0G $5.89 F > PRODUCE > > HONEYCRISP APPLES > > 0.931b8 $2.20/ Ib $213 F > Tare Weight: 0.011b > > BANANAS > > 3.16 1b8 $0.59 Ib $1.86 F > Tare Weight: 0.011b > > BALANCE DUE $9.88 > > > # tesseract receipt.png - --psm 6 --tessdata-dir ./tessdata_fast -c > preserve_interword_spaces=1 -c page_separator='' > > Warning. Invalid resolution 0 dpi. Using 70 instead. > CUL DAIRY > > CHOBANI ¥OG $5.89 F > PRODUCE > > HONEYCRISP APPLES > > 0.93 Ib @ = $2.29/ Ib $2.13 F > Tare Weight: 0.011b > > BANANAS > > 3.16 1b @ —$0.59/ Ib $1.86 F > Tare Weight: 0.01Ib > > BALANCE DUE $9.88 > > > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/aab49960-0ca2-477d-ba83-dce8dd2a2438%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

