I've uninstalled and reinstalled from the PPA and my results resemble 
yours.  I used the tessdata_fast file for English - are these different 
from tessdata-ocr-eng that comes with Ubuntu?

On Wednesday, May 9, 2018 at 3:21:12 AM UTC-5, shree wrote:
>
> Please try by building the latest version of tesseract from github
>>
>
> or install  from links given in 
> https://github.com/tesseract-ocr/tesseract/wiki
>
> I get the following output using the default eng.traineddata from the 
> three repos - tessdata, tessdata_best, tessdata_fast, without any 
> pre-processing of image.
>
> # tesseract receipt.png - --psm 6 --tessdata-dir ./tessdata -c 
> preserve_interword_spaces=1 -c page_separator=''
>
> Warning. Invalid resolution 0 dpi. Using 70 instead.
> CUL DAIRY
>
> CHOBANI Y0G              $5.89 F
> PRODUCE
>
> HONEYCRTSP APPLES
>
> 0.931b@ $2.29/ Ib     $2.13 F
> Tare Weight: 0.011b
>
> BANANAS
>
> 3.16 1b®  $0.59/ Ib   $1.86 F
> Tare Weight: 0.011b
>
> BALANCE DUE               $9.88
>
>
> # tesseract receipt.png - --psm 6 --tessdata-dir ./tessdata_best -c 
> preserve_interword_spaces=1 -c page_separator=''
>
> Warning. Invalid resolution 0 dpi. Using 70 instead.
> CUL DAIRY
>
> CHOBANI Y0G              $5.89 F
> PRODUCE
>
> HONEYCRISP APPLES
>
> 0.931b8  $2.20/ Ib     $213 F
> Tare Weight: 0.011b
>
> BANANAS
>
> 3.16 1b8 $0.59 Ib   $1.86 F
> Tare Weight: 0.011b
>
> BALANCE DUE               $9.88
>
>
> # tesseract receipt.png - --psm 6 --tessdata-dir ./tessdata_fast  -c 
> preserve_interword_spaces=1 -c page_separator=''
>
> Warning. Invalid resolution 0 dpi. Using 70 instead.
> CUL DAIRY
>
> CHOBANI ¥OG              $5.89 F
> PRODUCE
>
> HONEYCRISP APPLES
>
> 0.93 Ib @ = $2.29/ Ib     $2.13 F
> Tare Weight: 0.011b
>
> BANANAS
>
> 3.16 1b @ —$0.59/ Ib   $1.86 F
> Tare Weight: 0.01Ib
>
> BALANCE DUE               $9.88
>
>
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/aab49960-0ca2-477d-ba83-dce8dd2a2438%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to