Both AVX and AVX2 are enabled on both the systems.

I am not using specific tessdata_fast or tessdata_best. I am using the 
default eng.traineddata that comes with windows installer.

On Tuesday, December 17, 2019 at 9:36:16 PM UTC+5:30, shree wrote:
>
> >There is the same version of tesseract on the two systems as i mentioned 
> before.
>
> OK. But is there any difference in specs of the 2 systems in terms of AVX 
> etc. Hence tesseract -v would be useful.
>
> Also, just check the results via CLI.
>
> I get different results when using eng.traineddata from tessdata_best and 
> tessdata_fast
>
> ubuntu@tesseract-ocr:~/TEST$ tesseract unnamed.png - --tessdata-dir 
> ~/tessdata_fast
> Warning: Invalid resolution 0 dpi. Using 70 instead.
> Estimating resolution as 195
> OFW ID CARD
>
> Republe of the Pieippings *
> Department of Labor and Employment ae
> Phiipploe Overseas Employment Admieistration,
>
>
>
>
>
>
>
>
>
>
>
>
> MARIA SANTOS DELA CRUZ
> 29911483
> ‘enh
>
> x Sess)
>
> GIDE
> ubuntu@tesseract-ocr:~/TEST$ tesseract unnamed.png - --tessdata-dir 
> ~/tessdata_best
> Warning: Invalid resolution 0 dpi. Using 70 instead.
> Estimating resolution as 195
> OFW ID CARD
>
> Ropubkc of he Pisppines
> Department of Labor and Employment "a
> Phiipplaa Overseas Employment Admieistration
>
>
>
>
>
>
>
>
>
>
>
>
> MARIA SANTOS DELA CRUZ
> 29911483
> rn ve
>
> hI [Op410)
>
> [EI
>
> On Tue, Dec 17, 2019 at 1:08 PM adesh gautam <[email protected] 
> <javascript:>> wrote:
>
>> The file size of eng.traineddata is same - 3.92MB.
>>
>> On Tuesday, December 17, 2019 at 12:47:28 PM UTC+5:30, shree wrote:
>>>
>>> Please check file sizes for eng.traineddata - they maybe different 
>>> versions even though they are called the same.
>>>
>>> On Mon, Dec 16, 2019 at 9:06 PM adesh gautam <[email protected]> wrote:
>>>
>>>>
>>>> There is the same version of tesseract on the two systems as i 
>>>> mentioned before.
>>>>
>>>> The trained data is also same, eng.traineddata 
>>>>
>>>>
>>>> These are the two images.
>>>>
>>>> [image: a.png]
>>>>
>>>> a.jpg
>>>>
>>>>
>>>> [image: b.png]
>>>>
>>>> b.jpg
>>>>
>>>>
>>>> And these are the outputs for the same images on different systems.
>>>> *System 1* 
>>>>  
>>>> *a.jpg*
>>>>
>>>> ['(A', '1, oy OFW ID CARD 2', 'Repubic of the Plasppines', 'o WJ, 
>>>> Department of Laor and Empioyment )', '(SSRee) Philippine Oversans 
>>>> Employment Admievstraticn,', 'ee ——', 'MARIA SANTOS DELA CRUZ', 'rn', 
>>>> '20911483', 'orion, 10', 'inde [0p {0)]', 'os', 'isle =', 'TUTTI ARG 
>>>> COMPANY [EI', 'dune 20, 2010']
>>>>
>>>> *b.jpg*
>>>>
>>>> ['INTERNATIONAL STUDENT TY', 'EE', 'Ng ome']
>>>>  
>>>> *System 2* 
>>>>  
>>>> *a.jpg*
>>>>
>>>> ['(~~', '% oy OFW ID CARD L', 'Nepubse of the Preappinas', '4 wi. 
>>>> Department of Labor and Employment »', 'Soe ac Pruippine Oversaas 
>>>> Employment Adminvstraion,', 'fp', 'MARIA SANTOS DELA CRUZ', 'si00an', 
>>>> '29911483', 'emt, 84', 'ine 1} 4 10)', 'mt', 'Mein Drbitue Bcd', '“werraci 
>>>> —ABo coMPany Oat', 'Si vue 90, 2010']
>>>>  
>>>> *b.jpg*
>>>>
>>>> ['DCL ae Pcl', 'R) arc orn', 'PN secret']
>>>>
>>>>
>>>> The output is different. 
>>>>
>>>> Is it normal for tesseract ?
>>>>
>>>>
>>>>
>>>> On Monday, December 16, 2019 at 6:46:26 PM UTC+5:30, shree wrote:
>>>>>
>>>>> Run tesseract --version on the different systems.
>>>>>
>>>>> Are thetraineddata files being used on the different systems the same?
>>>>>
>>>>> Share an image and the different output received in each case.
>>>>>
>>>>> On Mon, Dec 16, 2019, 17:58 adesh gautam <[email protected]> wrote:
>>>>>
>>>>>> Hi,
>>>>>>
>>>>>> I am using tesseract-ocr on my images, and i am getting different 
>>>>>> results by running tesseract on different systems for same image. 
>>>>>> I am using *pytesseract *library.
>>>>>> I am setting the following parameters:
>>>>>> *--psm 6  -c classify_enable_learning=0 -c 
>>>>>> classify_enable_adaptive_matcher=0*
>>>>>>
>>>>>> Images have* dpi=300*.
>>>>>> Tesseract version:
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>> *tesseract v5.0.0-alpha.20191030  leptonica-1.78.0   libgif 5.1.4 : 
>>>>>> libjpeg 8d (libjpeg-turbo 1.5.3) : libpng 1.6.34 : libtiff 4.0.9 : zlib 
>>>>>> 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.0  Found AVX2  Found AVX  Found 
>>>>>> FMA 
>>>>>>  Found SSE  Found libarchive 3.3.2 zlib/1.2.11 liblzma/5.2.3 
>>>>>> bz2lib/1.0.6 
>>>>>> liblz4/1.7.5*
>>>>>>
>>>>>> OS:
>>>>>> *Windows 10*
>>>>>>
>>>>>> Are there any system specific optimizations/dependencies by tesseract 
>>>>>> ? 
>>>>>>
>>>>>>
>>>>>> -- 
>>>>>> You received this message because you are subscribed to the Google 
>>>>>> Groups "tesseract-ocr" group.
>>>>>> To unsubscribe from this group and stop receiving emails from it, 
>>>>>> send an email to [email protected].
>>>>>> To view this discussion on the web visit 
>>>>>> https://groups.google.com/d/msgid/tesseract-ocr/414947ab-b10a-40b8-8196-65a5bbbb3e1c%40googlegroups.com
>>>>>>  
>>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/414947ab-b10a-40b8-8196-65a5bbbb3e1c%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>>>> .
>>>>>>
>>>>> -- 
>>>> You received this message because you are subscribed to the Google 
>>>> Groups "tesseract-ocr" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send 
>>>> an email to [email protected].
>>>> To view this discussion on the web visit 
>>>> https://groups.google.com/d/msgid/tesseract-ocr/e2cf0580-e096-4b5f-80d8-5d609051f203%40googlegroups.com
>>>>  
>>>> <https://groups.google.com/d/msgid/tesseract-ocr/e2cf0580-e096-4b5f-80d8-5d609051f203%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>> .
>>>>
>>>
>>>
>>> -- 
>>>
>>> ____________________________________________________________
>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected] <javascript:>.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/e0946afd-dbbe-41ea-9741-1bfadeff97f3%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/tesseract-ocr/e0946afd-dbbe-41ea-9741-1bfadeff97f3%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>
>
> -- 
>
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/9ab5b126-424d-41d6-af6c-63f3ea58437f%40googlegroups.com.

Reply via email to