The file size of eng.traineddata is same - 3.92MB.

On Tuesday, December 17, 2019 at 12:47:28 PM UTC+5:30, shree wrote:
>
> Please check file sizes for eng.traineddata - they maybe different 
> versions even though they are called the same.
>
> On Mon, Dec 16, 2019 at 9:06 PM adesh gautam <[email protected] 
> <javascript:>> wrote:
>
>>
>> There is the same version of tesseract on the two systems as i mentioned 
>> before.
>>
>> The trained data is also same, eng.traineddata 
>>
>>
>> These are the two images.
>>
>> [image: a.png]
>>
>> a.jpg
>>
>>
>> [image: b.png]
>>
>> b.jpg
>>
>>
>> And these are the outputs for the same images on different systems.
>> *System 1* 
>>  
>> *a.jpg*
>>
>> ['(A', '1, oy OFW ID CARD 2', 'Repubic of the Plasppines', 'o WJ, 
>> Department of Laor and Empioyment )', '(SSRee) Philippine Oversans 
>> Employment Admievstraticn,', 'ee ——', 'MARIA SANTOS DELA CRUZ', 'rn', 
>> '20911483', 'orion, 10', 'inde [0p {0)]', 'os', 'isle =', 'TUTTI ARG 
>> COMPANY [EI', 'dune 20, 2010']
>>
>> *b.jpg*
>>
>> ['INTERNATIONAL STUDENT TY', 'EE', 'Ng ome']
>>  
>> *System 2* 
>>  
>> *a.jpg*
>>
>> ['(~~', '% oy OFW ID CARD L', 'Nepubse of the Preappinas', '4 wi. 
>> Department of Labor and Employment »', 'Soe ac Pruippine Oversaas 
>> Employment Adminvstraion,', 'fp', 'MARIA SANTOS DELA CRUZ', 'si00an', 
>> '29911483', 'emt, 84', 'ine 1} 4 10)', 'mt', 'Mein Drbitue Bcd', '“werraci 
>> —ABo coMPany Oat', 'Si vue 90, 2010']
>>  
>> *b.jpg*
>>
>> ['DCL ae Pcl', 'R) arc orn', 'PN secret']
>>
>>
>> The output is different. 
>>
>> Is it normal for tesseract ?
>>
>>
>>
>> On Monday, December 16, 2019 at 6:46:26 PM UTC+5:30, shree wrote:
>>>
>>> Run tesseract --version on the different systems.
>>>
>>> Are thetraineddata files being used on the different systems the same?
>>>
>>> Share an image and the different output received in each case.
>>>
>>> On Mon, Dec 16, 2019, 17:58 adesh gautam <[email protected]> wrote:
>>>
>>>> Hi,
>>>>
>>>> I am using tesseract-ocr on my images, and i am getting different 
>>>> results by running tesseract on different systems for same image. 
>>>> I am using *pytesseract *library.
>>>> I am setting the following parameters:
>>>> *--psm 6  -c classify_enable_learning=0 -c 
>>>> classify_enable_adaptive_matcher=0*
>>>>
>>>> Images have* dpi=300*.
>>>> Tesseract version:
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> *tesseract v5.0.0-alpha.20191030  leptonica-1.78.0   libgif 5.1.4 : 
>>>> libjpeg 8d (libjpeg-turbo 1.5.3) : libpng 1.6.34 : libtiff 4.0.9 : zlib 
>>>> 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.0  Found AVX2  Found AVX  Found 
>>>> FMA 
>>>>  Found SSE  Found libarchive 3.3.2 zlib/1.2.11 liblzma/5.2.3 bz2lib/1.0.6 
>>>> liblz4/1.7.5*
>>>>
>>>> OS:
>>>> *Windows 10*
>>>>
>>>> Are there any system specific optimizations/dependencies by tesseract ? 
>>>>
>>>>
>>>> -- 
>>>> You received this message because you are subscribed to the Google 
>>>> Groups "tesseract-ocr" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send 
>>>> an email to [email protected].
>>>> To view this discussion on the web visit 
>>>> https://groups.google.com/d/msgid/tesseract-ocr/414947ab-b10a-40b8-8196-65a5bbbb3e1c%40googlegroups.com
>>>>  
>>>> <https://groups.google.com/d/msgid/tesseract-ocr/414947ab-b10a-40b8-8196-65a5bbbb3e1c%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>> .
>>>>
>>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected] <javascript:>.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/e2cf0580-e096-4b5f-80d8-5d609051f203%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/tesseract-ocr/e2cf0580-e096-4b5f-80d8-5d609051f203%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>
>
> -- 
>
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/e0946afd-dbbe-41ea-9741-1bfadeff97f3%40googlegroups.com.

Reply via email to