The file size of eng.traineddata is the same on both systems: 3.92 MB.

On Tuesday, December 17, 2019 at 12:47:28 PM UTC+5:30, shree wrote:
>
> Please check the file sizes of eng.traineddata; they may be different
> versions even though they have the same name.
>
> On Mon, Dec 16, 2019 at 9:06 PM adesh gautam <[email protected]> wrote:
>
>> The same version of Tesseract is installed on the two systems, as I
>> mentioned before.
>>
>> The trained data is also the same, eng.traineddata.
>>
>> These are the two images.
>>
>> [image: a.png]
>> a.jpg
>>
>> [image: b.png]
>> b.jpg
>>
>> And these are the outputs for the same images on different systems.
>>
>> *System 1*
>>
>> *a.jpg*
>>
>> ['(A', '1, oy OFW ID CARD 2', 'Repubic of the Plasppines', 'o WJ,
>> Department of Laor and Empioyment )', '(SSRee) Philippine Oversans
>> Employment Admievstraticn,', 'ee ——', 'MARIA SANTOS DELA CRUZ', 'rn',
>> '20911483', 'orion, 10', 'inde [0p {0)]', 'os', 'isle =', 'TUTTI ARG
>> COMPANY [EI', 'dune 20, 2010']
>>
>> *b.jpg*
>>
>> ['INTERNATIONAL STUDENT TY', 'EE', 'Ng ome']
>>
>> *System 2*
>>
>> *a.jpg*
>>
>> ['(~~', '% oy OFW ID CARD L', 'Nepubse of the Preappinas', '4 wi.
>> Department of Labor and Employment »', 'Soe ac Pruippine Oversaas
>> Employment Adminvstraion,', 'fp', 'MARIA SANTOS DELA CRUZ', 'si00an',
>> '29911483', 'emt, 84', 'ine 1} 4 10)', 'mt', 'Mein Drbitue Bcd', '“werraci
>> —ABo coMPany Oat', 'Si vue 90, 2010']
>>
>> *b.jpg*
>>
>> ['DCL ae Pcl', 'R) arc orn', 'PN secret']
>>
>> The output is different.
>>
>> Is this normal for Tesseract?
>>
>> On Monday, December 16, 2019 at 6:46:26 PM UTC+5:30, shree wrote:
>>>
>>> Run tesseract --version on the different systems.
>>>
>>> Are the traineddata files being used on the different systems the same?
>>>
>>> Share an image and the different output received in each case.
>>>
>>> On Mon, Dec 16, 2019, 17:58 adesh gautam <[email protected]> wrote:
>>>
>>>> Hi,
>>>>
>>>> I am using tesseract-ocr on my images, and I am getting different
>>>> results when running Tesseract on different systems for the same image.
>>>> I am using the *pytesseract* library.
>>>> I am setting the following parameters:
>>>> *--psm 6 -c classify_enable_learning=0 -c
>>>> classify_enable_adaptive_matcher=0*
>>>>
>>>> Images have *dpi=300*.
>>>> Tesseract version:
>>>>
>>>> *tesseract v5.0.0-alpha.20191030
>>>>  leptonica-1.78.0
>>>>   libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 1.5.3) : libpng 1.6.34 :
>>>>   libtiff 4.0.9 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.0
>>>>  Found AVX2
>>>>  Found AVX
>>>>  Found FMA
>>>>  Found SSE
>>>>  Found libarchive 3.3.2 zlib/1.2.11 liblzma/5.2.3 bz2lib/1.0.6
>>>>  liblz4/1.7.5*
>>>>
>>>> OS:
>>>> *Windows 10*
>>>>
>>>> Are there any system-specific optimizations or dependencies in
>>>> Tesseract?
>>>>
>>>> --
>>>> You received this message because you are subscribed to the Google
>>>> Groups "tesseract-ocr" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>> an email to [email protected].
>>>> To view this discussion on the web visit
>>>> https://groups.google.com/d/msgid/tesseract-ocr/414947ab-b10a-40b8-8196-65a5bbbb3e1c%40googlegroups.com
>>>>
>
> --
>
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
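P.S. Matching file sizes do not prove the two eng.traineddata files are byte-for-byte identical, since different model builds can happen to have the same size. A stricter check is to compare a checksum of the file on each system. Below is a minimal sketch using only the Python standard library. The helper names `sha256_of` and `same_content` are made up for illustration, and the path in the comment is a placeholder, not the actual install location.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large model files are not loaded at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_content(path_a, path_b):
    """Return True only if both files have identical size and SHA-256 digest."""
    if Path(path_a).stat().st_size != Path(path_b).stat().st_size:
        return False  # Different sizes can never be identical files.
    return sha256_of(path_a) == sha256_of(path_b)


# Hypothetical usage: run on each system and compare the printed digests.
# print(sha256_of(r"<your tessdata directory>\eng.traineddata"))
```

If the digests printed on the two systems differ, the models differ even though the sizes match. If they are equal, the traineddata file can be ruled out as the cause of the differing output.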

