My tests show that it is not possible to use newer language data files (prepared with a newer version of tesseract) in an older version of tesseract, as Zdenko already clarified.
However, traineddata prepared with the older version does work in the newer version of tesseract, according to my test.
-sriranga(79yrs)

On Sun, Feb 5, 2012 at 9:52 PM, Sriranga(78yrsold) <[email protected]> wrote:

> Also tested in tesseract-3.02 using the eng.traineddata of r-527
> -works fine vide screenshot attached
>
> On Sun, Feb 5, 2012 at 9:44 PM, Sriranga(78yrsold) <[email protected]> wrote:
>
>> Also tested in the r-527 using the eng.traineddata of ver 3.02. I got an
>> error message vide screenshot attached which is self explanatory.
>> -sriranga(79yrs)
>>
>> On Sun, Feb 5, 2012 at 8:52 PM, zdenko podobny <[email protected]> wrote:
>>
>>> Just quick tests:
>>>
>>> I am able to run 'tesseract eurotext.tif eurotext' (it uses
>>> eng.traineddata) and I got a result on linux without any problem...
>>> Can you verify the downloaded file? In the attachment you can find my md5
>>> checksum...
>>> tesseract 3.02 also works with the 3.01 data file (as I tested it on linux),
>>> so try to use the 3.01 eng.traineddata... If it does not work, I guess the
>>> problem is not in 'eng.traineddata'...
>>>
>>> 2012/2/5 Patrick Questembert <[email protected]>
>>>
>>>> This is running on iOS, within an app which has been running perfectly
>>>> with Tesseract 2.04, 3.00 and 3.01 using the same init with lang API with
>>>> eng.traineddata
>>>> It's clearly not an issue of not being able to locate the file, the
>>>> assert appears to state that the training set is inconsistent in terms of
>>>> expected number of entries.
>>>>
>>>> The API call is: myTess->Init(tessDataDir.c_str(), language,
>>>> OEM_DEFAULT, NULL, 0, false);
>>>>
>>>> Thanks,
>>>> Patrick
>>>>
>>>> 2012/2/5 Zdenko Podobný <[email protected]>
>>>>
>>>>> Can you please provide more details (OS, compiler, how to run/use
>>>>> tesseract)?
>>>>>
>>>>> Zdenko
>>>>>
>>>>> On 05.02.2012 15:38, patrickq wrote:
>>>>>
>>>>> I am running the latest Tess 3.02 with the new English training set
>>>>> and get the following crash at init with lang:
>>>>>
>>>>> actual_tessdata_num_entries_ <= TESSDATA_NUM_ENTRIES:Error:Assert failed:in file tessdatamanager.cpp, line 48
>>>>>
>>>>> Has anyone seen this?
>>>>>
>>>>> Note: I am not using the cube version, just "eng" with eng.traineddata
>>>>>
>>>>> By the way: I noticed the new training set is 21.9MB versus 3.1MB for
>>>>> Tesseract 3.01: just more fonts added or something else too?
>>>>>
>>>>> Thanks,
>>>>> Patrick
>>>>>
>>>>> --
>>>>> You received this message because you are subscribed to the Google
>>>>> Groups "tesseract-ocr" group.
>>>>> To post to this group, send email to [email protected]
>>>>> To unsubscribe from this group, send email to
>>>>> [email protected]
>>>>> For more options, visit this group at
>>>>> http://groups.google.com/group/tesseract-ocr?hl=en
>>>>
>>>> --
>>>> Patrick Questembert, *ScanBizCards*
>>>> +1-917-250-4177 | www.scanbizcards.com
>>>> twitter.com/ScanBizCards | www.facebook.com/ScanBizCards
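
For anyone following Zdenko's suggestion in this thread to verify the downloaded file against an md5 checksum, here is a minimal sketch in Python. The function names and the file path are only illustrative assumptions, and the reference checksum itself was in Zdenko's attachment, which is not reproduced here.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so a large traineddata file is never
        # loaded into memory all at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected_hex):
    """Compare the file's MD5 with an expected hex string (case-insensitive)."""
    return md5_of_file(path) == expected_hex.strip().lower()
```

Usage would be something like `matches_checksum("tessdata/eng.traineddata", expected)`, where `expected` is the checksum string from a trusted source. A mismatch would point to a corrupted or incomplete download. A match would not rule out the version incompatibility discussed above.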

