I recall Ray adding a stderr feature for Windows. I took the English training images from 2.03 and put them one level below the tessdata folder (in tessdata\eng\). Here are two command lines I tried that produced .tr files:
tesseract tessdata\eng\eng.arial.tif junk nobatch box.train.stderr
tesseract tessdata\eng\eng.arialbd.tif junk nobatch box.train.stderr

On Mar 16, 3:00 am, 74yrs old <[email protected]> wrote:
> Request for a solution to the following problems.
> For testing purposes I used phototest.tif as a sample to find out
> whether tesseract-r319svn is able to generate all required data files
> without any inherent problems.
>
> Today, as a last effort, I downloaded tesseract r319 again from the source,
> re-compiled it, and generated all exe files in VC++ 2008.
>
> I tested Kan1.tif and also phototest.tif by running
> "tesseract phototest.tif test -l eng"; the output was fine. The output for
> Kan1.tif was also fine, with some misspellings.
>
> In order to generate fresh eng and kan data files I tried the following:
>
> Step 1: generated a txt file for each image using the command lines
>     tesseract phototest.tif phototest batch.nochop makebox
>     tesseract kan1.tif kan1 -l kan batch.nochop makebox
>
> Step 2: renamed the generated phototest.txt and kan1.txt to phototest.box
> and Kan1.box.
>
> Step 3:
>     tesseract phototest.tif junk nobatch box.train
>     tesseract kan1.tif junk nobatch box.train
>
> In Step 3, instead of generating .tr files, the Windows error message
> "tesseract.exe has encountered a problem" was displayed for both
> phototest.tif and kan1.tif.
>
> As such I could not generate English and Kannada data files from
> scratch using real text.
> Awaiting further valuable guidance. I also posted this under an issue.
> Regards,
> -sriranga(77yrsold)
>
> test1(eng).JPG (252K)
> test 1(kan).JPG (189K)
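P.S. If there are many training images, the box.train.stderr runs could be scripted rather than typed one at a time. Below is a minimal sketch, assuming tesseract is on the PATH and the images sit in one folder such as tessdata\eng\. The helper names train_command and run_training are made up for illustration; only the command line itself comes from the two runs I tried. The script does not check where the .tr files end up, only whether each run exited cleanly.

```python
import subprocess
from pathlib import Path


def train_command(image, out_base="junk"):
    """Build the argument list for one training run.

    Mirrors the command line used by hand:
        tesseract <image> junk nobatch box.train.stderr
    """
    return ["tesseract", str(image), out_base, "nobatch", "box.train.stderr"]


def run_training(image_dir):
    """Run the training command on every .tif in image_dir.

    Returns the file names for which tesseract exited with a non-zero
    status, so a failing image can be retried on its own.
    """
    failed = []
    for image in sorted(Path(image_dir).glob("*.tif")):
        result = subprocess.run(
            train_command(image), capture_output=True, text=True
        )
        if result.returncode != 0:
            # The .stderr config sends diagnostics to stderr, so show them.
            print(image.name, "failed:")
            print(result.stderr)
            failed.append(image.name)
    return failed
```

Calling run_training on the image folder (for example "tessdata/eng") returns an empty list when every run succeeds; anything in the list is an image worth retrying by hand to see the error.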

