'mylang' should be the ISO 639-2 language code for the language you are training for. I'm not sure that a long name would produce that error, but it might.

Sven
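As a rough illustration of the naming suggestion above, here is a minimal Python sketch that checks whether a language name looks like a three-letter ISO 639-2 style code and whether the matching `<lang>.traineddata` file exists in a tessdata directory. The helper names (`looks_like_iso639_2`, `traineddata_path`, `check_language`) and the example directory are hypothetical, and Tesseract itself may well accept other names; this only mirrors the convention suggested above.

```python
import re
from pathlib import Path

# Assumption: an ISO 639-2 style code is three lowercase ASCII letters
# (for example "eng" or "deu"). Tesseract may not enforce this itself.
_CODE_RE = re.compile(r"[a-z]{3}")


def looks_like_iso639_2(lang: str) -> bool:
    """Return True if `lang` is shaped like a three-letter language code."""
    return _CODE_RE.fullmatch(lang) is not None


def traineddata_path(lang: str, tessdata_dir: str) -> Path:
    """Build the path where `<lang>.traineddata` is expected for `-l <lang>`."""
    return Path(tessdata_dir) / f"{lang}.traineddata"


def check_language(lang: str, tessdata_dir: str) -> list:
    """Collect human-readable problems; an empty list means nothing was found."""
    problems = []
    if not looks_like_iso639_2(lang):
        problems.append(f"'{lang}' is not a three-letter lowercase code")
    path = traineddata_path(lang, tessdata_dir)
    if not path.is_file():
        problems.append(f"{path} does not exist")
    return problems


if __name__ == "__main__":
    # Hypothetical tessdata location; adjust to the local installation.
    for problem in check_language("mylang", "/usr/share/tessdata"):
        print(problem)
```

If the check complains about the name, renaming the training files to a three-letter code and regenerating the traineddata file would be the thing to try, though as said above it may not be the cause of the "Empty page!!" message.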
On Sunday, May 19, 2013, wrote:
> Hello group,
>
> I have lots of images that I have combined into one "long" image (the
> images are appended side by side) using convert from the ImageMagick suite.
> I ran the long image through tesseract to create a box file, which it did.
> Checking the image and the box file using jTessBoxEditor (which is a genius
> tool!), I saw that tesseract had identified several characters, but guessed
> a lot of them wrongly. I corrected the boxes and letters and generated
> files according to the wiki for the purpose of training. I followed each
> step, which seems to have worked nicely, and I ended up with a
> mylang.traineddata file.
>
> When I try to run tesseract on one of the sample images that I used when
> combining, i.e. tesseract myimage.png result -l mylang, tesseract says
> "Empty page!!". If I run it on the file I used for training, it reports
> gibberish for all letters.
>
> Any ideas what I am doing wrong?
> Have I totally misunderstood how tesseract works?
>
> Looking forward to your reply and thanks in advance!
>
> PS: Attached is the training file and the box file for inspection if
> needed.
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
--
"All that is gold does not glitter,
not all those who wander are lost;
the old that is strong does not wither,
deep roots are not reached by the frost.
From the ashes a fire shall be woken,
a light from the shadows shall spring;
renewed shall be blade that was broken,
the crownless again shall be king."

