'mylang' should be the ISO 639-2 language code for the language you are
training for. I'm not sure that a long name would produce that error, but
it might.
Sven

On Sunday, May 19, 2013, wrote:

> Hello group,
>
> I have lots of images that I have combined into one "long" (the images are
> appended side by side) image using convert from the imagemagick suite. I
> ran the long image through tesseract to create a box file, which it did.
> Checking the image and the boxfile using jTessBoxEditor (which is a genius
> tool!), I saw that tesseract had identified several characters, but guessed
> a lot of them wrongly. I corrected the boxes and letters and generated
> files according to the wiki for the purpose of training. I followed each
> step, which seems to have worked nicely, and I ended up with a
> mylang.traineddata file.
>
> When I try to run tesseract on one of the sample images that I used when
> combining, i.e tesseract myimage.png result -l mylang, tesseract says
> "Empty page!!". If I run it on the file I used for training, it reports
> gibberish for all letters.
>
> Any ideas what I am doing wrong?
> Have I totally misunderstood how tesseract works?
>
> Looking forwards to your reply and thanks in advance!
>
> PS: Attached is the training-file and the box file for inspection if
> needed.
>
> --
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to 
> [email protected]<javascript:_e({}, 'cvml', 
> '[email protected]');>
> To unsubscribe from this group, send email to
> [email protected] <javascript:_e({}, 'cvml',
> 'tesseract-ocr%[email protected]');>
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>
> ---
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected] <javascript:_e({},
> 'cvml', 'tesseract-ocr%[email protected]');>.
> For more options, visit https://groups.google.com/groups/opt_out.
>
>
>


-- 
``All that is gold does not glitter,
  not all those who wander are lost;
the old that is strong does not wither,
  deep roots are not reached by the frost.
>From the ashes a fire shall be woken,
  a light from the shadows shall spring;
renewed shall be blade that was broken,
  the crownless again shall be king.”

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.


Reply via email to