Can you post examples?
Sven

On Thursday, August 29, 2013, georg wrote:

> Hi,
>
> Does anyone have any suggestions? I haven't heard back.
>
> Additionally, I have two more questions:
>
> 1. It looks like Tesseract misrecognizes characters when they are bolder
> (thicker line strokes) than in the original training image. Is this
> correct? Is there a way to fix that?
>
> 2. We tried to train a character, but jTessbox only drew boxes around some
> characters (see posted image) and not all of them, although they seem very
> much alike. Why is that?
>
> I would very much appreciate some input as we are hitting a brick wall
> with this one.
>
> Thanks
>
> Georg
>
> On Monday, 26 August 2013 13:43:43 UTC+2, georg wrote:
>>
>> Hello,
>>
>> I have a question regarding language files.
>>
>> We have a set of characters, some of which are sometimes cut off.
>>
>> It is my understanding that I cannot train very different-looking
>> characters in one set, because it confuses Tesseract.
>>
>> I would like to generate two TIFFs (one for complete characters and one
>> for cut-off ones) and then do the mft training.
>>
>> Is it true that mft training assembles both TIFFs into one language file
>> and runs Tesseract twice, first with the TIFF for the whole characters and
>> then with the one for the cut-off characters?
>>
>> Does Tesseract keep the TIFFs separate although they are in the same
>> language file?
>>
>> How would you approach this problem? I want to keep the training
>> process as simple as possible (it is already complicated enough).
>>
>> Thanks for your help!
>>
>> Take care,
>>
>> Georg
>>
>>
>>  --
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>
> ---
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> For more options, visit https://groups.google.com/groups/opt_out.
>


-- 
“All that is gold does not glitter,
  not all those who wander are lost;
the old that is strong does not wither,
  deep roots are not reached by the frost.
From the ashes a fire shall be woken,
  a light from the shadows shall spring;
renewed shall be blade that was broken,
  the crownless again shall be king.”

