Can you post examples? Sven On Thursday, August 29, 2013, georg wrote:
> Hi, > > Did anyone have any suggestions? - I haven't heard back... > > Additionally I have 2 more questions: > > 1. It looks like tesseract messes up when the characters are more bold > (line thickness is bigger) than the original trained image.. Is this > correct? - Is there a way to fix that. > > 2. We tried to train a character, but jTessbox only drew boxes around some > characters (see posted image) and not all of them, although they seem very > much alike. Why is that? > > I would very much appreciate some input as we are hitting a brick wall > with this one. > > Thanks > > Georg > > Am Montag, 26. August 2013 13:43:43 UTC+2 schrieb georg: >> >> Hello, >> >> I have a question regarding language files. >> >> We have a set of characters, which sometimes has cut off characters. >> >> It is my understanding that I can not train very different looking >> characters in one set, because it causes tesseract to get confused. >> >> I would like to generate 2 tiffs (one for complete characters and one for >> cut off ones) and then do the mft training. >> >> Is it true that mft training assembles both tiffs in one language file >> and runs tesseract twice, first with the tiff for the whole characters and >> once for the cut off characters? >> >> Does tesseract keep the tiffs separate although they are in the same >> language file? >> >> How would you work this problem? - I want to try and keep the training >> process as simple as possible (it is already complicated enough). >> >> Thanks for your help! >> >> Take care, >> >> Georg >> >> >> -- > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to > [email protected]<javascript:_e({}, 'cvml', > '[email protected]');> > To unsubscribe from this group, send email to > [email protected] <javascript:_e({}, 'cvml', > 'tesseract-ocr%[email protected]');> > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > > --- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected] <javascript:_e({}, > 'cvml', 'tesseract-ocr%[email protected]');>. > For more options, visit https://groups.google.com/groups/opt_out. > -- ``All that is gold does not glitter, not all those who wander are lost; the old that is strong does not wither, deep roots are not reached by the frost. >From the ashes a fire shall be woken, a light from the shadows shall spring; renewed shall be blade that was broken, the crownless again shall be king.” -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

