Hi Sven, Are the samples OK? - I haven't heard anything back yet...
Thanks Regards, Georg Am Donnerstag, 29. August 2013 15:33:54 UTC+2 schrieb sventech: > > Can you post examples? > Sven > > On Thursday, August 29, 2013, georg wrote: > >> Hi, >> >> Did anyone have any suggestions? - I haven't heard back... >> >> Additionally I have 2 more questions: >> >> 1. It looks like tesseract messes up when the characters are more bold >> (line thickness is bigger) than the original trained image.. Is this >> correct? - Is there a way to fix that. >> >> 2. We tried to train a character, but jTessbox only drew boxes around >> some characters (see posted image) and not all of them, although they seem >> very much alike. Why is that? >> >> I would very much appreciate some input as we are hitting a brick wall >> with this one. >> >> Thanks >> >> Georg >> >> Am Montag, 26. August 2013 13:43:43 UTC+2 schrieb georg: >>> >>> Hello, >>> >>> I have a question regarding language files. >>> >>> We have a set of characters, which sometimes has cut off characters. >>> >>> It is my understanding that I can not train very different looking >>> characters in one set, because it causes tesseract to get confused. >>> >>> I would like to generate 2 tiffs (one for complete characters and one >>> for cut off ones) and then do the mft training. >>> >>> Is it true that mft training assembles both tiffs in one language file >>> and runs tesseract twice, first with the tiff for the whole characters and >>> once for the cut off characters? >>> >>> Does tesseract keep the tiffs separate although they are in the same >>> language file? >>> >>> How would you work this problem? - I want to try and keep the training >>> process as simple as possible (it is already complicated enough). >>> >>> Thanks for your help! >>> >>> Take care, >>> >>> Georg >>> >>> >>> -- >> -- >> You received this message because you are subscribed to the Google >> Groups "tesseract-ocr" group. >> To post to this group, send email to [email protected] >> To unsubscribe from this group, send email to >> [email protected] >> For more options, visit this group at >> http://groups.google.com/group/tesseract-ocr?hl=en >> >> --- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> For more options, visit https://groups.google.com/groups/opt_out. >> > > > -- > ``All that is gold does not glitter, > not all those who wander are lost; > the old that is strong does not wither, > deep roots are not reached by the frost. > From the ashes a fire shall be woken, > a light from the shadows shall spring; > renewed shall be blade that was broken, > the crownless again shall be king.” > -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

