[tesseract-ocr] Re: All-caps, small-caps

bácsi Kazi Sun, 27 Dec 2015 12:43:55 -0800

Could you help? Have I missed something blatantly trivial?
Any help would be highly appreciated!


Kazi

2015. december 15., kedd 8:33:27 UTC+1 időpontban bácsi Kazi a következőt 
írta:

> Hi there! 
>
> I'm playing with Tesseract 3.02, and I would need precise recognition of 
> capital letters. Unfortunately my files are full of all caps and small 
> caps. During the training if I included such words in the sample, I got 
> random capitals in the rest of the text. I thought I would try to put them 
> into a new font, same. I included them in the dictionary files, somewhat 
> better, but still problematic at letter o, u, v etc. I.e. HELLo WoRLD & 
> friends, despite having HELLO WORLD in dictionary. 
> It's quite similar to this: 
> https://code.google.com/p/tesseract-ocr/issues/detail?id=691 
> What is your experience? How to train Tesseract for caps? Is it better in 
> later versions? Is there a configuration parameter to set? 
> Thanks! 
>
> Kazi

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/16a46021-43b9-484f-a66f-e3b077b4aadb%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[tesseract-ocr] Re: All-caps, small-caps

Reply via email to