I'm sorry for this. I found the two causes of the differences: First, our code was converting all to lower case, so this part was stupid to ask Second, seems that the tesseract executable uses the option tesseract::PSM_AUTO by default, but while calling from C++ you have to explicitly call api.SetPageSegMode(tesseract::PSM_AUTO);
El martes, 10 de abril de 2012 16:24:54 UTC-3, Juan Cardelino escribió: > > Dear all, > In the first place, thanks for keeping up this great library. > I'm doing some tests and when I run tesseract from the command line, I get > all the words with the correct casing. However when I call it from a C++ > program, all letters get converted to lowercase. Is there any option that I > am missing? > I've read something about whitelisting uppercase characters, but I'm not > really sure how to use it. Shall I list all possible uppercase characters? > Thanks in advance. > Best regards, > Juan > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

