Could you please post also testing image? Zdenko
On Thu, Jul 3, 2014 at 12:22 PM, elena bresciani < [email protected]> wrote: > Dear all, > > I need to integrate Tesseract in a C++ project. > First I simply called Tesseract from command line and, after setting up a > spefic configuration I've come to satifying results. > > This is the config file "pharma" > > load_system_dawg 0 >> load_freq_dawg 0 >> load_punc_dawg 0 >> user_words_suffix pharma-words >> tessedit_char_whitelist >> abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789, >> language_model_penalty_non_dict_word 0 >> > > > Now that I have to do the same thing with a Tesseract API I have terrible > results, like down to 10% of correct identification and 90% garbage. > I must be missing something in the conversion to the API... > > This is my code > > #include <tesseract/baseapi.h> >> #include <leptonica/allheaders.h> >> >> int main(int argc, char *argv[]) >> { >> char *outText; >> >> tesseract::TessBaseAPI *api = new tesseract::TessBaseAPI(); >> >> api -> Init("/usr/local/share/","ita"); >> api -> ReadConfigFile ("pharma"); >> >> >> Pix *image = pixRead (argv[1]); >> api -> SetImage (image); >> api -> SetSourceResolution(600); >> >> outText = api -> GetUTF8Text(); >> printf ("OCR output: \n%s", outText); >> >> api -> End(); >> delete [] outText; >> pixDestroy (&image); >> >> return 0; >> >> } >> > > > Can somebody help me undestand please? > > Thanks in advance > > Elena > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at http://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/7dd534f7-3e85-480f-bb81-3d34c7af0c05%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/7dd534f7-3e85-480f-bb81-3d34c7af0c05%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8zDL6-buukz9KaQijqrPsjtHwpseKQWESxy7tQmnX%3DdYA%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

