Let me explain. I'm using OpenCV to preprocess the image and obtain the image of a license plate (binaria2.png, for example), and then I want to run tesseract to obtain the characters from that license plate. I don't mind whether the language is Spanish or any other language. I've replaced the function I had with the one you provided, using leptonica and Pix instead of Mat, and when I ran the program an error was displayed. It is the same error I showed before:
Error: Illegal min or max specification!
signal_termination_handler:Error:Signal_termination_handler called:Code 5002

About the PSM: I've tried more than just 7 (Single Line, Auto PSM, Single Block, ...) and I get the same result. When I do obtain some characters, they are not correct. For example, for a license plate like 0211 JCW, the result I obtained yesterday was EEEEEEIIIE or something like that; I don't remember exactly. If I run tesseract from the console on the same image I obtain the desired result, which is why this is driving me crazy. I don't understand the reason.

Thank you.

On Tuesday, December 10, 2013 08:47:22 UTC+1, zdenop wrote:
>
> Can you please provide the exact error message? If it is in Spanish because of
> your locale, try to run
> LC_ALL=C ./api_test
>
> What is the result of the tesseract executable (tesseract binaria2.png
> binaria2 -psm 7)?
>
> Zdenko
>
> On Tue, Dec 10, 2013 at 8:37 AM, adrian company <[email protected]> wrote:
>
>> I'm using Ubuntu 12.04, with tesseract version 3.02 and Eclipse CDT.
>>
>> On Tuesday, December 10, 2013 08:17:31 UTC+1, zdenop wrote:
>>>
>>> What OS do you use?
>>> Which tesseract version?
>>> What compiler did you use?
>>>
>>> Zdenko
>>>
>>> On Tue, Dec 10, 2013 at 7:41 AM, adrian company <[email protected]> wrote:
>>>
>>>> Hi Zdenko,
>>>> I've tried to use the code you posted here for using leptonica, and
>>>> that gives me an error saying something about a min/max specification
>>>> (Error: Illegal min or max specification!
>>>> signal_termination_handler:Error:Signal_termination_handler
>>>> called:Code 5002)
>>>>
>>>> I've changed the OCR.SetRectangle call and the same error is displayed. I've
>>>> also tried with another image, with the same result.
>>>>
>>>> On Monday, December 9, 2013 22:02:13 UTC+1, zdenop wrote:
>>>>>
>>>>> 1. Instead of a function listing it is better to provide a small test
>>>>>    case. It saves time for testers...
>>>>> 2. Skip code that is not relevant (e.g. if you are testing the tesseract
>>>>>    API, open the image with a leptonica function and not with opencv...)
>>>>> 3. You need to fix the perspective of the image first, so you have some
>>>>>    border around the text. See, I did it in gimp, but maybe you can do it
>>>>>    in opencv too...
>>>>>
>>>>> Zdenko
>>>>>
>>>>> On Mon, Dec 9, 2013 at 1:15 PM, adrian company <[email protected]> wrote:
>>>>>
>>>>>> Hi Nick,
>>>>>> I've taken a look at api/tesseractmain.cpp as you recommended, but I
>>>>>> cannot find anything wrong, I think. Anyway, I can post my program here
>>>>>> and try to work out what is going on with your help.
>>>>>> This is my method:
>>>>>> ___________________________________________________________________
>>>>>> void recognizeChar(Mat imagen){
>>>>>>
>>>>>>     /* INITIALIZE (TESSERACT) */
>>>>>>     putenv("TESSDATA_PREFIX=/usr/local/share/");
>>>>>>     setlocale(LC_NUMERIC, "C");
>>>>>>     tesseract::TessBaseAPI OCR;
>>>>>>
>>>>>>     if (OCR.Init(NULL, "spa")){
>>>>>>         fprintf( stderr, "could not initialize tesseract.... \n" );
>>>>>>         exit(1);
>>>>>>     }
>>>>>>     /* CONFIGURE */
>>>>>>     OCR.SetPageSegMode(tesseract::PSM_SINGLE_LINE);
>>>>>>     OCR.SetVariable("tessedit_char_whitelist", " ABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789 "); // whitelist
>>>>>>     OCR.SetVariable("tessedit_char_blacklist", "<> abcdefghijklmnopqtrstuvwxyz./!¡$%&?¿,;+-#"); // blacklist
>>>>>>     OCR.SetImage(imagen.data, imagen.size().width,
>>>>>>                  imagen.size().height, imagen.channels(), imagen.step1());
>>>>>>     OCR.TesseractRect(imagen.data, 0, imagen.step1(), 0, 0,
>>>>>>                       imagen.cols, imagen.rows);
>>>>>>
>>>>>>     /* GET THE RECOGNIZED TEXT */
>>>>>>     char* texto = OCR.GetUTF8Text();
>>>>>>     string t1=texto;
>>>>>>     t1.erase( remove(t1.begin(), t1.end(), '\n'), t1.end() );
>>>>>>     cout << "TEXTO: "<<t1.c_str() <<endl;
>>>>>> }
>>>>>> ___________________________________________________________________
>>>>>> Thank you all.
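To make Zdenko's point 3 above (leaving some border around the text) concrete, here is a minimal sketch in plain C++. It is only an illustration under stated assumptions: `addWhiteBorder` is a hypothetical helper, not part of Tesseract, Leptonica or OpenCV, and it assumes an 8-bit, single-channel, row-major buffer with no row padding and dark characters on a white background.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not a Tesseract, Leptonica or OpenCV function).
// Returns a copy of an 8-bit, single-channel image surrounded by a white
// margin of `border` pixels on every side.
// Assumptions: row-major layout, no row padding, 255 means white.
std::vector<std::uint8_t> addWhiteBorder(const std::vector<std::uint8_t>& src,
                                         std::size_t width,
                                         std::size_t height,
                                         std::size_t border) {
    // The buffer must hold exactly width * height pixels.
    assert(src.size() == width * height);

    const std::size_t outWidth = width + 2 * border;
    const std::size_t outHeight = height + 2 * border;

    // Start from an all-white canvas, then copy the source into its centre.
    std::vector<std::uint8_t> dst(outWidth * outHeight, 255);
    for (std::size_t y = 0; y < height; ++y) {
        for (std::size_t x = 0; x < width; ++x) {
            dst[(y + border) * outWidth + (x + border)] = src[y * width + x];
        }
    }
    return dst;
}
```

The padded buffer could then be passed to `SetImage` with the new width and height, 1 byte per pixel, and the new width as the bytes-per-line value. Whether a border of a given size actually improves recognition on this plate is something to check against the command-line result.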
>>>>>>
>>>>>> On Tuesday, December 3, 2013 11:29:58 UTC+1, Nick White wrote:
>>>>>>
>>>>>>> Hi Adrian,
>>>>>>>
>>>>>>> Well then your C++ program must be wrong in some way. The command
>>>>>>> line version doesn't do anything special, it just uses the API like
>>>>>>> anything else. Take a look at api/tesseractmain.cpp to check how
>>>>>>> your API usage differs, to find your bug.
>>>>>>>
>>>>>>> Nick
>>>>>>>
>>>>>>> On Tue, Dec 03, 2013 at 01:16:40AM -0800, adrian company wrote:
>>>>>>> > Hi Sventech,
>>>>>>> > I've tested the image with the command line version and I get the
>>>>>>> > same result as you. But when I use my own software in C++ I cannot
>>>>>>> > obtain the same result; I simply get nothing. Currently I am using
>>>>>>> > PSM_SINGLE_LINE, but as I've said before, I've tried all the page
>>>>>>> > seg modes.
>>>>>>> > I don't know what is wrong. I've reinstalled tesseract and it does
>>>>>>> > the same.
>>>>>>> >
>>>>>>> > On Tuesday, December 3, 2013 07:29:11 UTC+1, adrian company wrote:
>>>>>>> >
>>>>>>> > And about the page seg mode: I've tried all of them but I still get
>>>>>>> > nothing.
>>>>>>> >
>>>>>>> > On Monday, December 2, 2013 16:13:17 UTC+1, sventech wrote:
>>>>>>> >
>>>>>>> > I get
>>>>>>> > V! 2\"03ENl
>>>>>>> > so you could postprocess that kind of thing to get better results --
>>>>>>> > you need to eliminate the black border for best results. You may
>>>>>>> > need to remove noise. What page seg mode are you using? Make sure
>>>>>>> > you test with the command line version before you try your own.
>>>>>>> > Also, I'm using the latest version 3.02.02
>>>>>>> > --Sven
>>>>>>> >
>>>>>>> > On Mon, Dec 2, 2013 at 6:18 AM, adrian company <[email protected]> wrote:
>>>>>>> >
>>>>>>> > Hi again, I've tried to deskew the first image and pass it to
>>>>>>> > tesseract at a larger size, but I get the same result: the numbers
>>>>>>> > and letters are not recognized by tesseract. I'm posting an image
>>>>>>> > where you can see how my image looks now.
>>>>>>> > Any idea?
>>>>>>> > Thanks in advance again.
>>>>>>> >
>>>>>>> > On Thursday, October 31, 2013 07:22:53 UTC+1, adrian company wrote:
>>>>>>> >
>>>>>>> > Thanks Sventech, I'll try to deskew the first one. I'm using opencv
>>>>>>> > to prepare the image, so I cannot use any other program to prepare
>>>>>>> > it. I've tried to rotate the image and pass it to tesseract with the
>>>>>>> > text horizontal, but tesseract outputs the same. I will also try to
>>>>>>> > pass it in png format and see the result.
>>>>>>> >
>>>>>>> > On Wednesday, October 30, 2013 3:21:58 PM UTC+1, sventech wrote:
>>>>>>> >
>>>>>>> > In the first image you need to deskew it. There are free programs
>>>>>>> > for preparing the image. The second image appears to be too low
>>>>>>> > resolution (or letter pixel height, to be precise). Approx.
>>>>>>> > 200-300dpi is ideal for tesseract's default training. Also, JPEG is
>>>>>>> > not a good format for text. Internally it will convert to TIFF or
>>>>>>> > PNG.
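As a rough illustration of the post-processing Sven suggests in the quoted thread, the sketch below strips everything except letters and digits from raw OCR output and then checks it against a plate pattern. The helper names `keepPlateChars` and `looksLikePlate` are hypothetical, and the pattern (four digits followed by three letters) is only an assumption taken from the example plate 0211 JCW mentioned in this thread, not a full specification of any plate format.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper: keeps only ASCII letters and digits from raw OCR
// output, converting lowercase letters to uppercase. Spaces, newlines and
// punctuation are dropped.
std::string keepPlateChars(const std::string& raw) {
    std::string out;
    for (char c : raw) {
        if ((c >= '0' && c <= '9') || (c >= 'A' && c <= 'Z')) {
            out += c;
        } else if (c >= 'a' && c <= 'z') {
            out += static_cast<char>(c - 'a' + 'A');
        }
    }
    return out;
}

// Hypothetical check. Assumed format: four digits followed by three
// uppercase letters, as in the example "0211 JCW" after filtering.
bool looksLikePlate(const std::string& s) {
    if (s.size() != 7) return false;
    for (std::size_t i = 0; i < 4; ++i) {
        if (s[i] < '0' || s[i] > '9') return false;
    }
    for (std::size_t i = 4; i < 7; ++i) {
        if (s[i] < 'A' || s[i] > 'Z') return false;
    }
    return true;
}
```

A check like this cannot repair a bad recognition such as EEEEEEIIIE; it only lets the program reject it and, for instance, retry with different preprocessing.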
>>>>>>> >
>>>>>>> > On Wed, Oct 30, 2013 at 6:50 AM, adrian company <[email protected]> wrote:
>>>>>>> >
>>>>>>> > Hi all, I am trying to write software to recognize some text from an
>>>>>>> > image, but when I binarize the image and call the tesseract engine,
>>>>>>> > it does not recognize the text in the image. Does somebody know why
>>>>>>> > the text is not recognized? Must I do something extra to recognize
>>>>>>> > it?
>>>>>>> > I attach the image in which I am trying to recognize text (a license
>>>>>>> > plate). For this attached image the tesseract output is nothing.
>>>>>>> >
>>>>>>> > I've also tried to recognize text from another image (Fuma) and in
>>>>>>> > this case the output is: "L I".
>>>>>>> >
>>>>>>> > Could anybody help me?
>>>>>>> >
>>>>>>> > What could be happening?
>>>>>>> >
>>>>>>> > Thanks in advance.
>>>>>>> > Adri
>>>>>>> >
>>>>>>> > --
>>>>>>> > --
>>>>>>> > You received this message because you are subscribed to the Google
>>>>>>> > Groups "tesseract-ocr" group.
>>>>>>> > To post to this group, send email to [email protected]
>>>>>>> > To unsubscribe from this group, send email to
>>>>>>> > [email protected]
>>>>>>> > For more options, visit this group at
>>>>>>> > http://groups.google.com/group/tesseract-ocr?hl=en
>>>>>>> >
>>>>>>> > ---
>>>>>>> > You received this message because you are subscribed to the Google
>>>>>>> > Groups "tesseract-ocr" group.
>>>>>>> > To unsubscribe from this group and stop receiving emails from it,
>>>>>>> > send an email to [email protected].
>>>>>>> > For more options, visit https://groups.google.com/groups/opt_out.
>>>>>>> >
>>>>>>> > --
>>>>>>> > ``All that is gold does not glitter,
>>>>>>> > not all those who wander are lost;
>>>>>>> > the old that is strong does not wither,
>>>>>>> > deep roots are not reached by the frost.
>>>>>>> > From the ashes a fire shall be woken,
>>>>>>> > a light from the shadows shall spring;
>>>>>>> > renewed shall be blade that was broken,
>>>>>>> > the crownless again shall be king.”

