Pls can you expand on each of your proposals. The end goal which you might find interesting is to benchmark the text localisation and segmentation capability against dataset. The aim is to fully understand how good the detector in tesseract is.
I understand that tesseract uses leptonica for this purpose. So I am looking for the shortest path to my goal. From: Dmitri Silaev Sent: 25 March 2011 06:19 To: [email protected] Cc: BYTEFX Subject: Re: tesseract api, how do you get the bbox co-ordinates in commandline using the exe in win32 Well, I still don't get your final goal, maybe it could be easier to suggest something having known what you try to achieve. However if you'd decide to dive into programming, a better way of getting rects is using the ResultIterator/PageIterator interface. Also you can benefit of knowing that generation of .box files also provides you with rect coords... You can count lines, calculate widths and heights... There's a number of text file processing utilities... Well, probably you know what to do. Btw examining control paths used to generate .box files is a good point to devise your own blob rect dumper with minimal effort. Warm regards, Dmitri Silaev On Thu, Mar 24, 2011 at 6:48 PM, BYTEFX <[email protected]> wrote: > hi thanks for the pointer. i have set this up but this does not answer > my questions. > > let me explain: > > for a sample image sent to tesseract for processing, "GetRegions" > function can be called from the api. > i can figure this out and print out the results from within tesseract > but i imagined this must have been done elsewhere in the forum ! > > On Mar 24, 3:11 pm, Dmitri Silaev <[email protected]> wrote: >> I'm not sure if it's exactly what you want, but at first you can try >> to create a config file with the following line inside: >> >> textord_oldbl_debug T >> >> Since the output can be quite long and might not fit into the console >> window, you can also specify: >> >> debug_file tesseract.log >> >> I suspect any other debug info is only accessible from the ScrollView >> facility. You can read the Wiki and search this forum for "ScrollView" >> to find out more. If you are on Windows, you can use my article >> athttp://rdaemons.blogspot.com/2011/02/tesseract-ocr-setting-up-interac... >> to make your ScrollView installation process quicker >> >> Warm regards, >> Dmitri Silaev >> >> >> >> On Thu, Mar 24, 2011 at 2:41 PM, BYTEFX <[email protected]> wrote: >> > Hi, >> >> > i am interested in understanding the internal detector of tesseract >> > 3.00 in win32. >> >> > How do i go about printing out to winconsole the detected Rects from >> > tesseract. >> > is there a commandline arg for this, or where in the code (baseapi) >> > can i start from the get this information. >> >> > Basically need the output format, [x,y,width & height], and count of >> > Rect's identified from tesseract. >> >> > best >> >> > bytefx. >> >> > -- >> > You received this message because you are subscribed to the Google Groups >> > "tesseract-ocr" group. >> > To post to this group, send email to [email protected]. >> > To unsubscribe from this group, send email to >> > [email protected]. >> > For more options, visit this group >> > athttp://groups.google.com/group/tesseract-ocr?hl=en.- Hide quoted text - >> >> - Show quoted text - > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To post to this group, send email to [email protected]. > To unsubscribe from this group, send email to > [email protected]. > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en. > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

