Hi, Thanks for the response. I looked for those functions you mentioned and I noticed that they are only in the SVN. I am working with the tesseract-3.00 source archive because I was unable to compile tesseract from SVN on OS X.
But I have made pretty good progress in 3.00. I've been able to get it to recognize some of my sample glyphs. I've converted them into lists of line segments which I insert into a PBLOB. From there I create a WERD and insert it into a ROW, and then a BLOCK. After that I am running Tesseract::recog_all_words(). I don't have consistent results yet though, I think this is because I don't have my initial state set up correctly for the ROW and BLOCK. I am wondering now if I should try to get what is on the SVN to compile. It looks like the API has changed a bit. Thanks, Leaf On Mar 24, 2:30 pm, Dmitri Silaev <[email protected]> wrote: > You can begin your journey to the guts of Tesseract from the > TessBaseAPI::MakeTBLOB() > function. Its result then can be used e.g. in LearnBlob() from > "blobclass.cpp" or > TessBaseAPI::RunAdaptiveClassifier() > > However, if you say you have curves besides line segments, you need to > restore your glyphs > in the bitmap form and then run the training/recognition in a usual > way. This is because afaik Tess > doesn't support curve approximations of glyph outlines, it only uses > straight segments. Or, to do > this programmatically, you might use the > MakeTBLOB()/RunAdaptiveClassifier() approach. > > HTH > > Warm regards, > Dmitri Silaev > > > > > > > > On Wed, Mar 23, 2011 at 9:28 PM, leaf corcoran <[email protected]> wrote: > > I have a collection of unknown glyphs for letters that I am trying to > > identify in the form of a polygon made of line segments and curves. > > I'm wondering if tesseract would be a good candidate for this. I've > > been looking through the source code now but I haven't been able to > > isolate any part that does what I am trying to achieve. I am using > > tesseract 3.00 > > > I am looking at PBLOB right now, but I haven't seen a way to far to > > get around the page and word recognition and go right to identifying > > characters. From what I have read it looks like the classify directory > > is where I should be looking but I am not so sure. > > > Thanks for any help, > > Leaf > > > -- > > You received this message because you are subscribed to the Google Groups > > "tesseract-ocr" group. > > To post to this group, send email to [email protected]. > > To unsubscribe from this group, send email to > > [email protected]. > > For more options, visit this group > > athttp://groups.google.com/group/tesseract-ocr?hl=en. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

