There is a newer wrapper for 3.x version:
http://code.google.com/p/tesseractdotnet/w/list

I think it was made by the developer of VietOCR
--Sven


On Thu, Nov 15, 2012 at 5:06 PM, zdenko podobny <[email protected]> wrote:

> On Fri, Nov 9, 2012 at 1:43 PM, Troy Frazier <[email protected]> wrote:
>
>> Is it possible to search an image for a particular word using the Tessnet
>> wrapper?  I see that it is possible to limit your scan to certain
>> characters, but what I would like to do is to input a word and have all
>> instances of that word be highlighted somehow.  Can anyone point me in the
>> right direction?
>>
>> Additional Info: I'm using Tessnet because it also comes in 64-bit, and
>> that's what I need.
>>
>> I am not sure about Tessnet - AFAIK it was wrapping tesseract 2.0x...
>
> It should be possible in tesseract 3.02 (there is C++ or C API)  with
> something like this (c++ code snippet):
>
> ...
>
>     const char *searchForWord = "name";
>
>     api->Recognize(0);
>
>     tesseract::ResultIterator* ri = api->GetIterator();
>
>     tesseract::PageIteratorLevel level = tesseract::RIL_WORD;
>
>     if (ri != 0) {
>
>        do {
>
>             const char* text = ri->GetUTF8Text(level);
>
>                 if (strcmp(text, searchForWord) == 0) {
>
>                 float conf = ri->Confidence(level);
>
>                 int x1, y1, x2, y2;
>
>                 ri->BoundingBox(level, &x1, &y1, &x2, &y2);
>
>                 printf("text: '%s''; conf: %.2f; BoundingBox: %d,%d,%d,%d;\n",
>
>                                            text, conf, x1, y1, x2, y2);
>
>             }
>
>             delete[] text;
>
>         } while (ri->Next(level));
>
>     }
>
> ...
>
> Instead of printf use your highlight routine (e.g. different
> highlight color based on confidence value ;-) )...
>
> --
> Zdenko
>
>  --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>



-- 
``All that is gold does not glitter,
  not all those who wander are lost;
the old that is strong does not wither,
  deep roots are not reached by the frost.
>From the ashes a fire shall be woken,
  a light from the shadows shall spring;
renewed shall be blade that was broken,
  the crownless again shall be king.”

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to