Well, this as interesting question for me too.

A high-level interface (like "ResultIterator") for getting best
results at word level does not exist. You need to dive into the guts
of Tesseract. I can't give a ready recipe for this, only I can give
you some kind of a probable guiding thread:

- Look into Wordrec::cc_recog()
- Look into Dict::DebugWordChoices()
- Check Dict::best_choices_ member

It'd be great if share your findings with the community when you
complete your investigation.

Warm regards,
Dmitri Silaev





On Tue, Apr 12, 2011 at 10:55 PM, Amrit <[email protected]> wrote:
> Thanks Dmitri,
>
> As a follow up, is there any way this can be done at a word level.As
> in if is a way I can get the top three most likely decoding results
> from tesseract for a given text image(individual words)
>
>
> Regards,
> Amrit.
>
> On Apr 12, 12:45 am, Dmitri Silaev <[email protected]> wrote:
>> One more subtlety:
>>
>>    // This ensures Tesseract's "blob_choices" structures are filled
>>    SetVariable("save_best_choices", "T");
>>
>>
>>
>>
>>
>>
>>
>> On Tue, Apr 12, 2011 at 12:36 AM, Amrit <[email protected]> wrote:
>> > Hi All,
>> >         Is there a way of obtaining a N-best result list from
>> > tesseract output? Something similar to generating the decoding
>> > hypothesis.I was looking for a way to examine the results from the
>> > tesseract word decoding on my earlier posted task of identifying
>> > postal codes from a raw image.
>> >         Please do let me know if anyone has come across a similar
>> > implementation.Any help is highly appreciated.
>>
>> > Regards,
>> > Amrit.
>>
>> > --
>> > You received this message because you are subscribed to the Google Groups 
>> > "tesseract-ocr" group.
>> > To post to this group, send email to [email protected].
>> > To unsubscribe from this group, send email to 
>> > [email protected].
>> > For more options, visit this group 
>> > athttp://groups.google.com/group/tesseract-ocr?hl=en.
>
> --
> You received this message because you are subscribed to the Google Groups 
> "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to 
> [email protected].
> For more options, visit this group at 
> http://groups.google.com/group/tesseract-ocr?hl=en.
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en.

Reply via email to