[ocropus] Re: Train OCROPUS using word images

pkt Mon, 18 Jul 2011 00:34:57 -0700

On 15 Ιούλ, 13:58, Raj Julha <[email protected]> wrote:
> Hi
>
> I notice that due to the complexity of extracting characters from
> handwritten historical texts some research has been done in
> recognising words by using word images. I'm planning to train the
> engine for a specific handwriting so I'd like to use a set of word
> images and their corresponding transcription. Can I do that with
> OCROPUS? If yes any hints on how I could do that?


I think you are refering to "segmentation-free" recognition.
AFAIK Ocropus doesn't have a "segmentation-free" recogniser,
so if you insist on taking this route you would have to implement
the recognition step (line images -> character hypotheses FST)
yourself I think.

On the other hand, it seems Ocropus' recognizer approach
(oversegmentation + MLP neural net) was initially developed
for handwriting recognition so if the handwriting you care about
is not cursive / very different from type, you can give it a try.

Wrt training, from what it seems from the latest posted screencasts,
Ocropus can be trained fine with word-level, line-level or even
page-level ground truth and it will try to do the alignment itself.



-- 
You received this message because you are subscribed to the Google Groups 
"ocropus" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/ocropus?hl=en.

[ocropus] Re: Train OCROPUS using word images

Reply via email to