Hi

I have the following:
1. A full-page handwritten image sample
2. The bounding boxes (coordinates) of each word on the page/image
3. A text file containing the corresponding ground truth for each word
on the image.

My first attempt was to create an image snippet of each word and a
corresponding text file containing the word. I thought I'd use these as
input for training, but I couldn't proceed any further because the
ocropus-align command doesn't seem to match the image to the text.
Should I programmatically create the page-level pseg file instead?
Or should I create one image per line (I have the necessary
coordinates per line)? The reason I took the per-word route was to keep
the engine from creating line images incorrectly, as the handwritten
text does not run in a straight line.
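
In case it helps to make the per-line option concrete, here is a rough
sketch of how I could build a line-level transcription from the data I
have. It is only an illustration: it assumes boxes are (x0, y0, x1, y1)
pixel tuples, that a word belongs to the line box containing its
center, and that words read left to right. The function name and the
data layout are made up for this example and are not part of ocropus.

```python
from typing import List, Tuple

# Assumed layout: (x0, y0, x1, y1) in pixels, origin at the top-left.
Box = Tuple[int, int, int, int]


def words_to_line_text(line_boxes: List[Box],
                       words: List[Tuple[Box, str]]) -> List[str]:
    """Return one ground-truth string per line box.

    A word is assigned to the first line box that contains its center.
    Words that fall inside no line box are skipped.
    """
    lines: List[List[Tuple[int, str]]] = [[] for _ in line_boxes]
    for (x0, y0, x1, y1), text in words:
        cx, cy = (x0 + x1) / 2, (y0 + y1) / 2
        for i, (lx0, ly0, lx1, ly1) in enumerate(line_boxes):
            if lx0 <= cx <= lx1 and ly0 <= cy <= ly1:
                lines[i].append((x0, text))
                break
    # Order the words of each line by their left edge and join them.
    return [" ".join(t for _, t in sorted(ws, key=lambda w: w[0]))
            for ws in lines]


if __name__ == "__main__":
    line_boxes = [(0, 0, 100, 20), (0, 20, 100, 40)]
    words = [((50, 2, 90, 18), "world"),
             ((5, 3, 40, 17), "hello"),
             ((10, 22, 60, 38), "again")]
    for text in words_to_line_text(line_boxes, words):
        print(text)
```

Each resulting string would then be saved next to the matching line
crop; the cropping itself and the file naming expected by the training
tools are left out here because I am not sure of them.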

I would appreciate any input on how I can proceed to train the engine.

Thanks

Raj
