Hi I have the following: 1. A full page handwritten image sample 2. The bounding boxes (coordinates) of each word on the page/image 3. A text file having the corresponding ground truth of each word on the image.
My first attempt was to create an image snippet of each word and a corresponding text file containing the word. I thought I'd use this as input for training but couldn't proceed any further because the ocroopus-align command doesn't seem to match the image/text. Should I programmatically create the page level pseg file? Or should I rather create an image per line (i have the necessary coordinates per line). The reason I took this route was to avoid the engine to incorrectly create line images as the handwritten text is not a straght line. I would appreciate any input on how I can proceed to train the engine. Thanks Raj -- You received this message because you are subscribed to the Google Groups "ocropus" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/ocropus?hl=en.
