Hi Nick, I generated the uzn file and added some bbox values of 486 110 635 142 which has only one word of the whole image.
Next i used the command tesseract 101436364.tiff 10143636.3B_uzn -psm4 hocr ,So it generated the xhtml of the whole image. Next i used the command tesseract 101436364.tiff 10143636.3B_uzn -psm4 ,now it generated the text file of 486 110 635 142 area, which i provided in the uzn file. Now i need the xhtml of the 486 110 635 142 area. How can i generate it ? I hope you understood my question. It is very urgent for me,Please help me, Appreciate your help . Regards Krijesh PV. On Thursday, May 29, 2014 8:48:51 PM UTC+5:30, Nick White wrote: > > Hi Krijesh, > > On Thu, May 29, 2014 at 07:38:17AM -0700, Krijesh PV wrote: > > i am completely a novice on this topics, can please explain on complete > > process, how can i create this uzn files are there any tools for that, > > There aren't any tools to create uzn files, that I know of. You can > see how they are formatted from the isri example files mentioned > earlier in the thread. > > > wat does this psn switch mean, how to identify this values. > > Read the tesseract manual (as well as earlier in this thread) for > details about the psm switch, and how it relates to uzn. > > Note that you can also use the API to select regions to OCR, if > you're in a programming mood. > > Nick > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/58881bbf-783f-4cf2-bd1a-7dfd2808606f%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

