Hi Nick, 
I generated the uzn file and added the bbox values 486 110 635 142,
which cover only one word of the whole image.
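
For reference, a uzn zone line, as Tesseract reads it and as the ISRI example files lay it out, is "left top width height label" rather than two corner points. If 486 110 635 142 are corner coordinates (x1 y1 x2 y2), which is only an assumption about the values above, the line would need converting first. A minimal Python sketch; the helper name bbox_to_uzn_line and the label Text are illustrative and not part of any tool:

```python
def bbox_to_uzn_line(x1, y1, x2, y2, label="Text"):
    """Convert corner coordinates to a 'left top width height label' zone line.

    Assumes (x1, y1) is the top-left corner and (x2, y2) the bottom-right
    corner, both in pixels measured from the top-left of the image.
    """
    if x2 <= x1 or y2 <= y1:
        raise ValueError("expected x2 > x1 and y2 > y1")
    # Width and height are the differences between the two corners.
    return f"{x1} {y1} {x2 - x1} {y2 - y1} {label}"


# The values from this message, read as two corners (an assumption).
print(bbox_to_uzn_line(486, 110, 635, 142))
```

If the four numbers are already left, top, width and height, no conversion is needed.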

Next I ran the command  tesseract 101436364.tiff 10143636.3B_uzn -psm 4 hocr
and it generated the xhtml of the whole image.

Then I ran the command  tesseract 101436364.tiff 10143636.3B_uzn -psm 4
and it generated a text file for the 486 110 635 142 area, which I provided
in the uzn file.

Now I need the xhtml of the 486 110 635 142 area.

How can I generate it? I hope you understand my question.

It is very urgent for me. Please help; I appreciate it.
 
Regards
Krijesh PV.

On Thursday, May 29, 2014 8:48:51 PM UTC+5:30, Nick White wrote:
>
> Hi Krijesh, 
>
> On Thu, May 29, 2014 at 07:38:17AM -0700, Krijesh PV wrote: 
> > I am a complete novice on this topic. Can you please explain the complete
> > process? How can I create these uzn files, and are there any tools for that?
>
> There aren't any tools to create uzn files, that I know of. You can 
> see how they are formatted from the isri example files mentioned 
> earlier in the thread. 
>
> > What does this psm switch mean, and how do I identify these values?
>
> Read the tesseract manual (as well as earlier in this thread) for 
> details about the psm switch, and how it relates to uzn. 
>
> Note that you can also use the API to select regions to OCR, if 
> you're in a programming mood. 
>
> Nick 
>
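
The API route mentioned in the quoted reply can be sketched roughly as follows. This assumes the third-party tesserocr Python binding is installed (it is not mentioned anywhere in this thread), and it relies on that binding's SetRectangle and GetHOCRText wrappers around the Tesseract API; the function name region_hocr is hypothetical. Treat it as an untested sketch rather than a confirmed answer to the question above.

```python
def region_hocr(image_path, left, top, width, height):
    """Return hOCR markup for one rectangle of an image.

    Assumption: the third-party `tesserocr` binding is installed.
    The rectangle is given as left, top, width, height in pixels.
    """
    # Imported lazily so the module loads even without tesserocr present.
    from tesserocr import PyTessBaseAPI

    with PyTessBaseAPI() as api:
        api.SetImageFile(image_path)
        # Restrict recognition to the requested rectangle only.
        api.SetRectangle(left, top, width, height)
        # The argument is the page number; 0 is the first page.
        return api.GetHOCRText(0)
```

If 486 110 635 142 are two corners, the call would be region_hocr("101436364.tiff", 486, 110, 149, 32), since SetRectangle expects a width and height rather than a second corner.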
