After I run 

 unicharset_extractor rashi.bold.exp1.box rashi.regular.exp1.box

I get some lines in the unicharset file that are not explained anywhere. 
 For example,

Joined 0 0,255,0,255,0,32767,0,32767,0,32767 NULL 0 0 0 # Joined [4a 6f 69 
6e 65 64 ]
|Broken|0|1 0 0,255,0,255,0,32767,0,32767,0,32767 NULL 0 0 0 # Broken

What should I do with these?

Also, the remaining lines don't match what the wiki says for training 
tesseract 3.  

E.g.,

0,255,0,255,0,32767,0,32767,0,32767 NULL 0 0 0 # מ [5de ]

and

. 0 0,255,0,255,0,32767,0,32767,0,32767 NULL 0 0 0 # . [2e ]

Any help would be appreciated.  Thanks,

-seth

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to