Which version of Tesseract are you using? It appears you are using an earlier release, like 2.xx.

On Fri, Apr 8, 2011 at 2:03 AM, zl2k <[email protected]> wrote:

> Hi all,
>
> I generated the unicharset file as follows:
>
> 36
> NULL 0 NULL 0
> 0 8 NULL 1
> 1 8 NULL 2
> 2 8 NULL 3
> 3 8 NULL 4
> 4 8 NULL 5
> 5 8 NULL 6
> 6 8 NULL 7
> 7 8 NULL 8
> 8 8 NULL 9
> 9 8 NULL 10
> A 5 NULL -1
> B 5 NULL -1
> C 5 NULL -1
> D 5 NULL -1
> E 5 NULL -1
> F 5 NULL -1
> G 5 NULL -1
> H 5 NULL -1
> I 5 NULL -1
> J 5 NULL -1
> K 5 NULL -1
> L 5 NULL -1
> M 5 NULL -1
> N 5 NULL -1
> O 5 NULL -1
> P 5 NULL -1
> R 5 NULL -1
> S 5 NULL -1
> T 5 NULL -1
> U 5 NULL -1
> V 5 NULL -1
> W 5 NULL -1
> X 5 NULL -1
> Y 5 NULL -1
> Z 5 NULL -1
>
> What does the "NULL 0 NULL 0" in the second line mean? Do I need to
> delete it? My box file is generated as plain text; is the "-1" OK?
> (the id code of char given language). Thanks for the help.
>
> zl2k
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to
> [email protected].
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en.
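
For reference, here is a minimal Python sketch of how the four whitespace-separated columns in a listing like the one quoted above could be read. It is only an illustration, not Tesseract's own parser. The names `Entry`, `parse_unicharset`, `describe` and `last_field` are hypothetical, and the property-bit meanings (1 = alpha, 2 = lower, 4 = upper, 8 = digit, 0x10 = punctuation, written in hex) are an assumption based on the Tesseract 3 training notes, so check them against the version you actually run.

```python
from typing import List, NamedTuple

# Assumed property bits (hex mask in column 2); verify for your version.
PROPERTY_BITS = [
    (0x01, "alpha"),
    (0x02, "lower"),
    (0x04, "upper"),
    (0x08, "digit"),
    (0x10, "punctuation"),
]


class Entry(NamedTuple):
    char: str        # column 1: the character (or the NULL placeholder)
    props: int       # column 2: property mask, parsed as hex
    script: str      # column 3: script name (NULL in the quoted listing)
    last_field: int  # column 4: the id field the question asks about


def parse_unicharset(text: str) -> List[Entry]:
    """Parse a count line followed by four-column entries."""
    lines = text.strip().splitlines()
    count = int(lines[0])
    entries = []
    for line in lines[1:]:
        char, props, script, last = line.split()
        entries.append(Entry(char, int(props, 16), script, int(last)))
    if len(entries) != count:
        raise ValueError(f"header says {count}, found {len(entries)} entries")
    return entries


def describe(props: int) -> List[str]:
    """Return the names of the property bits set in the mask."""
    return [name for bit, name in PROPERTY_BITS if props & bit]


# Shortened excerpt of the quoted listing (count adjusted to match).
SAMPLE = """4
NULL 0 NULL 0
0 8 NULL 1
A 5 NULL -1
B 5 NULL -1
"""

if __name__ == "__main__":
    for entry in parse_unicharset(SAMPLE):
        print(entry.char, describe(entry.props), entry.last_field)
```

Under that assumption, the 8 on the digit lines reads as "digit" and the 5 on the letter lines as "alpha" plus "upper", while the first entry has no property bits set. The count check also shows why the header says 36: the listing has the NULL entry, 10 digits and 25 letters.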

