1. Italicness is not stored in the unicharset file. The italicness is stored
in the inttemp file.
2. No harm in the NULLs. They are there for future expansion.
Ray.

On Sat, Feb 7, 2009 at 10:43 PM, Clem <[email protected]>wrote:

>
> Two questions:
>
> 1. Is it normal that the characters that I have labelled as "italic"
> with TesseractTrainer.py are given the number 0 ?
> 2. Why my unicharset file is full of NULL? Did I do something wrong?
>
> Thanks for your help.
>
> clement.
>
> -----------
>
> 108
> NULL 0 NULL
> n 3 NULL
> i 3 NULL
> h 3 NULL
> l 3 NULL
> u 3 NULL
> c 3 NULL
> d 3 NULL
> s 3 NULL
> , 0 NULL
> p 3 NULL
> r 3 NULL
> ae 3 NULL
> st 3 NULL
> a 3 NULL
> t 3 NULL
> v 3 NULL
> e 3 NULL
> o 3 NULL
> g 3 NULL
> 2 8 NULL
> : 0 NULL
> m 3 NULL
> ct 3 NULL
> f 3 NULL
> ssi 3 NULL
> 3 8 NULL
> 1 8 NULL
> 8 8 NULL
> q 3 NULL
> à 3 NULL
> E 5 NULL
> x 3 NULL
> & 0 NULL
> A 5 NULL
> P 5 NULL
> 6 8 NULL
> ; 0 NULL
> è 3 NULL
> H 5 NULL
> V 5 NULL
> ā 3 NULL
> R 5 NULL
> S 5 NULL
> y 3 NULL
> b 3 NULL
> T 5 NULL
> G 5 NULL
> si 3 NULL
> I 5 NULL
> . 0 NULL
> ò 3 NULL
> N 5 NULL
> Qu 5 NULL
> ss 3 NULL
> 9 8 NULL
> L 5 NULL
> fi 3 NULL
> j 3 NULL
> ( 0 NULL
> O 5 NULL
> ) 0 NULL
> C 5 NULL
> M 5 NULL
> ff 3 NULL
> - 0 NULL
> ù 3 NULL
> ? 0 NULL
> 7 8 NULL
> F 5 NULL
> D 5 NULL
> X 5 NULL
> $e 0 NULL
> $st 0 NULL
> $q 0 NULL
> $u 0 NULL
> $o 0 NULL
> $d 0 NULL
> $i 0 NULL
> $c 0 NULL
> $a 0 NULL
> $m 0 NULL
> $n 0 NULL
> $s 0 NULL
> $l 0 NULL
> $p 0 NULL
> $r 0 NULL
> $, 0 NULL
> $t 0 NULL
> $ss 0 NULL
> $. 0 NULL
> $Qu 0 NULL
> $si 0 NULL
> ū 3 NULL
> ē 3 NULL
> ffi 3 NULL
> ō 3 NULL
> $; 0 NULL
> ó 3 NULL
> fl 3 NULL
> oe 3 NULL
> é 3 NULL
> -- 0 NULL
> ê 3 NULL
> ú 3 NULL
> Q 5 NULL
> B 5 NULL
> z 3 NULL
>
> >
>

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to 
[email protected]
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to