When running combine_lang_model, the following warnings occur:

Warning: properties incomplete for index 6 = ိ
Warning: properties incomplete for index 11 = ်
Warning: properties incomplete for index 22 = ြ
Warning: properties incomplete for index 39 = ွ
Warning: properties incomplete for index 42 = ့
Warning: properties incomplete for index 45 = ဲ
Warning: properties incomplete for index 57 = ှ
Warning: properties incomplete for index 58 = ံ
Warning: properties incomplete for index 64 = ီ
Warning: properties incomplete for index 84 = ဩ

Warning: properties incomplete for index 105 = ဪ


I checked Myanmar.unicharset 
<https://github.com/tesseract-ocr/langdata/blob/master/Myanmar.unicharset>. 
ြ, ဩ and ဪ are missing from it. 
I added an entry for ဩ and its warning disappeared.
The others are present in Myanmar.unicharset 
<https://github.com/tesseract-ocr/langdata/blob/master/Myanmar.unicharset>.

The following is a comment from unicharset.h 
<https://github.com/tesseract-ocr/tesseract/blob/8de022ab1cd291ffaf4d24bb4fa07b97edfca4a7/src/ccutil/unicharset.h>:
// Returns true if any of the top/bottom/width/bearing/advance ranges/stats is empty.

ိ 0 58,59,255,255,211,215,0,0,0,0 Myanmar 44 17 44 ိ # ိ [102d ]
ီ 0 58,59,255,255,211,215,0,0,0,0 Myanmar 45 17 45 ီ # ီ [102e ]
ဲ 0 58,59,255,255,211,215,0,0,0,0 Myanmar 49 17 49 ဲ # ဲ [1032 ]
ံ 0 58,59,255,255,211,215,0,0,0,0 Myanmar 50 17 50 ံ # ံ [1036 ]
့ 0 0,0,255,255,211,215,0,0,0,0 Myanmar 51 17 51 ့ # ့ [1037 ]

 
The bearing and advance values for these entries are all zero, 
so I looked at Devanagari.unicharset for comparison:

 ं 0 62,76,194,242,81,178,0,27,0,77 Devanagari 2 17 2 ं # ं [902 ]


Then I changed the entry for ိ to:
ိ 0 58,59,255,255,211,215,1,27,1,27 Myanmar 44 17 44 ိ # ိ [102d ]
After that, the warning for this character no longer appears.


   - min_bearing, max_bearing: how far from the usual start position does the 
   leftmost part of the character begin.
   - min_advance, max_advance: how far from the printer’s cell left do we 
   advance to begin the next character.
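Going by the field names in that documentation, this is how I am reading the ten comma-separated numbers in each line. It is only a sketch: the field order is my assumption from the documentation (I have not checked it against the Tesseract source), the function names are my own, and the "all zero" test is just my way of finding entries that look like the ones above, not necessarily the check that produces the warning.

```python
from typing import Dict

# ASSUMPTION: field order as I understand it from the documentation,
# not verified against the Tesseract source code.
METRIC_NAMES = [
    "min_bottom", "max_bottom", "min_top", "max_top",
    "min_width", "max_width", "min_bearing", "max_bearing",
    "min_advance", "max_advance",
]


def parse_metrics(line: str) -> Dict[str, int]:
    """Split one unicharset entry and name its ten metric values."""
    fields = line.split()
    # fields[0] is the character, fields[1] the properties value,
    # fields[2] the comma-separated metrics.
    values = [int(v) for v in fields[2].split(",")]
    if len(values) != len(METRIC_NAMES):
        raise ValueError(f"expected {len(METRIC_NAMES)} metrics, got {len(values)}")
    return dict(zip(METRIC_NAMES, values))


def zero_bearing_and_advance(line: str) -> bool:
    """True if all four bearing/advance values are zero (my own heuristic)."""
    metrics = parse_metrics(line)
    keys = ("min_bearing", "max_bearing", "min_advance", "max_advance")
    return all(metrics[k] == 0 for k in keys)


# The entry for U+102D before and after my manual edit.
ORIGINAL = "\u102d 0 58,59,255,255,211,215,0,0,0,0 Myanmar 44 17 44 \u102d # \u102d [102d ]"
EDITED = "\u102d 0 58,59,255,255,211,215,1,27,1,27 Myanmar 44 17 44 \u102d # \u102d [102d ]"

if __name__ == "__main__":
    print(parse_metrics(ORIGINAL))
    print(zero_bearing_and_advance(ORIGINAL))
    print(zero_bearing_and_advance(EDITED))
```

Running the same check over every line of Myanmar.unicharset would list the entries that still have zero bearing and advance values, but it does not tell me what the correct values should be.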
I read the documentation, but I don't understand these fields or how to calculate their values.

Please help me.




