Hi Dovhani, On Tue, Aug 19, 2014 at 04:06:26AM -0700, Dovhani Foneworx wrote: > Hi I have a problem that when I run: > > set_unicharset_properties -U input_unicharset -O output_unicharset > --script_dir > =/home/foneworx/DM/Tesseracting/tesseract-3.03/training/langdata > > > > I get the following output: > > > Loaded unicharset of size 3 from file input_unicharsetSetting unichar > propertiesOther case JOINED of Joined is not in unicharsetOther case > |BROKEN|0| > 1 of |Broken|0|1 is not in unicharsetWriting unicharset to file > output_unicharset
Sometimes unicharsets have lines beginning Joined and |Broken| near the top. I'm not sure what they mean, but they don't screw anything up, so don't worry about it. The output you see there is just set_unicharset_properties warning you that they look weird (they are, but it's fine). Sorry to be a bit vague; I don't have time to look into exactly what they mean or why they're there at the moment. Nick -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/20140820144945.GA21531%40manta.lan. For more options, visit https://groups.google.com/d/optout.

