Hello, I'm trying to finetune the end.traineddata model as the steps in the link: https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00#fine-tuning-for-%C2%B1-a-few-characters
As the tutorail shows, I fine tuning for ± a few characters following the steps. But when I execute the first command, to generate new training and eval data: training/tesstrain.sh --fonts_dir /usr/share/fonts --lang eng --linedata_only \ --noextract_font_properties --langdata_dir ../langdata \ --tessdata_dir ./tessdata --output_dir ~/tesstutorial/trainplusminus An error is prompted: *Creation of encoded unicharset failed! *While constructing LSTM training data. More details refer to the image. Can you help me? Thanks. <https://lh3.googleusercontent.com/-Wjai0vTloSw/WYwaRTe-wHI/AAAAAAAAABA/y-k4luLbaNws3Qz5gae8oT2ou2nJoF9XACLcBGAs/s1600/2017-08-10%2B16-15-28%25E5%25B1%258F%25E5%25B9%2595%25E6%2588%25AA%25E5%259B%25BE.png> -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/1c40ba47-a6e5-4ec9-bf58-677bcdb2f74b%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

