Hi, I am trying to create .lstm files to finetune tesseract4.0.0 for new characters. I want to fine tune tesseract to recognize new characters like ±. What i tried: I added text that consists of the plus or minus symbol to the eng.training_text in langdata. Then I tried to run the following command
src/training/tesstrain.sh --fonts_dir /usr/share/fonts --lang eng --linedata_only --noextract_font_properties --langdata_dir ../langdata --tessdata_dir ./tessdata --output_dir ~/tesstutorial/trainplusminus I am getting the following error: ERROR: /tmp/tmp.3qWucNlYrH/eng/eng.Arial.exp0.box does not exist or is not readable The error repeated for all the font types. Can you please give some suggestions why this error occurs and how to solve this? Thanks in advance Harathi -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/ae8f7849-8d9a-4799-be3d-47dc67fcddc2%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

