Hi,

I am trying to create .lstm files to finetune tesseract4.0.0 for new 
characters. I want to fine tune tesseract to recognize new characters like 
±.
What i tried:
I added text that consists of the plus or minus symbol to the 
eng.training_text in langdata.
Then I tried to run the following command

src/training/tesstrain.sh --fonts_dir /usr/share/fonts --lang eng 
--linedata_only --noextract_font_properties --langdata_dir ../langdata  
 --tessdata_dir ./tessdata --output_dir ~/tesstutorial/trainplusminus

I am getting the following error:
ERROR: /tmp/tmp.3qWucNlYrH/eng/eng.Arial.exp0.box does not exist or is not 
readable

The error repeated for all the font types.

Can you please give some suggestions why this error occurs and how to solve 
this?

Thanks in advance
Harathi

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/ae8f7849-8d9a-4799-be3d-47dc67fcddc2%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to