Look at src/training/language_specific.sh

The list of default fonts for English is being picked up from there and you
probably don't have them installed.

Use fonts that are available.

On Fri, Jun 22, 2018 at 9:20 AM Harathi Surya <[email protected]>
wrote:

> Hi,
>
> I am trying to create .lstm files to finetune tesseract4.0.0 for new
> characters. I want to fine tune tesseract to recognize new characters like
> ±.
> What i tried:
> I added text that consists of the plus or minus symbol to the
> eng.training_text in langdata.
> Then I tried to run the following command
>
> src/training/tesstrain.sh --fonts_dir /usr/share/fonts --lang eng
> --linedata_only --noextract_font_properties --langdata_dir ../langdata
>  --tessdata_dir ./tessdata --output_dir ~/tesstutorial/trainplusminus
>
> I am getting the following error:
> ERROR: /tmp/tmp.3qWucNlYrH/eng/eng.Arial.exp0.box does not exist or is not
> readable
>
> The error repeated for all the font types.
>
> Can you please give some suggestions why this error occurs and how to
> solve this?
>
> Thanks in advance
> Harathi
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/ae8f7849-8d9a-4799-be3d-47dc67fcddc2%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/ae8f7849-8d9a-4799-be3d-47dc67fcddc2%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>


-- 

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWXEvRyLZw2e2rg4cgZxHjvcnSETNCYZWssAz6LN%2BDS_g%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to