I'm trying to execute the training from the 4.o tutorial, but I'm getting 
an error, can someone help with this?

Platform: MAC OS X 10.13.3
Tesseract: 4.0.0-beta.1
leptonica: 1.75.3
libjpeg 9c : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11


Code used

../../tesseract/training/tesstrain.sh \
  --fonts_dir /Library/Fonts \
  --lang eng --linedata_only \
  --noextract_font_properties \
  --exposures "0"    \
  --langdata_dir ../../langdata \
  --tessdata_dir /usr/local/Cellar/tesseract/HEAD-f8e26ee/share/tessdata \
  --fontlist "Verdana" \
  --output_dir .~/tesstutorial/ara

Result

=== Starting training for language 'eng'
[Fri Apr 6 20:19:15 -03 2018] /usr/local/bin/text2image 
--fonts_dir=/Library/Fonts --font=Verdana 
--outputbase=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x340000gn/T/font_tmp.XXXXXXXXXX.aU9oTb7N/sample_text.txt
 
--text=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x340000gn/T/font_tmp.XXXXXXXXXX.aU9oTb7N/sample_text.txt
 
--fontconfig_tmpdir=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x340000gn/T/font_tmp.XXXXXXXXXX.aU9oTb7N

=== Phase I: Generating training images ===
Rendering using Verdana
[Fri Apr 6 20:19:17 -03 2018] /usr/local/bin/text2image 
--fontconfig_tmpdir=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x340000gn/T/font_tmp.XXXXXXXXXX.aU9oTb7N
 
--fonts_dir=/Library/Fonts --strip_unrenderable_words --leading=32 
--char_spacing=0.0 --exposure=0 
--outputbase=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x340000gn/T/tmp.OaBuo1g2/eng/eng.Verdana.exp0
 
--max_pages=3 --font=Verdana --text=../../langdata/eng/eng.training_text
ERROR: 
/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x340000gn/T/tmp.OaBuo1g2/eng/eng.Verdana.exp0.box
 
does not exist or is not readable
ERROR: 
/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x340000gn/T/tmp.OaBuo1g2/eng/eng.Verdana.exp0.box
 
does not exist or is not readable

Observations

I can find the fond if I use:

text2image --list_available_fonts --fonts_dir=/Library/Fonts

I tested some other fonts.

Thanks for the time and reply!

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/cbe9828e-690f-4bc4-8592-d195370d4857%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to