Re: [tesseract-ocr] ERROR: exp0.box does not exist or is not readable

2018-04-07 Thread Fanatico
Thanks for the reply, but I just fixed this bug, the problem is that the 
var PANGOCAIRO_BACKEND was empty on MAC OSX so I needed to set it before 
executing the code. Something like this:

PANGOCAIRO_BACKEND=fc \
../../tesseract/training/tesstrain.sh \
  --fonts_dir /Library/Fonts \
  --lang eng --linedata_only \
  --noextract_font_properties \
  --exposures "0"\
  --langdata_dir ../../langdata \
  --tessdata_dir /usr/local/Cellar/tesseract/HEAD-f8e26ee/share/tessdata \
  --fontlist "Verdana" \
  --output_dir .~/tesstutorial/eng

If someone need more details please look here: 
https://github.com/tesseract-ocr/tesseract/issues/736

On Saturday, 7 April 2018 04:35:36 UTC-3, shree wrote:
>
> Look in your tmp directory in the sub folders referred in the console 
> output
>
> Check the log file and other files there
>
> On Sat 7 Apr, 2018, 11:00 AM Fanatico,  
> wrote:
>
>> Yes the location is correct,  I tried to put the full path to the folder 
>> and go the same error. 
>>
>> Im just cloned the https://github.com/tesseract-ocr/langdata repo
>>
>> On Friday, 6 April 2018 23:28:06 UTC-3, shree wrote:
>>>
>>> Is your langdata in   --langdata_dir ../../langdata
>>>


 -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to tesseract-oc...@googlegroups.com .
>> To post to this group, send email to tesser...@googlegroups.com 
>> .
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/0c2a56a5-181b-4a8c-b29f-8869dfe8b22d%40googlegroups.com
>>  
>> 
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/25c5c75b-ebfd-41c3-978d-fb62c67119be%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] ERROR: exp0.box does not exist or is not readable

2018-04-07 Thread ShreeDevi Kumar
Look in your tmp directory in the sub folders referred in the console output

Check the log file and other files there

On Sat 7 Apr, 2018, 11:00 AM Fanatico,  wrote:

> Yes the location is correct,  I tried to put the full path to the folder
> and go the same error.
>
> Im just cloned the https://github.com/tesseract-ocr/langdata repo
>
> On Friday, 6 April 2018 23:28:06 UTC-3, shree wrote:
>>
>> Is your langdata in   --langdata_dir ../../langdata
>>
>>>
>>>
>>> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/0c2a56a5-181b-4a8c-b29f-8869dfe8b22d%40googlegroups.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWUYDSbTEg058UrxjCAazjcNeAxNy%2B5z-br_jhO6fiScA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] ERROR: exp0.box does not exist or is not readable

2018-04-06 Thread Fanatico
Yes the location is correct,  I tried to put the full path to the folder 
and go the same error. 

Im just cloned the https://github.com/tesseract-ocr/langdata repo

On Friday, 6 April 2018 23:28:06 UTC-3, shree wrote:
>
> Is your langdata in   --langdata_dir ../../langdata
>
>>
>>
>>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/0c2a56a5-181b-4a8c-b29f-8869dfe8b22d%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: [tesseract-ocr] ERROR: exp0.box does not exist or is not readable

2018-04-06 Thread ShreeDevi Kumar
Is your langdata in   --langdata_dir ../../langdata

On Sat 7 Apr, 2018, 4:51 AM Fanatico,  wrote:

> I'm trying to execute the training from the 4.o tutorial, but I'm getting
> an error, can someone help with this?
>
> Platform: MAC OS X 10.13.3
> Tesseract: 4.0.0-beta.1
> leptonica: 1.75.3
> libjpeg 9c : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11
>
>
> Code used
>
> ../../tesseract/training/tesstrain.sh \
>   --fonts_dir /Library/Fonts \
>   --lang eng --linedata_only \
>   --noextract_font_properties \
>   --exposures "0"\
>   --langdata_dir ../../langdata \
>   --tessdata_dir /usr/local/Cellar/tesseract/HEAD-f8e26ee/share/tessdata \
>   --fontlist "Verdana" \
>   --output_dir .~/tesstutorial/ara
>
> Result
>
> === Starting training for language 'eng'
> [Fri Apr 6 20:19:15 -03 2018] /usr/local/bin/text2image
> --fonts_dir=/Library/Fonts --font=Verdana
> --outputbase=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x34gn/T/font_tmp.XX.aU9oTb7N/sample_text.txt
> --text=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x34gn/T/font_tmp.XX.aU9oTb7N/sample_text.txt
> --fontconfig_tmpdir=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x34gn/T/font_tmp.XX.aU9oTb7N
>
> === Phase I: Generating training images ===
> Rendering using Verdana
> [Fri Apr 6 20:19:17 -03 2018] /usr/local/bin/text2image
> --fontconfig_tmpdir=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x34gn/T/font_tmp.XX.aU9oTb7N
> --fonts_dir=/Library/Fonts --strip_unrenderable_words --leading=32
> --char_spacing=0.0 --exposure=0
> --outputbase=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x34gn/T/tmp.OaBuo1g2/eng/eng.Verdana.exp0
> --max_pages=3 --font=Verdana --text=../../langdata/eng/eng.training_text
> ERROR:
> /var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x34gn/T/tmp.OaBuo1g2/eng/eng.Verdana.exp0.box
> does not exist or is not readable
> ERROR:
> /var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x34gn/T/tmp.OaBuo1g2/eng/eng.Verdana.exp0.box
> does not exist or is not readable
>
> Observations
>
> I can find the fond if I use:
>
> text2image --list_available_fonts --fonts_dir=/Library/Fonts
>
> I tested some other fonts.
>
> Thanks for the time and reply!
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/cbe9828e-690f-4bc4-8592-d195370d4857%40googlegroups.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWz0tPqugJmi8mMOVKidXFiFjPsQ%2BOUzNzgp-y%3Dkw64WA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] ERROR: exp0.box does not exist or is not readable

2018-04-06 Thread Fanatico
I'm trying to execute the training from the 4.o tutorial, but I'm getting 
an error, can someone help with this?

Platform: MAC OS X 10.13.3
Tesseract: 4.0.0-beta.1
leptonica: 1.75.3
libjpeg 9c : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11


Code used

../../tesseract/training/tesstrain.sh \
  --fonts_dir /Library/Fonts \
  --lang eng --linedata_only \
  --noextract_font_properties \
  --exposures "0"\
  --langdata_dir ../../langdata \
  --tessdata_dir /usr/local/Cellar/tesseract/HEAD-f8e26ee/share/tessdata \
  --fontlist "Verdana" \
  --output_dir .~/tesstutorial/ara

Result

=== Starting training for language 'eng'
[Fri Apr 6 20:19:15 -03 2018] /usr/local/bin/text2image 
--fonts_dir=/Library/Fonts --font=Verdana 
--outputbase=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x34gn/T/font_tmp.XX.aU9oTb7N/sample_text.txt
 
--text=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x34gn/T/font_tmp.XX.aU9oTb7N/sample_text.txt
 
--fontconfig_tmpdir=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x34gn/T/font_tmp.XX.aU9oTb7N

=== Phase I: Generating training images ===
Rendering using Verdana
[Fri Apr 6 20:19:17 -03 2018] /usr/local/bin/text2image 
--fontconfig_tmpdir=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x34gn/T/font_tmp.XX.aU9oTb7N
 
--fonts_dir=/Library/Fonts --strip_unrenderable_words --leading=32 
--char_spacing=0.0 --exposure=0 
--outputbase=/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x34gn/T/tmp.OaBuo1g2/eng/eng.Verdana.exp0
 
--max_pages=3 --font=Verdana --text=../../langdata/eng/eng.training_text
ERROR: 
/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x34gn/T/tmp.OaBuo1g2/eng/eng.Verdana.exp0.box
 
does not exist or is not readable
ERROR: 
/var/folders/xl/gqcd7ljn0k7d3r_3j9dy7x34gn/T/tmp.OaBuo1g2/eng/eng.Verdana.exp0.box
 
does not exist or is not readable

Observations

I can find the fond if I use:

text2image --list_available_fonts --fonts_dir=/Library/Fonts

I tested some other fonts.

Thanks for the time and reply!

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/cbe9828e-690f-4bc4-8592-d195370d4857%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.