Re: [tesseract-ocr] Re: unicharset_extractor extracting zero values

2017-06-26 Thread David Barishev
I have successfully compiled from the latest branch. Thank you for all the support. On Wednesday, June 21, 2017 at 2:56:22 PM UTC+3, shree wrote: > > > > On Tuesday, June 20, 2017 at 9:09:53 PM UTC+5:30, shree wrote: >> >> I got the same error building 3.05.01 and have filed it as an issue - >>

Re: [tesseract-ocr] Re: unicharset_extractor extracting zero values

2017-06-21 Thread shree
On Tuesday, June 20, 2017 at 9:09:53 PM UTC+5:30, shree wrote: > > I got the same error building 3.05.01 and have filed it as an issue - > https://github.com/tesseract-ocr/tesseract/issues/1000 > This has been fixed by @stweil via https://github.com/tesseract-ocr/tesseract/pull/1003 Please

Re: [tesseract-ocr] Re: unicharset_extractor extracting zero values

2017-06-20 Thread shree
I got the same error building 3.05.01 and have filed it as an issue - https://github.com/tesseract-ocr/tesseract/issues/1000 -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it,

Re: [tesseract-ocr] Re: unicharset_extractor extracting zero values

2017-06-20 Thread ShreeDevi Kumar
Master branch currently includes the legacy engine. So you can easily build your custom traineddata using the following command (modify it for your fonts location, training text, font name etc) training/tesstrain.sh \ --fonts_dir ~/.fonts \ --tessdata_dir ../tessdata \ --training_text

Re: [tesseract-ocr] Re: unicharset_extractor extracting zero values

2017-06-20 Thread David Barishev
After several testing, i have found mixed results. If i download leptonica 1.74.4, build it, and than build master brach, it works fine. With the same version of leptonica, the 3.05.01 release failes with the following error: libtool: link: g++ -g -O2 -std=c++11 -o .libs/tesseract

Re: [tesseract-ocr] Re: unicharset_extractor extracting zero values

2017-06-20 Thread ShreeDevi Kumar
> Do you know why my tesseract isnt compiling ? I would really love a updated version on my ubuntu. Not sure. I haven't built 3.05 branch. For master, I follow the usual autotools method. Have you also built leptonica? Make sure you don't have any old leptonica version already. Make sure you

[tesseract-ocr] Re: unicharset_extractor extracting zero values

2017-06-19 Thread shree
See https://github.com/tesseract-ocr/tesseract/issues/318 regarding the unicharset format I was able to do regular tesseract training (not lstm) using tesseract 4.00.00 version from github master and create new unicharset and traineddata with your box/tiff pair. The output on the same tiff

[tesseract-ocr] Re: unicharset_extractor extracting zero values

2017-06-19 Thread shree
See https://github.com/tesseract-ocr/tesseract/issues/318 regarding the unicharset format I was able to do regular tesseract training (not lstm) using tesseract 4.00.00 version from github master and create new unicharset and traineddata with your box/tiff pair. The output on the same tiff file