Re: [tesseract-ocr] Re: Problem reading text in two columns

2018-05-11 Thread ShreeDevi Kumar
> I used the tessdata_fast file for English - are these different from tessdata-ocr-eng that comes with Ubuntu? The ppa has traineddata files from tessdata_fast. Ubuntu 18.04 will have the same. Older versions of ubuntu (wihout ppa) will have traineddata files for tesseract 3.0x. You can try

[tesseract-ocr] Re: Problem reading text in two columns

2018-05-10 Thread Brooks Johnson
I've uninstalled and reinstalled from the PPA and my results resemble yours. I used the tessdata_fast file for English - are these different from tessdata-ocr-eng that comes with Ubuntu? On Wednesday, May 9, 2018 at 3:21:12 AM UTC-5, shree wrote: > > Please try by building the latest version

[tesseract-ocr] Re: Problem reading text in two columns

2018-05-09 Thread shree
> > Please try by building the latest version of tesseract from github > or install from links given in https://github.com/tesseract-ocr/tesseract/wiki I get the following output using the default eng.traineddata from the three repos - tessdata, tessdata_best, tessdata_fast, without any