Re: [tesseract-ocr] Re: Error opening traineddata files on Mac High Sierra

2018-04-11 Thread Firlefanz
Thank you again. I think I'll stay with plain txt -- pdf looks too difficult to achieve. Now, next problem: Everything worked fine with my 1-page test pdf. I now tried to do the same with a 30 MB 500 pages pdf. After running convert -density 300 test.pdf -depth 8 -strip -background white -alph

Re: [tesseract-ocr] Re: Error opening traineddata files on Mac High Sierra

2018-04-11 Thread ShreeDevi Kumar
https://github.com/tesseract-ocr/tesseract/issues/660 Regarding pdf On Wed 11 Apr, 2018, 1:28 PM ShreeDevi Kumar, wrote: > 1. Check the output tif and adjust convert command if needed > > 2. Depending on your tesseract version you could try -l frk also. > > 3. Yes, you can get a pdf as output.

Re: [tesseract-ocr] Re: Error opening traineddata files on Mac High Sierra

2018-04-11 Thread ShreeDevi Kumar
1. Check the output tif and adjust convert command if needed 2. Depending on your tesseract version you could try -l frk also. 3. Yes, you can get a pdf as output. Search Github issues, there is a long discussion thread regarding best ways to create a pdf output. Look for pdf and invisible pdf.