[tesseract-ocr] Not getting any result with Tesseract-ocr v3.05.01 on Windows

2017-07-07 Thread Ruben Gaspar
Hello, While trying to use Tesseract v3.05.01 on my documents I get: "c:\Program Files (x86)\Tesseract-OCR\tesseract.exe" G:\\n_12_ocr11.png n_12_ocr11 -l eng Tesseract Open Source OCR Engine v3.05.01 with Leptonica Error in pixCreateNoInit: pix_malloc fail for data Error in

Re: [tesseract-ocr] Tesseract-ocr on Redhat 5

2017-07-07 Thread ShreeDevi Kumar
​for 3.05 don't you need to checkout the 3.05 branch??​ master is for 4.0 development. ShreeDevi भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Fri, Jul 7, 2017 at 9:22 PM, akhil katpally wrote: >

[tesseract-ocr] Re: find blocks of text

2017-07-07 Thread akhil katpally
Try Olena (scribo module) ... this can do good job ... here is the link olena.lrde.epita.fr On Tuesday, June 27, 2017 at 6:12:10 AM UTC-7, gadv wrote: > > How can I find blocks of text (not paragraphs necessarily) with tesseract? > If not possible with tesseract, do you know of any other tool

Re: [tesseract-ocr] Tesseract-ocr on Redhat 5

2017-07-07 Thread akhil katpally
Steven .. Here is the list of commands to install tesseract 3.05 on Redhat 6 ... Hope this should work for Redhat 5 ... if not please try to downgrade the tesseract and try .. sudo yum update sudo yum install wget unzip sudo yum install gcc gcc-c++ make sudo

[tesseract-ocr] Running Tesseract from Java application is giving permission denied

2017-07-07 Thread Prav
Hi, I am struggling with using Tesseract from a Java application. We have a Ubuntu server. When we run the Tesseract command directly on the server we get the output. But the moment we run it from the java application it gives permission denied. Error 13. We have set the mode of the directory

Re: [tesseract-ocr] Re: Store rotated pages

2017-07-07 Thread Thomas Klettke
Greg - here is the relevant part of the Perl code I've put together for my server, running CentOS 7. Feel free to modify for your use: == ## Search for pages that need to rotated # Arbitrary prefix for temporary files - # I'm also using the nanoseconds of

Re: [tesseract-ocr] Re: Store rotated pages

2017-07-07 Thread 'Greg Kinney' via tesseract-ocr
Hi Thomas, would you mind sharing your script? I've been desperately looking for a way to auto-rotate from the command line. On Tuesday, June 6, 2017 at 6:20:19 PM UTC-5, Thomas Klettke wrote: > > Thanks - I've figured it out, and have a solution that works now: > >- Input file is a

[tesseract-ocr] GetUTF8Text missing on PageIterator from page.AnalyseLayout() for .NET version of Tesseract 4

2017-07-07 Thread Justin Minnaar
Testing the new version of Tesseract 4 under .NET, I am puzzled to find that there is no way to get the text for an item when using an PageIterator after calling the page.AnalyseLayout() function. Am I missing something or did this function not get exposed in C#? Does anyone have a suggestion