Re: [tesseract-ocr] tesseract on cygwin

Simon Eigeldinger Thu, 23 Jul 2015 23:43:17 -0700

Hi,

i never tried to give tesseract a pdf as an input.
cygwin has leptonica 1.71 or 1.72 by default so i used this for compiling.
maybe leptonica doesn't like pdf files so it might complain.

so ShreeDevi Kumar might convert the pdf into an image or he uses anormal image (tif, jpg, etc.).



greetings,
simon



Am 24.07.2015 um 08:17 schrieb zdenko podobny:

On Fri, Jul 24, 2015 at 7:10 AM, ShreeDevi Kumar <[email protected]>
wrote:


C:\Users\User\Downloads\TESS>tesseract test/eurotext.tif
test/eurotext-eng-pdf -l eng pdf
Tesseract Open Source OCR Engine v3.04.00 with Leptonica
Page 1
Error in fopenWriteStream: stream not opened
Error in pixWrite: stream not opened
Error in fopenReadStream: file not found
Error in extractG4DataFromFile: stream not opened to file
Error in l_generateG4Data: datacomp not extracted
Error in pixGenerateCIData: g4 data not made
Error in l_generateCIDataForPdf: file test/eurotext.tif format is 4;
unreadable
Error during processing.

It looks like leptonica issue. Did you try to build and run leptonica
progs (all that has pdf in name)?




Zdenko


--
Simon Eigeldinger
Follow me on Twitter: http://www.twitter.com/domasofan/
E-Mail: [email protected]
MSN: [email protected]
ICQ: 121823966
Jabber: [email protected]

--
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/55B1DE3B.1090906%40vol.at.
For more options, visit https://groups.google.com/d/optout.

Re: [tesseract-ocr] tesseract on cygwin

Reply via email to