Re: [tesseract-ocr] tesseract on cygwin

2015-07-27 Thread ShreeDevi Kumar
Most probably icui18n; see issues 61 and 62.
- sent from my phone. excuse the brevity and typos.
On 27 Jul 2015 15:15, "Marco Atzeri" wrote:
> Hi ShreeDevi,
>
> Which ICU libs are requested?
>
> $ pkg-config --list-all|grep icu-
> icu-i18n  icu-i18n - International Components fo

Re: [tesseract-ocr] tesseract on cygwin

2015-07-27 Thread Marco Atzeri
Hi ShreeDevi,
Which ICU libs are requested?
$ pkg-config --list-all|grep icu-
icu-i18n  icu-i18n - International Components for Unicode: Internationalization library
icu-uc  icu-uc - International Components for Unicode: Common and Data libraries
icu-io
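Editor's sketch: the check above can also be scripted. The helper below parses `pkg-config --list-all` output and reports which modules from a list are absent. The function names and the list of required modules are assumptions made for illustration; the thread only suggests that icu-i18n is the likely candidate.

```python
import subprocess


def parse_pkg_config_list(text):
    """Map module name -> description from `pkg-config --list-all` output."""
    modules = {}
    for line in text.splitlines():
        parts = line.split(None, 1)  # first token is the module name
        if not parts:
            continue
        description = parts[1].strip() if len(parts) > 1 else ""
        modules[parts[0]] = description
    return modules


def missing_modules(required, listing_text):
    """Return the names in `required` that do not appear in the listing."""
    available = parse_pkg_config_list(listing_text)
    return [name for name in required if name not in available]


def installed_listing():
    """Run the real pkg-config; raises if it is not installed or fails."""
    result = subprocess.run(
        ["pkg-config", "--list-all"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout
```

With pkg-config installed, `missing_modules(["icu-i18n", "icu-uc"], installed_listing())` would return an empty list when both modules are visible; the module list here is a guess, not something the build scripts were checked against.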

Re: [tesseract-ocr] tesseract on cygwin

2015-07-27 Thread ShreeDevi Kumar
Marco, please see https://github.com/tesseract-ocr/tesseract/wiki/Compiling for dependencies and instructions for compiling the training tools. They may not compile with 3.04.00; please see https://github.com/tesseract-ocr/tesseract/issues/61 Closed: building tesseract under cygwin: traini

Re: [tesseract-ocr] tesseract on cygwin

2015-07-26 Thread Marco Atzeri
On 7/27/2015 4:54 AM, ShreeDevi Kumar wrote:
> Thank you, Marco. 1. Is there a way to download just the tesseract package and dependencies (like Simon had set up) for testing purposes for those who do not have a cygwin install?
Possible: the package is available on mirrors: http://mirrors.kern

Re: [tesseract-ocr] tesseract on cygwin

2015-07-26 Thread ShreeDevi Kumar
Thank you, Marco. 1. Is there a way to download just the tesseract package and dependencies (like Simon had set up) for testing purposes for those who do not have a cygwin install? 2. The pdf output option (as far as I understand it) adds the OCRed text layer on top of a copy of the original image,

Re: [tesseract-ocr] tesseract on cygwin

2015-07-26 Thread marco . atzeri
On Friday, 24 July 2015 at 09:48:53 UTC+2, Simon Eigeldinger wrote:
> hi,
>
> sorry, missed the point.
> just reproduced it:
>
> $ tesseract testing\eurotext.tif testing\eurotext -l eng+deu pdf
>
> Tesseract Open Source OCR Engine v3.05.00dev with Leptonica
> Page 1
> Error in f

Re: [tesseract-ocr] tesseract on cygwin

2015-07-24 Thread Simon Eigeldinger
hi,
sorry, missed the point.
just reproduced it:
$ tesseract testing\eurotext.tif testing\eurotext -l eng+deu pdf
Tesseract Open Source OCR Engine v3.05.00dev with Leptonica
Page 1
Error in fopenWriteStream: stream not opened
Error in pixWrite: stream not opened
Error in fopenReadStream: file no
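Editor's sketch: the thread does not establish the cause of these errors. One thing that can be ruled out before rerunning is a missing input file or an output directory that is absent or not writable. The helper names below are hypothetical; the command layout (image, output base, `-l` with `+`-joined languages, trailing `pdf` config) simply mirrors the invocation quoted above.

```python
import os


def build_tesseract_command(image, output_base, languages, config=None):
    """Assemble the argument list in the same order as the quoted command."""
    cmd = ["tesseract", image, output_base, "-l", "+".join(languages)]
    if config:
        cmd.append(config)  # e.g. "pdf", as used in the thread
    return cmd


def preflight_problems(image, output_base):
    """Return a list of human-readable path problems (empty if none found)."""
    problems = []
    if not os.path.isfile(image):
        problems.append("input image not found: " + image)
    out_dir = os.path.dirname(output_base) or "."
    if not os.path.isdir(out_dir):
        problems.append("output directory does not exist: " + out_dir)
    elif not os.access(out_dir, os.W_OK):
        problems.append("output directory is not writable: " + out_dir)
    return problems
```

Forward slashes in the paths sidestep shell escaping questions entirely, for example `build_tesseract_command("testing/eurotext.tif", "testing/eurotext", ["eng", "deu"], "pdf")`. A clean preflight result does not mean the pdf renderer will succeed; it only removes the path-related explanations.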

Re: [tesseract-ocr] tesseract on cygwin

2015-07-23 Thread Simon Eigeldinger
hi,
i did test it 2 days ago and it seems to work, at least over here and on a windows 7 machine in the office. but i could recheck again.
greetings, simon
On 24.07.2015 at 08:50, zdenko podobny wrote:
> it is not about input, but output. pdf output is key feature of leptonica 1.71 release (

Re: [tesseract-ocr] tesseract on cygwin

2015-07-23 Thread zdenko podobny
It is not about input, but output. PDF output is a key feature of the leptonica 1.71 release (and tesseract 3.03/3.04), and I guess it has not been tested on cygwin yet.
Zdenko
On Fri, Jul 24, 2015 at 8:42 AM, Simon Eigeldinger wrote:
> Hi,
>
> i never tried to give tesseract a pdf as an input.
> cygwin h

Re: [tesseract-ocr] tesseract on cygwin

2015-07-23 Thread Simon Eigeldinger
Hi, i never tried to give tesseract a pdf as an input. cygwin has leptonica 1.71 or 1.72 by default so i used this for compiling. maybe leptonica doesn't like pdf files so it might complain. so ShreeDevi Kumar might convert the pdf into an image or use a normal image (tif, jpg, etc.). gre

Re: [tesseract-ocr] tesseract on cygwin

2015-07-23 Thread zdenko podobny
On Fri, Jul 24, 2015 at 7:10 AM, ShreeDevi Kumar wrote:
>
> C:\Users\User\Downloads\TESS>tesseract test/eurotext.tif test/eurotext-eng-pdf -l eng pdf
> Tesseract Open Source OCR Engine v3.04.00 with Leptonica
> Page 1
> Error in fopenWriteStream: stream not opened
> Error in pixWrite: stream no

Re: [tesseract-ocr] tesseract on cygwin

2015-07-23 Thread ShreeDevi Kumar
Simon, I gave the cygwin compiled windows binary a try. It runs fine and I was able to create the txt and hocr output. I am getting some error creating the pdf and also if I use a gif as input. Just FYI, in the past I have used MSYS2 on the same PC for building tesseract - not sure if that is cau

Re: [tesseract-ocr] tesseract on cygwin: training tools seem not to build

2015-07-23 Thread Simon Eigeldinger
hi,
and i just opened a ticket: https://github.com/tesseract-ocr/tesseract/issues/61
greetings, simon
On 23.07.2015 at 23:23, Jim O'Regan wrote:
> On 23 July 2015 at 19:02, Simon Eigeldinger wrote:
>> Hi all,
>> pango_font_info.cpp:223:46: error: 'strcasestr' was not declared in this scope

Re: [tesseract-ocr] tesseract on cygwin: training tools seem not to build

2015-07-23 Thread Jim O'Regan
On 23 July 2015 at 19:02, Simon Eigeldinger wrote:
> Hi all,
>
> pango_font_info.cpp:223:46: error: 'strcasestr' was not declared in this scope
>   is_fraktur_ = (strcasestr(family, "Fraktur") != NULL);
>                                              ^
This seems to be the relevant part. I don

[tesseract-ocr] tesseract on cygwin: training tools seem not to build

2015-07-23 Thread Simon Eigeldinger
Hi all,
this is what make produces when it should make the training tools (sorry for sending all the stuff, but it might be interesting):
$ make training
make[1]: Entering directory '/home/Besitzer/tesseractsrc/training'
depbase=`echo boxchar.lo | sed 's|[^/]*$|.deps/&|;s|\.lo$||'`;\
/bin/s

Re: [tesseract-ocr] tesseract on cygwin

2015-07-23 Thread Simon Eigeldinger
Hi,
just fixed the how-to file.
greetings, simon
On 23.07.2015 at 04:55, ShreeDevi Kumar wrote:
> http://domasofan.spdns.eu/tesseract/how%20to%20install.txt
> Excellent instructions, Simon. I am downloading and will give it a try under Windows 8. I would suggest that you add 'Tesseract for W

Re: [tesseract-ocr] tesseract on cygwin

2015-07-22 Thread ShreeDevi Kumar
http://domasofan.spdns.eu/tesseract/how%20to%20install.txt
Excellent instructions, Simon. I am downloading and will give it a try under Windows 8. I would suggest that you add 'Tesseract for Windows' as a heading on the instructions page too. Thanks!
>> did you manage to build training tools

[tesseract-ocr] tesseract on cygwin

2015-07-22 Thread Simon Eigeldinger
Hi, sorry for starting a new thread but i deleted all the other mails. just updated the package for the german and english languages to include osd.traineddata to make the error go away. the other 2 files are unchanged at the moment. bleeding edge code doesn't compile every day i know that. th