Hi ShreeDevi,
Which icu libs is requested ?
$ pkg-config --list-all|grep icu-
icu-i18n icu-i18n - International Components for
Unicode: Internationalization library
icu-uc icu-uc - International Components for
Unicode: Common and Data libraries
icu-io icu-io - International Components for
Unicode: Stream and I/O Library
icu-le icu-le - International Components for
Unicode: Layout library
icu-lx icu-lx - International Components for
Unicode: Paragraph Layout library
$ pkg-config --libs icu-i18n
-licui18n -licuuc -licudata -lpthread -lm
On 7/27/2015 9:05 AM, ShreeDevi Kumar wrote:
Marco,
Please see https://github.com/tesseract-ocr/tesseract/wiki/Compiling for
dependencies and instructions for compiling training tools.
They may not compile with 3.04.00 , please see
https://github.com/tesseract-ocr/tesseract/issues/61
Closed
*building **tesseract**under **cygwin**: training tools don't build #61*
- sent from my phone. excuse the brevity and typos.
On 27 Jul 2015 11:50, "Marco Atzeri" <[email protected]
<mailto:[email protected]>> wrote:
On 7/27/2015 4:54 AM, ShreeDevi Kumar wrote:
Thank you, Marco.
1. Is there a way to download just the tesseract package and
dependencies (like Simon had setup) for testing purposes for
those who
do not have a cygwin install?
possible:
The package is available on mirrors:
http://mirrors.kernel.org/sourceware/cygwin/x86_64/release/tesseract-ocr/
the setup.hint reports the dependencies, in this case
requires: cygwin libgcc1 libleptonica_3 libstdc++6
libtesseract-ocr_3 tesseract-ocr-eng
http://mirrors.kernel.org/sourceware/cygwin/x86_64/release/leptonica/libleptonica_3/
requires: cygwin libgif4 libjpeg8 libpng16 libtiff6 libwebp5 zlib0
and so on..
2. The pdf output option (as far as I understand it) adds the
OCRed text
layer on top of copy of the original image, so looking like the
original
image is by intention.
I guessed so.
3. Are the training tools (text2image and other programs from
training
directory) included as part of this? If so, may I request you to
also
include the bash scripts in training directory - tesstrain.sh,
tesstrain_util.sh and language-specific.sh. Training also requires
langdata which is available in a separate repository -
https://github.com/tesseract-ocr/langdata
No, from what I see nothing is built or installed from the training
directory.
There is a specific switch for that ?
I am using a clean configure call.
Question for Zdenko, Jeff, Ray ...
Should Tesseract training tools be packaged separately from
tesseract-ocr, since not everyone is interested in doing training?
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it,
send an email to [email protected]
<mailto:tesseract-ocr%[email protected]>.
To post to this group, send email to [email protected]
<mailto:[email protected]>.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit
https://groups.google.com/d/msgid/tesseract-ocr/55B5C992.8070304%40gmail.com.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to [email protected]
<mailto:[email protected]>.
To post to this group, send email to [email protected]
<mailto:[email protected]>.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUL2wq%2Bb442ceQVkxw7PZNdAH%3DhD1Z1WGC%3DuFHOLqA4vg%40mail.gmail.com
<https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUL2wq%2Bb442ceQVkxw7PZNdAH%3DhD1Z1WGC%3DuFHOLqA4vg%40mail.gmail.com?utm_medium=email&utm_source=footer>.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit
https://groups.google.com/d/msgid/tesseract-ocr/55B5FDB3.6080002%40gmail.com.
For more options, visit https://groups.google.com/d/optout.