Which OS are you using, which version of tesseract, etc.?
I tried
tesseract original.jpg original -l chi_tra
and
tesseract preprocessed.tiff preprocessed -l chi_tra
and I did not get any error message (on openSUSE Linux 12.2, 64-bit, with
tesseract 3.02.02).
Why did you upscale the image?
Thank you for the reply. I appreciate the help.
We are compiling on Windows 7 (64-bit machine), but the source is compiled as 32-bit.
We are using tesseract 3.02 compiled from scratch with VS2010. I link to
the built static libs from a wrapper. Also using leptonica 1.68 static
lib, not built from
I do apologize, but I am not familiar with Chinese (or other Asian
languages ;-) ). So I tried
tesseract original.jpg original -l chi_sim
and the message was:
Too many unichars in ambiguity on line 0
Too many unichars in ambiguity on line 0
Tesseract Open Source OCR Engine v3.02.02 with
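For anyone trying to reproduce this, here is a rough sketch of how the run could be captured so the warnings can be counted and compared between machines. The helper name `count_ambiguity_warnings` and the log file name `tesseract.log` are made up for illustration; the file names and the `-l chi_sim` option are the ones from the command above. Both stdout and stderr are redirected into the log, so it does not matter which stream tesseract writes the messages to.

```shell
#!/bin/sh
# Hypothetical helper (not part of tesseract): count the
# "Too many unichars in ambiguity" lines in a saved log file.
count_ambiguity_warnings() {
  # grep -c prints the number of matching lines (0 if there are none).
  grep -c 'Too many unichars in ambiguity' "$1"
}

# Only attempt the run if tesseract and the sample image are present.
if command -v tesseract >/dev/null 2>&1 && [ -f original.jpg ]; then
  status=0
  # Capture stdout and stderr together so no message is missed.
  tesseract original.jpg original -l chi_sim > tesseract.log 2>&1 || status=$?
  echo "exit status: $status"
  echo "ambiguity warnings: $(count_ambiguity_warnings tesseract.log)"
  # tesseract writes the recognized text to <outputbase>.txt.
  if [ -s original.txt ]; then
    echo "original.txt was written and is not empty"
  fi
fi
```

If the exit status is 0 and original.txt is not empty, the messages did not stop recognition, which would match what is reported in this thread.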
I'm seeing the same issue on Win7 Starter with a fresh install of 3.02.02 and the
3.02 Simplified Chinese data file, which I renamed to chi.traineddata.
I've tried a few files of varying quality, and even very high-quality ones still
produce the same errors.
It does produce an output file with quality which more or less