Re: Chinese Simplified on this image not working

2012-12-18 Thread zdenko podobny
What kind of OS you use, what version of tesseract etc... I tried tesseract original.jpg original -l chi_tra and tesseract preprocessed.tiff preprocessed -l chi_tra and I did not get any error message (on openSUSE linux 64bit 12.2 with tesseract 3.02.02)... Why did you upscale image?

Re: Chinese Simplified on this image not working

2012-12-18 Thread occorled
Thank you for the reply. I appreciate the help. We are compiling on Windows7 (64-bit machine), but src compiled in 32-bit. We are using tesseract 3.02 compiled from scratch with VS2010. I link to the built static libs from a wrapper. Also using leptonica 1.68 static lib, not built from

Re: Chinese Simplified on this image not working

2012-12-18 Thread zdenko podobny
I do apologize, but I am not familiar with Chinese (or other Asian languages ;-) ). So I tried tesseract original.jpg original -l chi_sim and the message was: Too many unichars in ambiguity on line 0 Too many unichars in ambiguity on line 0 Tesseract Open Source OCR Engine v3.02.02 with

Re: Chinese Simplified on this image not working

2012-12-18 Thread Lee Kohl-Bradley
Seeing the same issue, Win7 Starter with a fresh install of 3.02.02 and the 3.02 simplified Chinese which I renamed to chi.traineddata. I've tried a few files of various quality and even very high quality still has the same errors. It does produce an output file with quality which more or less