What kind of OS you use, what version of tesseract etc...???? I tried tesseract original.jpg original -l chi_tra and tesseract preprocessed.tiff preprocessed -l chi_tra and I did not get any error message (on openSUSE linux 64bit 12.2 with tesseract 3.02.02)...
Why did you upscale image? It is not omnipotent solution - see some experience from past[1]. [1] https://groups.google.com/d/msg/tesseract-ocr/KVHsGxfDdy0/hh6r4AFUvRMJ Zdenko On Tue, Dec 18, 2012 at 2:55 AM, occorled <occor...@gmail.com> wrote: > I have an image here that was initially 32 bit depth, but I scaled it > larger, grayscaled, TRCed, then threshold to binary to produce the image I > send to tesseract. However it still cannot capture anything. Is there > something wrong with this image because the text is so big? > > Tesseract prints out the following errors: > >> Too many unichars in ambiguity on line 102163292 >> Too many unichars in ambiguity on line 102163292 >> Too many unichars in ambiguity on line 102163292 >> Garbage result of merge? Left Ragged (414,645)->(258,2108) w=6 s=0, sort >> key=146 >> 17152, boxes=53, partners=0 >> Garbage result of merge? Right Ragged (425,619)->(291,1877) w=5 s=0, sort >> key=14 >> 872288, boxes=19, partners=0 >> Garbage result of merge? Right Ragged (686,619)->(552,1881) w=6 s=0, sort >> key=22 >> 788162, boxes=41, partners=0 >> Garbage result of merge? Right Ragged (1967,899)->(1840,2098) w=5 s=0, >> sort key= >> 62484498, boxes=38, partners=0 >> Garbage result of merge? Right Ragged (2166,687)->(2055,1729) w=6 s=0, >> sort key= >> 67827472, boxes=30, partners=0 >> Garbage result of merge? Left Ragged (3336,571)->(3173,2096) w=5 s=0, >> sort key=1 >> 02864440, boxes=45, partners=0 >> Garbage result of merge? Right Ragged (3403,646)->(3248,2100) w=5 s=0, >> sort key= >> 105121744, boxes=58, partners=0 >> Garbage result of merge? Left Ragged (3510,565)->(3401,1596) w=5 s=0, >> sort key=1 >> 08127380, boxes=33, partners=0 >> Garbage result of merge? Right Ragged (3632,565)->(3469,2096) w=6 s=0, >> sort key= >> 111800270, boxes=60, partners=0 >> > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to tesseract-ocr@googlegroups.com > To unsubscribe from this group, send email to > tesseract-ocr+unsubscr...@googlegroups.com > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-ocr@googlegroups.com To unsubscribe from this group, send email to tesseract-ocr+unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en