What kind of OS you use, what version of tesseract etc...????
I tried
    tesseract original.jpg original -l chi_tra
and
    tesseract preprocessed.tiff preprocessed -l chi_tra
and I did not get any error message (on openSUSE linux 64bit 12.2 with
tesseract 3.02.02)...

Why did you upscale image? It is not omnipotent solution - see some
experience from past[1].
[1] https://groups.google.com/d/msg/tesseract-ocr/KVHsGxfDdy0/hh6r4AFUvRMJ

Zdenko



On Tue, Dec 18, 2012 at 2:55 AM, occorled <occor...@gmail.com> wrote:

> I have an image here that was initially 32 bit depth, but I scaled it
> larger, grayscaled, TRCed, then threshold to binary to produce the image I
> send to tesseract.  However it still cannot capture anything.  Is there
> something wrong with this image because the text is so big?
>
> Tesseract prints out the following errors:
>
>> Too many unichars in ambiguity on line 102163292
>> Too many unichars in ambiguity on line 102163292
>> Too many unichars in ambiguity on line 102163292
>> Garbage result of merge? Left Ragged (414,645)->(258,2108) w=6 s=0, sort
>> key=146
>> 17152, boxes=53, partners=0
>> Garbage result of merge? Right Ragged (425,619)->(291,1877) w=5 s=0, sort
>> key=14
>> 872288, boxes=19, partners=0
>> Garbage result of merge? Right Ragged (686,619)->(552,1881) w=6 s=0, sort
>> key=22
>> 788162, boxes=41, partners=0
>> Garbage result of merge? Right Ragged (1967,899)->(1840,2098) w=5 s=0,
>> sort key=
>> 62484498, boxes=38, partners=0
>> Garbage result of merge? Right Ragged (2166,687)->(2055,1729) w=6 s=0,
>> sort key=
>> 67827472, boxes=30, partners=0
>> Garbage result of merge? Left Ragged (3336,571)->(3173,2096) w=5 s=0,
>> sort key=1
>> 02864440, boxes=45, partners=0
>> Garbage result of merge? Right Ragged (3403,646)->(3248,2100) w=5 s=0,
>> sort key=
>> 105121744, boxes=58, partners=0
>> Garbage result of merge? Left Ragged (3510,565)->(3401,1596) w=5 s=0,
>> sort key=1
>> 08127380, boxes=33, partners=0
>> Garbage result of merge? Right Ragged (3632,565)->(3469,2096) w=6 s=0,
>> sort key=
>> 111800270, boxes=60, partners=0
>>
>  --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to tesseract-ocr@googlegroups.com
> To unsubscribe from this group, send email to
> tesseract-ocr+unsubscr...@googlegroups.com
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesseract-ocr@googlegroups.com
To unsubscribe from this group, send email to
tesseract-ocr+unsubscr...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to