Hi Oleg,
Did you create a file with mapping of character codes? Or Korean text
file that you printed and scanned in? Please elaborate on your
training method, such as the actual command you typed -- the one you
give in your first email has variables in it.
--Sven


On Thu, Apr 28, 2011 at 11:23 AM, Oleg Tikhonov <[email protected]> wrote:
> It's exactly where I'm started and stuck. The produced box does not contain
> any Korean character only Latin ones. And that is a problem.
>
> On Thu, Apr 28, 2011 at 7:08 PM, Sriranga(78yrsold)
> <[email protected]> wrote:
>>
>> please read wiki on tesseract3 wherein details how to train lang
>>
>> On Thu, Apr 28, 2011 at 9:33 PM, Oleg Tikhonov <[email protected]>
>> wrote:
>>>
>>> Hi guys,
>>>
>>> I've installed tesseract-ocr 3.0 on Windows 7. All work fine if selected
>>> language is English.
>>> I tried to add/teach the system the Korean. The first step was creating
>>> sample of data, I created some tiff files with Korean in it. After, I ran
>>> tesseract command:
>>> tesseract [lang].[fontname].exp[num].tif [lang].[fontname].exp[num]
>>> batch.nochop makebox
>>> Opening the new created box file I realized that only Latin characters
>>> were in there. What's wrong? Might be I have to change a system language?
>>> Please advise me how anyway to create a training data set? Thank you in
>>> advance,
>>>
>>> Oleg
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To post to this group, send email to [email protected]
>>> To unsubscribe from this group, send email to
>>> [email protected]
>>> For more options, visit this group at
>>> http://groups.google.com/group/tesseract-ocr?hl=en
>>
>> --
>> You received this message because you are subscribed to the Google
>> Groups "tesseract-ocr" group.
>> To post to this group, send email to [email protected]
>> To unsubscribe from this group, send email to
>> [email protected]
>> For more options, visit this group at
>> http://groups.google.com/group/tesseract-ocr?hl=en
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en
>



-- 
``All that is gold does not glitter,
  not all those who wander are lost;
the old that is strong does not wither,
  deep roots are not reached by the frost.
>From the ashes a fire shall be woken,
  a light from the shadows shall spring;
renewed shall be blade that was broken,
  the crownless again shall be king.”

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to