It's exactly where I'm started and stuck. The produced box does not contain any Korean character only Latin ones. And that is a problem.
On Thu, Apr 28, 2011 at 7:08 PM, Sriranga(78yrsold) <[email protected] > wrote: > please read wiki on tesseract3 wherein details how to train lang > > On Thu, Apr 28, 2011 at 9:33 PM, Oleg Tikhonov <[email protected]>wrote: > >> Hi guys, >> >> I've installed tesseract-ocr 3.0 on Windows 7. All work fine if selected >> language is English. >> I tried to add/teach the system the Korean. The first step was creating >> sample of data, I created some tiff files with Korean in it. After, I ran >> tesseract command: >> tesseract [lang].[fontname].exp[num].tif [lang].[fontname].exp[num] >> batch.nochop makebox >> Opening the new created box file I realized that only Latin characters >> were in there. What's wrong? Might be I have to change a system language? >> Please advise me how anyway to create a training data set? Thank you in >> advance, >> >> Oleg >> >> -- >> You received this message because you are subscribed to the Google >> Groups "tesseract-ocr" group. >> To post to this group, send email to [email protected] >> To unsubscribe from this group, send email to >> [email protected] >> For more options, visit this group at >> http://groups.google.com/group/tesseract-ocr?hl=en >> > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

