[tesseract-ocr] How jpn word separation improve with fine tuning.

2021-11-30 Thread Yudai Sano
Hi, tesseract-ocr group. I have a question about the subject. When I run OCR on Japanese text using best/jpn.traineddata, address and bank name text is split into words as follows. ・Ex1  - Document Text : 東京都渋谷区桜丘町  - Word output : 東京, 都, 渋谷, 区, 桜丘, 町 ・Ex2  - Document Text :

Re: [tesseract-ocr] Failed to find library "leptonica-1.80.0.dll" for platform x86.

2021-11-30 Thread harika reddy
Hi Zdenko, Thanks for your reply. I'm not having any issues adding the Tesseract dependencies to my solution, and it works fine in Visual Studio, but I get the above-mentioned error when publishing it. I checked many forums for a solution, but no luck. So please help me with

[tesseract-ocr] Pre-train data

2021-11-30 Thread Quang Linh
Hey everyone. How can I pre-train data (new font, new language)? Thanks -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to

[tesseract-ocr] Digit recognition again.

2021-11-30 Thread Владимир Калачихин
Are there any examples of the recognition of code-stamped digits, such as ZIP codes? Or a real approach to recognize handwritten digits?