[tesseract-ocr] Re: Training error "Couldn't find a matching blob"

2018-05-31 Thread shree
This has been an issue for a long time. Thanks for finding the problem. Please submit a PR on GitHub. On Friday, June 1, 2018 at 1:55:25 AM UTC+5:30, Paul Kitchen wrote: > > After a lot of stepping through tesseract code, I found the problem. > > 1) In file coutln.cpp, function

[tesseract-ocr] Preprocess Image

2018-05-31 Thread Hongguo An
Hi: When trying to OCR the above image, the date 09/02/2017 is always misread as 0G/02/2017. This is tesseract 4 running on Linux; the command line is:

[tesseract-ocr] Re: Not able install tesseract ocr on ubuntu 17.04

2018-05-31 Thread Александр Поздняков
In */etc/apt/sources.list*, you need to replace the repository *http://us.archive.ubuntu.com/ubuntu* with *http://old-releases.ubuntu.com/ubuntu/*, then run: sudo apt-get update, followed by sudo apt install tesseract-ocr. As for the beta version, I'll think ... On Thursday, May 31, 2018 at 10:04:19 UTC+3, user
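A minimal sketch of the sources.list change suggested in this reply, in case it helps to see it spelled out. It only rewrites text handed to it and does not touch /etc/apt/sources.list itself. The function name and the sample "zesty" lines (the 17.04 codename) are illustrative assumptions and not taken from the thread.

```python
import re

# Mirror named in the reply; matched with or without a trailing slash.
OLD_MIRROR = re.compile(r"http://us\.archive\.ubuntu\.com/ubuntu/?(?=\s|$)")
NEW_MIRROR = "http://old-releases.ubuntu.com/ubuntu/"


def point_to_old_releases(sources_text: str) -> str:
    """Return sources.list text with the US mirror swapped for old-releases.

    Hypothetical helper: the name and behavior are assumptions for
    illustration, not part of apt or of the original message.
    """
    return OLD_MIRROR.sub(NEW_MIRROR, sources_text)


if __name__ == "__main__":
    # Sample entries; a real file would be read from /etc/apt/sources.list.
    sample = (
        "deb http://us.archive.ubuntu.com/ubuntu/ zesty main restricted\n"
        "deb http://us.archive.ubuntu.com/ubuntu zesty-updates main\n"
    )
    print(point_to_old_releases(sample), end="")
```

The rewritten text would still have to be saved back to the file with root privileges (keeping a backup of the original is a sensible precaution) before running the two apt commands from the reply.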

Re: [tesseract-ocr] lstmeval gives a perfect result but tesseract fails

2018-05-31 Thread ShreeDevi Kumar
>I've trained an LSTM model for a custom language from scratch as explained here. >The language only has about 100 words and 17 characters, so it's pretty simple. For such a small model, try to build the legacy version

Re: [tesseract-ocr] Help for training tesseract to recognize a new (dead) language

2018-05-31 Thread Ramast Magdy
Impressive! I thought we would need to do a lot of work in order to reach that stage??. The "??" in the text corresponds to a character unknown to me; I also can't find it among the available Unicode characters. It's certainly not part of the text. It is probably an indicator of a new chapter.

Re: [tesseract-ocr] Where to find tessdata folder?

2018-05-31 Thread Zdenko Podobny
Did you follow the installation instructions for that package? Did you try an internet search before posting on the forum? Did you try to search for help in the tesserocr project? I just put it into Google and I got: https://pypi.org/project/tesserocr/ https://github.com/sirfz/tesserocr

[tesseract-ocr] Not able install tesseract ocr on ubuntu 17.04

2018-05-31 Thread RT-Rakesh
user@computer:~$ sudo apt install tesseract-ocr Reading package lists... Done Building dependency tree Reading state information... Done The following packages were automatically installed and are no longer required: libgnutls-openssl27 postfix-sqlite Use 'sudo apt autoremove' to remove

[tesseract-ocr] Where to find tessdata folder?

2018-05-31 Thread Abel Tan
Hi, I installed pytesseract using Anaconda: conda install -c simonflueckiger tesserocr The path to Anaconda is: C:\Users\Tan\Anaconda3\ The path to the tesseract package is: C:\Users\Tan\Anaconda3\Lib\site-packages\tesserocr However, when I start up Jupyter notebook and run: import

[tesseract-ocr] Training of tesseract

2018-05-31 Thread AKS
Hi, I want to use tesseract OCR on images with varying font types and font sizes. The backgrounds also vary a lot across images: multi-colored backgrounds, backgrounds with designs, uneven illumination, and plain white backgrounds. If I simply apply a tesseract with some