[tesseract-ocr] OCR on photos of IDs?

2020-02-20 Thread Tom Apeltauer
Greetings everyone, I am standing in front of quite challenging task. Optical Character Recognition of the data from the IDs taken by smartphone camera. I have tried tesseract as-is, but the accuracy rate is somewhere around 40%. I have started tweaking around, disabling dictionaries and

[tesseract-ocr] Re: how to make Tesseract 3.3.0 reads more accurately from image

2020-02-20 Thread Lakshay Saini
Hello, You can go to my github repo, few months back I uploaded an executable file of tesseract version 4.0. If you use Windows you can use that to install newer version. Github: https://github.com/lakshay1296/ocrmyPDF_Windows?files=1 Regards Lakshay -- You received this message because you

[tesseract-ocr] Re: how to make Tesseract 3.3.0 reads more accurately from image

2020-02-20 Thread Justin Yeh
Sorry, I forgot to mention that I was using C# with Visual Studio 2015 to create an OCR application. Right now, all I can get from Nuget is tesseract 3.3.0. I found this online but I am not sure how to reference it in my current C#

Re: [tesseract-ocr] tesseract 3.3.0 always misinterpret few characters (desperate right now ...)

2020-02-20 Thread Justin Yeh
Sorry, I forgot to mention that I was using C# with Visual Studio 2015 to create an OCR application. Right now, all I can get from Nuget is tesseract 3.3.0. I found this online but I am not sure how to reference it in my current C#

[tesseract-ocr] Re: how to make Tesseract 3.3.0 reads more accurately from image

2020-02-20 Thread Lakshay Saini
On Thursday, February 20, 2020 at 7:01:55 PM UTC+5:30, Justin Yeh wrote: > Take this attached image for example, seems like Tesseract 3.3.0 (downloaded > from nuget) cannot recognize few characters correctly: such as 8 and B, or 5 > and Z, or 0 and O etc... > > > Is there any way that I could

Re: [tesseract-ocr] tesseract 3.3.0 always misinterpret few characters (desperate right now ...)

2020-02-20 Thread Zdenko Podobny
What is tesseract 3.3.0? I did not find it in https://github.com/tesseract-ocr/tesseract/releases Or did you mean 3.03-rc1 release on on Sep 20, 2014 ? Zdenko št 20. 2. 2020 o 14:25 Justin Yeh napísal(a): > Unfortunately tesseract 3.3.0 keeps misinterpreting characters such as B > and 8, or

[tesseract-ocr] how to make Tesseract 3.3.0 reads more accurately from image

2020-02-20 Thread Lakshay Saini
Hi there, The image size and quality greatly impacts on the OCR quality. So, that can be reason. And, you are using an old version of tesseract, try upgrading it to 4.1.1 and then test the image again. Regards, Lakshay -- You received this message because you are subscribed to the Google

[tesseract-ocr] how to make Tesseract 3.3.0 reads more accurately from image

2020-02-20 Thread Justin Yeh
Take this attached image for example, seems like Tesseract 3.3.0 (downloaded from nuget) cannot recognize few characters correctly: such as 8 and B, or 5 and Z, or 0 and O etc... Is there any way that I could get string from image more accurately? or ... How do I avoid this kind of

[tesseract-ocr] tesseract 3.3.0 always misinterpret few characters (desperate right now ...)

2020-02-20 Thread Justin Yeh
Unfortunately tesseract 3.3.0 keeps misinterpreting characters such as B and 8, or Z and 2. Take this attached image for example, I have tried my best to make image as clear as possible, but seems like tesseract still cannot read all characters correctly. Is there any way that I could do to

[tesseract-ocr] Re: How to install current version of tesseract on Ubuntu 16.04.6

2020-02-20 Thread Juanjo Serrano Lloria
sudo add-apt-repository 'deb http://es.archive.ubuntu.com/ubuntu focal main universe' sudo apt-get update sudo apt-get install tesseract-ocr or sudo apt-get install tesseract-ocr-all (with all language packages) -- You received this message because you are subscribed to the Google Groups

[tesseract-ocr] Re: How to install current version of tesseract on Ubuntu 16.04.6

2020-02-20 Thread Juanjo Serrano Lloria
Maybe you can add repo: add-apt-repository 'deb http://es.archive.ubuntu.com/ubuntu focal main universe' -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to

Re: [tesseract-ocr] Re: Using tesseract on browser page insufficient

2020-02-20 Thread Zdenko Podobny
Why we should document how to use Ubuntu? You should be familiar with your OS. PPA repositories for each tesseract version are listed on https://tesseract-ocr.github.io/tessdoc/Home.html Zdenko št 20. 2. 2020 o 9:20 Alexander Dietz napísal(a): > With an update to version 4 (undocumented

Re: [tesseract-ocr] Re: Using tesseract on browser page insufficient

2020-02-20 Thread Alexander Dietz
With an update to version 4 (undocumented procedure!!) the application works much better Thanks On Wednesday, February 19, 2020 at 6:33:27 PM UTC+1, shree wrote: > > You are using an old version of software. > > See https://tesseract-ocr.github.io/tessdoc/Home.html > > On Wed, Feb 19, 2020 at

[tesseract-ocr] Re: How to install current version of tesseract on Ubuntu 16.04.6

2020-02-20 Thread Alexander Dietz
Thank you, that seem to have worked finally. Maybe you can include that procedure on the tesseract documentation page HERE ? On Thursday, February 20, 2020 at 9:09:02 AM UTC+1, Александр Поздняков wrote: > > Hi. > > tesseract 4.1.1 > >

[tesseract-ocr] How to install current version of tesseract on Ubuntu 16.04.6

2020-02-20 Thread Александр Поздняков
Hi. tesseract 4.1.1 https://launchpad.net/~alex-p/+archive/ubuntu/tesseract-ocr sudo add-apt-repository ppa:alex-p/tesseract-ocr sudo apt-get update sudo apt-get install tesseract-ocr tesseract 5.0.0-alpha https://launchpad.net/~alex-p/+archive/ubuntu/tesseract-ocr-devel sudo

Re: [tesseract-ocr] How to install current version of tesseract on Ubuntu 16.04.6

2020-02-20 Thread Shree Devi Kumar
If you are on Ubuntu, use Alex's ppa. The link should be on the tessdoc documentation page that I had referred to earlier. On Thu, Feb 20, 2020 at 1:30 PM Alexander Dietz wrote: > How do I install a current version of tesseract on Ubuntu 16.04.6? I did > > sudo apt install tesseract-ocr >

[tesseract-ocr] How to install current version of tesseract on Ubuntu 16.04.6

2020-02-20 Thread Alexander Dietz
How do I install a current version of tesseract on Ubuntu 16.04.6? I did sudo apt install tesseract-ocr sudo apt upgrade tesseract-ocr but it still seems to be version (3.04.01-4, also it says tesseract-ocr is already the newest version (3.04.01-4). -- You received this