Hello,
I have created a web extension which solves this problem. Upload image to
https://imagescanner-online.com/  it will clear your noise and
pixel-segment text so that you get a very good quality input, which you can
feed to tesseract and get good output

Regards
Ajinkya

On Wed, Jun 2, 2021 at 12:13 AM Timo Richter <timo.j...@gmail.com> wrote:

> Hi everyone,
>
> I have tried to ocr an identity card [1] and big parts were not
> recognised. I do not get anything from the headline nor the first few rows.
> From the middle, Tesseract partially finds correct text. There are lines
> and things in the background, as usual. In the monochrome picture I could
> not completely extract the letters from the background. Some gray pixels
> stay there. But there is a website that does OCR and it works perfectly
> [2]. Why do I get bad results and my Tesseract does not read the text? What
> will the website do another way?
>
>
> Thank you in advance,
>
> Timo
>
>
> [1]
> https://en.wikipedia.org/wiki/Philippine_passport#/media/File:Philippine_passport_(2016_edition)_data_page.jpg
> (public domain)
> [2] https://cloud.google.com/document-ai#section-2
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/4f6d0261-5e0a-49c8-b6db-3e2b0e4ad9f5n%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/4f6d0261-5e0a-49c8-b6db-3e2b0e4ad9f5n%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAHy6iNM46Md3%2BgnnO9H62pCQRTpbrURTg_1%2Babbu0qzyOgwiGw%40mail.gmail.com.

Reply via email to