Hi Ajinkya,

the result looks better than mine. But it looks like a very low resolution, 
the text is not readable. How did you do it?
Still the Google AI website is a lot more accurate. How can they have done 
this?


[email protected] schrieb am Mittwoch, 2. Juni 2021 um 17:23:44 UTC+2:

> Hello,
> I have created a web extension which solves this problem. Upload image to 
> https://imagescanner-online.com/  it will clear your noise and 
> pixel-segment text so that you get a very good quality input, which you can 
> feed to tesseract and get good output
>
> Regards
> Ajinkya
>
> On Wed, Jun 2, 2021 at 12:13 AM Timo Richter <[email protected]> wrote:
>
>> Hi everyone,
>>
>> I have tried to ocr an identity card [1] and big parts were not 
>> recognised. I do not get anything from the headline nor the first few rows. 
>> From the middle, Tesseract partially finds correct text. There are lines 
>> and things in the background, as usual. In the monochrome picture I could 
>> not completely extract the letters from the background. Some gray pixels 
>> stay there. But there is a website that does OCR and it works perfectly 
>> [2]. Why do I get bad results and my Tesseract does not read the text? What 
>> will the website do another way?
>>
>>
>> Thank you in advance,
>>
>> Timo
>>
>>
>> [1] 
>> https://en.wikipedia.org/wiki/Philippine_passport#/media/File:Philippine_passport_(2016_edition)_data_page.jpg
>>  
>> (public domain)
>> [2] https://cloud.google.com/document-ai#section-2
>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected].
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/4f6d0261-5e0a-49c8-b6db-3e2b0e4ad9f5n%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/tesseract-ocr/4f6d0261-5e0a-49c8-b6db-3e2b0e4ad9f5n%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/9e83609b-1bad-4134-950a-025357e092b5n%40googlegroups.com.

Reply via email to