Re: [tesseract-ocr] Stop OCRing

2021-06-02 Thread Zdenko Podobny
Have a look at the ETEXT_DESC *monitor (e,.g. [1]). It is usually used for
getting progress monitor (e.g. [2] ), but should be used for cancelling
progress too according header file.

[1]
https://github.com/tesseract-ocr/tesseract/blob/7a308edcb1fc7455008b531bc2a49de583d7b171/include/tesseract/baseapi.h#L576
[2] https://bucket401.blogspot.com/2021/04/show-ocr-progress.html

Zdenko


st 2. 6. 2021 o 14:23 Саша Карпюк  napísal(a):

> Hello everybody.
> I work on Android and use Tesseract.
> Is exist any way to stop OCRing? For example, if I set bitmap to
> TessBaseApi and called getUTF8Text() method - can I stop this procces in
> another thread?
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/5cc076b4-8383-4d35-9cc6-dda71ba0b5c0n%40googlegroups.com
> 
> .
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8wLTSaa%2BdEKtv131jrzzCksDpjSvaQMB%3Dk50zEYpNggmg%40mail.gmail.com.


Re: [tesseract-ocr] Tesseract cannot read text on stripe background / but Google AI can

2021-06-02 Thread Ajinkya Bobade
Hello,
I have created a web extension which solves this problem. Upload image to
https://imagescanner-online.com/  it will clear your noise and
pixel-segment text so that you get a very good quality input, which you can
feed to tesseract and get good output

Regards
Ajinkya

On Wed, Jun 2, 2021 at 12:13 AM Timo Richter  wrote:

> Hi everyone,
>
> I have tried to ocr an identity card [1] and big parts were not
> recognised. I do not get anything from the headline nor the first few rows.
> From the middle, Tesseract partially finds correct text. There are lines
> and things in the background, as usual. In the monochrome picture I could
> not completely extract the letters from the background. Some gray pixels
> stay there. But there is a website that does OCR and it works perfectly
> [2]. Why do I get bad results and my Tesseract does not read the text? What
> will the website do another way?
>
>
> Thank you in advance,
>
> Timo
>
>
> [1]
> https://en.wikipedia.org/wiki/Philippine_passport#/media/File:Philippine_passport_(2016_edition)_data_page.jpg
> (public domain)
> [2] https://cloud.google.com/document-ai#section-2
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/4f6d0261-5e0a-49c8-b6db-3e2b0e4ad9f5n%40googlegroups.com
> 
> .
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAHy6iNM46Md3%2BgnnO9H62pCQRTpbrURTg_1%2Babbu0qzyOgwiGw%40mail.gmail.com.


[tesseract-ocr] Help me to creating new .traineddata file

2021-06-02 Thread Vijay K
Hi, 

i'm working on Mechanical Engineering drawings that contain GDT symbols 
such as a symbol to indicate depth ↧, a counterbore ⌴ , countersink ◎, 
diameter Ø, etc. How to train these font on tesseract.

Tried with upending same fonts to eng.traineddata. but symbols are not 
recognizing. 

Please help me in creating new traineddata file on GDT symbols

Thanks in Advance.
Regards
Vijay
vijaybk@gmail.com

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/f76ec052-91f7-4841-b37f-565fb4a35477n%40googlegroups.com.


[tesseract-ocr] Stop OCRing

2021-06-02 Thread Саша Карпюк
Hello everybody. 
I work on Android and use Tesseract. 
Is exist any way to stop OCRing? For example, if I set bitmap to 
TessBaseApi and called getUTF8Text() method - can I stop this procces in 
another thread? 

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/5cc076b4-8383-4d35-9cc6-dda71ba0b5c0n%40googlegroups.com.