[tesseract-ocr] Where to find the Tesseract.dll for Tesseract OCR version v5.0.0.

2021-04-21 Thread Sharp Subbu
Dear Friends, We have tried to find the Tesseract.dll for Tesseract OCR version v5.0.0. in the Tesseract git hub url (). Kindly share the Tesseract.dll for Tesseract OCR version v5.0.0 if you have it, or kindly share the steps to create this dll from

Re: [tesseract-ocr] Re: tessedit_create_boxfile condensed like boxaGetBox

2021-04-21 Thread Zdenko Podobny
Use tsv output but you will still need to parse it to get line information. Zdenko st 21. 4. 2021 o 16:38 Baris Unsal napísal(a): > I want the opposite way. Getting ril_textline like output from passing > argument to tesseract. > > On Wednesday, 21 April 2021 at 17:36:35 UTC+3 Quan Nguyen

Re: [tesseract-ocr] detect decimal point in amount with psm 11

2021-04-21 Thread Zdenko Podobny
1. You got the result for the image you provided. 2. I suggest you to use other oem 3. I know that invoice digitalizator use different parameters for parsing numbers. Zdenko st 21. 4. 2021 o 17:45 Kumar Rajwani napísal(a): > Hi Zdenop, As i said i know psm 6 working better in

Re: [tesseract-ocr] detect decimal point in amount with psm 11

2021-04-21 Thread Kumar Rajwani
Hi Zdenop, As i said i know psm 6 working better in number but it not able to get all text in image. where psm 11 does better. So this the reason i want to with psm 11 but i am getting wrong amount that's the only problem i am facing with psm 11. So can you tell me how can i achive same result

Re: [tesseract-ocr] detect decimal point in amount with psm 11

2021-04-21 Thread Zdenko Podobny
Try to use better config parameters. e.g: $ tesseract download.png - --psm 6 --oem 0 will produce: $ 250,941.00 $ -75,282.00 $ 175,659.00 $ -15,072 00 $ 2,860.00 $ 0.00 $ 163,447.00 legacy engine could be better for numbers Zdenko st 21. 4. 2021 o 14:10 Kumar Rajwani napísal(a): > Hey, > I

[tesseract-ocr] Re: tessedit_create_boxfile condensed like boxaGetBox

2021-04-21 Thread Baris Unsal
I want the opposite way. Getting ril_textline like output from passing argument to tesseract. On Wednesday, 21 April 2021 at 17:36:35 UTC+3 Quan Nguyen wrote: > I think it would need to operate at RIL_SYMBOL level, not RIL_TEXTLINE. > > On Wednesday, April 21, 2021 at 7:17:04 AM UTC-5

[tesseract-ocr] Re: tessedit_create_boxfile condensed like boxaGetBox

2021-04-21 Thread Quan Nguyen
I think it would need to operate at RIL_SYMBOL level, not RIL_TEXTLINE. On Wednesday, April 21, 2021 at 7:17:04 AM UTC-5 yosoyl...@gmail.com wrote: > Hi, when I pass tessedit_create_boxfile 1 argument to tesseract it outputs > individual chars' location. But when I use api like this: > > ``` >

Re: [tesseract-ocr] tessedit_create_boxfile condensed like boxaGetBox

2021-04-21 Thread Zdenko Podobny
Hello, it is unclear for what do you do/want to do: - you wrote want individual chars, but request from API line (RIL_TEXTLINE) - then you wrote " Is there any way to combine individual boxes to print like API" so what do you want to combine? Maybe it would be better if you provide

[tesseract-ocr] tessedit_create_boxfile condensed like boxaGetBox

2021-04-21 Thread Baris Unsal
Hi, when I pass tessedit_create_boxfile 1 argument to tesseract it outputs individual chars' location. But when I use api like this: ``` Boxa* boxes = api->GetComponentImages(tesseract::RIL_TEXTLINE, true,NULL, NULL); for(int i = 0; i < boxes->n; i++){ BOX* box =boxaGetBox(boxes,i,L_CLONE);

[tesseract-ocr] detect decimal point in amount with psm 11

2021-04-21 Thread Kumar Rajwani
Hey, I am using tesseract to identify amounts in my forms. You can look below image for sample. i am getting perfect amount with decimal in psm 6. but when i use psm 11 i am getting follwing output. I have to use psm 11 as it identify more text with compare to psm 6 in my images. 250,941 00 00