Re: [tesseract-ocr] how to see which fonts are used in .traineddata files

2020-10-23 Thread Zdenko Podobny
e.g. https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.444.226=rep1=pdf https://arthurflor23.medium.com/text-segmentation-b32503ef2613 Zdenko On Fri, 23 Oct 2020 at 5:05, H Brenner wrote: > Hi Zdenko, > > Per your suggestion I have installed the latest version of tesseract (Ver 5),

Re: [tesseract-ocr] how to see which fonts are used in .traineddata files

2020-10-22 Thread H Brenner
Hi Zdenko, Per your suggestion I have installed the latest version of tesseract (Ver 5), and I played with the psm. I get the best result using --psm 11, like you did. Other values of psm give poor results. psm 11 is the best, but it is still not good. How do I create custom image

Re: [tesseract-ocr] how to see which fonts are used in .traineddata files

2020-10-05 Thread H Brenner
Hello Zdenko, 1) Can I assume you used the latest version of tesseract to produce the output you posted? To install the latest version, do I need to first *uninstall* the older version that I have on my PC? 2) How do I create a custom image segmentation? Thanks, Hylton On Sat, Oct 3, 2020

Re: [tesseract-ocr] how to see which fonts are used in .traineddata files

2020-10-03 Thread Zdenko Podobny
1. Try the latest version.
2. Try playing with psm, e.g.
   tesseract 20201002.png - --psm 11 --dpi 300
   produces:
   8 27 26 10 04 03 01 N29 19 16 14 09 03 131 27 25 18 12 03 N21 18 16 13 07 04 N32 232112 10 07 N 36 34 30 27 21 01 X35 3417 13 10 08 N36 33 29 28 14 09 R 33 32 31 21 06 01 - oe
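
The psm experiments discussed in this thread can also be scripted, so that several page segmentation modes are compared on the same image in one go. The following is only a minimal sketch, assuming Python 3.7+ and a `tesseract` executable on the PATH. The helper names `build_tesseract_cmd` and `sweep_psm` and the default list of modes are made up for illustration and are not part of tesseract. The flags `--psm` and `--dpi` and the `-` output base are the ones used in the command quoted in the thread.

```python
import shutil
import subprocess


def build_tesseract_cmd(image_path, psm, dpi=300):
    """Return the argument list for one tesseract run that prints text to stdout."""
    # "-" as the output base sends the recognized text to stdout,
    # mirroring the command quoted in the thread.
    return ["tesseract", image_path, "-", "--psm", str(psm), "--dpi", str(dpi)]


def sweep_psm(image_path, psm_values=(3, 4, 6, 11, 12), dpi=300):
    """Run tesseract once per page segmentation mode and collect the output.

    The default modes are an arbitrary illustrative selection (an assumption),
    not a recommendation from the thread.
    """
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    results = {}
    for psm in psm_values:
        proc = subprocess.run(
            build_tesseract_cmd(image_path, psm, dpi),
            capture_output=True,
            text=True,
            check=False,
        )
        # Keep stderr when a run fails so the failure is visible
        # instead of showing up as an empty result.
        results[psm] = proc.stdout if proc.returncode == 0 else proc.stderr
    return results
```

Calling `sweep_psm("20201002.png")` would return a dictionary mapping each psm value to the text tesseract printed for it, which makes it easier to see side by side why one mode works better than the others on a given image.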