Re: [tesseract-ocr] Could anyone help me about pytessract?

2019-09-19 Thread luffy monky
Sorry because I can understand why the out put is nothing...But an other code use the same way it will out the string but show 03 not 09 I just want to debug about those question -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To

Re: [tesseract-ocr] Could anyone help me about pytessract?

2019-09-19 Thread Lorenzo Bolzani
Try to invert the images. Lorenzo Il giorno gio 19 set 2019 alle ore 05:52 luffy monky ha scritto: > Hi ALL > I try to use any sample code from google. > But it's show no thing in my code > Could I trouble you for any advice?? > Here is my sample code > > import

Re: [tesseract-ocr] Trained data for E13B font

2019-09-19 Thread ElGato ElMago
Hello, CMC-7 is totally a different font than E13B. It's only E13B around myself. I've never seen CMC-7 in person. I had about 100 sample checks and used a check reading machine, one of those at banks. Thus they're in the same image quality and character quality. Although it's a small

[tesseract-ocr] Re: problems with upper-case character

2019-09-19 Thread 'Sandra M.' via tesseract-ocr
thanks for your responses @Timothy Snyder: I think I cannot do this in postprocesssing, as it is possible that both spellings occur, but I have to differentiate them. Or what did you do exactly? @zdenop: Unfortunately it is not possible for me to send a longer text. anyone else any ideas? Am

[tesseract-ocr] Re: problems with upper-case character

2019-09-19 Thread 'Sandra M.' via tesseract-ocr
[image: currentImage.png] @Lorenzo Blz: This is an example image. The output of my code is "calibrations". The height of the letters is not the same. Of course it cannot be recognized if there is only a "c", but in the context to the other letters tesseract should be able to detect if it is a

Re: [tesseract-ocr] Re: problems with upper-case character

2019-09-19 Thread Zdenko Podobny
Please provide more information (versions info, how you do OCR - seem like you use some coding). I just tried tesseract (tesseract 5.0.0-alpha-416-g408d6) command line with tessdata_best and if work for me: tesseract unnamed.png - Warning: Invalid resolution 0 dpi. Using 70 instead. Estimating

Re: [tesseract-ocr] Re: problems with upper-case character

2019-09-19 Thread Lorenzo Bolzani
I tried to upscale, downscale, with and without the white border and I always get Calibrations. I even tried a few psm modes. I'm using: tesseract 4.0.0 leptonica-1.76.0 libjpeg 8d (libjpeg-turbo 1.5.2) : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11 What I would do is this: - prepare a test

Re: [tesseract-ocr] text2image: No such file or directory

2019-09-19 Thread Zdenko Podobny
Does /usr/local/bin/text2image exists? Did you installed text2image/training tools? Zdenko št 19. 9. 2019 o 13:59 Ajinkya Khalwadekar napísal(a): > I am following https://github.com/tesseract-ocr/tesseract/issues/1453 for > tesseract 4.0 learning. > I am using macOS mojave. > > All was good

[tesseract-ocr] text2image issue

2019-09-19 Thread Ajinkya Khalwadekar
I am following https://github.com/tesseract-ocr/tesseract/issues/1453 for tesseract 4.0 learning. I am using macOS mojave. All was good until i tried 'text2image --list_available_fonts --fonts_dir=/Library/Fonts'. o/p i get on this is '-bash: /usr/local/bin/text2image: No such file or

Re: [tesseract-ocr] text2image issue

2019-09-19 Thread Zdenko Podobny
You already send this to forum and I already replied. Did you read it? Zdenko št 19. 9. 2019 o 15:04 Ajinkya Khalwadekar napísal(a): > I am following https://github.com/tesseract-ocr/tesseract/issues/1453 for > tesseract 4.0 learning. > I am using macOS mojave. > > All was good until i tried

[tesseract-ocr] text2image: No such file or directory

2019-09-19 Thread Ajinkya Khalwadekar
I am following https://github.com/tesseract-ocr/tesseract/issues/1453 for tesseract 4.0 learning. I am using macOS mojave. All was good until i tried 'text2image --list_available_fonts --fonts_dir=/Library/Fonts'. o/p i get on this is '-bash: /usr/local/bin/text2image: No such file or

Re: [tesseract-ocr] Re: problems with upper-case character

2019-09-19 Thread 'Sandra M.' via tesseract-ocr
You were both right - updating to version 5 fixed the problem more or less! Only in one case there is still a problem with lower and upper case letters, but for the other cases it's working now! Am Donnerstag, 19. September 2019 12:49:43 UTC+2 schrieb zdenop: > > your tesseract version is old.

Re: [tesseract-ocr] Re: Compile Tesseract with vcpkg to get dynamic libraries

2019-09-19 Thread Zdenko Podobny
I did not tried it, but if you have installed leptonica, you can install tesseract from source, just adjust relevant part of cmake configuration. AFAIK vcpkg uses cmake and ninja, so this this tutorial (last part) can help you:

Re: [tesseract-ocr] Re: problems with upper-case character

2019-09-19 Thread 'Sandra M.' via tesseract-ocr
You were both right - updating to version 5 fixed the problem more or less! Only in one case there is still a problem with lower and upper case letters, but for the other cases it's working now! -- You received this message because you are subscribed to the Google Groups "tesseract-ocr"

[tesseract-ocr] OCR results are different on different OS (Linux and Windows)

2019-09-19 Thread Karan Singh
For the same image, I am using the tesseract to get the text output. But apparently the output is bad on linux version (RHEL) than windows (Windows 10). I also made sure that all the installation dependencies and version are same. Kindly let me know how to deal with this. Thanks -- You

Re: [tesseract-ocr] Re: problems with upper-case character

2019-09-19 Thread Zdenko Podobny
please provide image for testing. Zdenko št 19. 9. 2019 o 18:06 'Sandra M.' via tesseract-ocr < tesseract-ocr@googlegroups.com> napísal(a): > But therefore I get empty strings now, because it occurs a symbol that > tesseract does not know. I had this problem before as well, but could fix > it

Re: [tesseract-ocr] Re: problems with upper-case character

2019-09-19 Thread 'Sandra M.' via tesseract-ocr
But therefore I get empty strings now, because it occurs a symbol that tesseract does not know. I had this problem before as well, but could fix it for whatever reason with config='--psm 7'. This doesn't work now anymore... Do you have an idea for this as well? I don't need to detect the

[tesseract-ocr] Which mode is better if i crop the exact text with multiple words and pass to tesseract for accuracy?

2019-09-19 Thread Purushotham Rao Eravalli
I am using some other model for text detection, I get text boxes for each line in the image(basically some time identity cards). Now i need to pass them through the tesseract for recognition. Which psm mode do you think will the higher accuracy between psm13 and psm7. -- You received this

Re: [tesseract-ocr] OCR results are different on different OS (Linux and Windows)

2019-09-19 Thread Zdenko Podobny
Do you really think that somebody can reproduce problem based on information you provided? Zdenko št 19. 9. 2019 o 18:10 Karan Singh napísal(a): > For the same image, I am using the tesseract to get the text output. But > apparently the output is bad on linux version (RHEL) than windows

[tesseract-ocr] Re: Compile Tesseract with vcpkg to get dynamic libraries

2019-09-19 Thread Anon ymous
I have the same problem On Sunday, September 30, 2018 at 1:19:37 PM UTC-4, PLOBEXRIME wrote: > > Hi, I'm searching for a way to compile Tesseract and get tesseract.dll > library file. CPPAN works for me from time to time but always fail for x64 > build so I've tried vcpkg instead. However vcpkg