VietOCR (Java version) does not feed the original image to Tesseract. Instead, it reads the image and writes it back out as an uncompressed TIFF file, rescaled to 300 DPI if so instructed, and sends that file to the engine. In my experience, this re-written image has been more amenable to Tesseract.
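If you want to try the same kind of preprocessing by hand before calling tesseract from the command line, something like the following might work. This is only a rough sketch in Python using the third-party Pillow library, not VietOCR's actual code. The function names, the 72 DPI fallback for files that carry no resolution, and the choice to resample the pixels (rather than only retag the resolution) are all my assumptions.

```python
from pathlib import Path


def scaled_size(width, height, source_dpi, target_dpi=300):
    """Pixel dimensions that keep the same physical size at target_dpi."""
    factor = target_dpi / source_dpi
    return max(1, round(width * factor)), max(1, round(height * factor))


def rewrite_as_tiff(src, dst, target_dpi=300):
    """Re-save src as an uncompressed TIFF, resampled and tagged at target_dpi.

    Hypothetical helper for illustration; it is not taken from VietOCR.
    """
    # Imported here so scaled_size() stays usable without Pillow installed.
    from PIL import Image  # third-party: pip install Pillow

    with Image.open(src) as img:
        # Assumption: treat files without resolution metadata as 72 DPI.
        source_dpi = float(img.info.get("dpi", (72, 72))[0])
        if source_dpi > 0 and source_dpi != target_dpi:
            new_size = scaled_size(img.width, img.height, source_dpi, target_dpi)
            img = img.resize(new_size, Image.LANCZOS)
        # compression="raw" asks Pillow to write the TIFF without compression.
        img.save(dst, format="TIFF", compression="raw",
                 dpi=(target_dpi, target_dpi))
    return Path(dst)
```

You would call it as rewrite_as_tiff("page.tif", "page_300.tif") and then point tesseract at page_300.tif instead of the original file. Whether that avoids the crash you are seeing is something you would have to test.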
The program also supports a command-line interface that you can use.

Glad that it helped.

Regards,
Quan

On Sep 11, 11:46 pm, Jon <[email protected]> wrote:
> Never mind. I just didn't look "way to the right in the upper
> corner."
> That selection was all by itself and I missed it. User error.
>
> All is good, now.
>
> Interesting thing is that I can load up one of the TIFF files in
> VietOCR, the exact same one I might attempt a command
> line with tesseract on, and there is no crash and VietOCR
> just does a fairly okay job of conversion to text. It's still some
> manual labor, but works fairly well just the same. But if I then
> drop into a command line box under Win7 64-bit, and try out
> the tesseract command directly, it crashes on that same image
> file.
>
> So perhaps there is something that VietOCR does in setting up
> a process for executing tesseract that I am NOT getting when
> I just fire up the command box. Because both manually and with
> VietOCR, the same tesseract.exe is being used and so is the same
> input file. Or something else I'm missing.
>
> One other item. The ImageMagick installation suggests that I do
> this:
>
>   convert logo: logo.miff
>   imdisplay logo.miff
>
> With either the Q8 (32-bit code) or Q16 versions (64-bit code), the
> first command works fine. But the second line always fails with an
> error dialog box and doesn't work. But if I just type 'imdisplay'
> then the program fires up just fine and if I then load the logo.miff
> file, it also displays just fine.
>
> So there may be something wrong with my command line box settings
> that I'm ignorant about, too.
>
> I will get some time to try all this on a Win7 Ultimate 64-bit, with a
> system I pieced together myself from parts. Both are Intel I7, but
> the one I built up is very new and hasn't had much added to it. It
> is fairly plain and gets me closer to an ideal installation (or so I
> imagine.)
>
> Thanks again. And that VietOCR software is very nice!! Good
> advice and much appreciated.
>
> Jon

