[tesseract-ocr] Re: any chance to get this .tiff converted to text?

2014-10-30 Thread boris
Hi Quan, thanks for your help. Sorry, this might be a silly question, but as told I am novice with Tesseract ;-) How do I select the Screenshot mode? Regards, Boris >> -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from

[tesseract-ocr] Re: Many 'question mark' chars in recognized text

2014-10-30 Thread Quan Nguyen
I suspect you have saved the Unicode text output with a wrong character encoding. Try UTF-8 encoding when you save the file. Tesseract may misrecognize the characters but rarely put question marks in their places. On Thursday, October 16, 2014 3:18:58 AM UTC-5, Salvo Piazza wrote: > > Hi all, >

[tesseract-ocr] Re: jTessBoxEditor 0.6 Beta release

2014-10-30 Thread Quan Nguyen
You only need JRE to run jTessBoxEditor. The AddOns page lists a few .NET tools. On Tuesday, October 28, 2014 8:31:25 AM UTC-5, iram akbar wrote: > > anyone tell me the tr

[tesseract-ocr] Re: image processing to improve tesseract OCR accuracy

2014-10-30 Thread Quan Nguyen
The number was recognized after grayscale, binarize, and invert color steps. On Thursday, October 30, 2014 9:51:59 AM UTC-5, Rick Leir wrote: > > The simpler method: convert to greyscale then binarize with the > appropriate threshold. > > However if the colors convert to similar grey values then

[tesseract-ocr] Re: any chance to get this .tiff converted to text?

2014-10-30 Thread Quan Nguyen
Hi Boris, Be sure to select Screenshot Mode. The image has too low resolution. Quan On Wednesday, October 29, 2014 1:10:49 PM UTC-5, boris wrote: > > Hi Shree, > > I have changed language to German but it won´t realy improve. > > Anyhow, I am thinking of programming my own OCR for my project as

[tesseract-ocr] Re: image processing to improve tesseract OCR accuracy

2014-10-30 Thread Rick Leir
The simpler method: convert to greyscale then binarize with the appropriate threshold. However if the colors convert to similar grey values then you need to choose a hue (color) and brighten it, then choose the contrasting hue and darken it, On Wednesday, October 29, 2014 4:34:26 AM UTC-4, D

Re: [tesseract-ocr] Re: any chance to get this .tiff converted to text?

2014-10-30 Thread ShreeDevi Kumar
Do look at https://code.google.com/p/tesseract-ocr/wiki/ImproveQuality for pre-processing steps for your images to improve recognition regardless of the OCR you use. ShreeDevi भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Wed,

Re: [tesseract-ocr] Passing glyph vector data directly to tesseract

2014-10-30 Thread zdenko podobny
On Fri, Oct 24, 2014 at 1:45 AM, Ryan Dev wrote: > Hi, I have what I think is a unique situation, and I was hoping I could > get some hints on how to proceed. > > I have problem font files, for which I want to fix the unicode mappings > for. I also have PDF files with these fonts, so I also have

Re: [tesseract-ocr] Language

2014-10-30 Thread zdenko podobny
>From my experience dictionary has only limited effect on OCR result: e.g. adding word to dictionary does not mean that tesseract will recognize it. But on other side missing word in dictionary does not mean that tesseract will not recognize it correctly. So if you have just ascii text (without lan

[tesseract-ocr] Re: Language file for MICR font

2014-10-30 Thread Andrew Litvinov
For me too. The one shared by Hunter doesn't work. (Ubuntu 14.04 , tesseract version 3.03) On Monday, June 9, 2014 10:32:59 PM UTC+3, Anurag Kalra wrote: > > Ok, the MICR training data shared by Quan is now working for me. > -- You received this message because you are subscribed to the Google