Hello

Previously I was able to increase its performance by using
graphicsmagick (you can also use imagemgick instead).

http://www.graphicsmagick.org/

Simply download source in Mac OSX and compile.


Explore the following option to apply a pre-processor from terminal:

gm convert -magnify
gm convert -sharpen 6x1
gm convert -gamma 2
gm convert input.png -gamma 2 -colors 2 output.bmp
bmp2tiff -c none main.bmp main.tif
tesseract main.tif result -l eng


But applying these type of filter to all images wont be a good idea!
(note you also may need to install bmp2tiff/similar tool in Mac OSX)


You can also try with BARCODE!!! reader library, if available. :)


regards
Salahuddin

salahuddin66.blogspot.com



On Fri, 2011-01-07 at 06:02 -0800, Wojciech Radomski wrote:
> Hi, I am working on iPhone application that recognizes ISBN numbers
> (ISBN: 978-83-7380-900-0) I use tesseract for this, but it is not
> working very well. I can see other applications, using same engine to
> work better.
> 
> to limit the characters i use this config line:
> tess->SetVariable("tessedit_char_whitelist", "SN:0123456789X-"); so
> all "I" are converted to "1", and "B" to 8. Using this it wont make
> mistake with those letters, whick are not important to me. After that
> i use regular expression to find the correct part of recognized text.
> 
> I also crop the image, so tesseract recognizes only part of the image,
> where isbn is visible (i placed color rect on camera overlay, so user
> have to place code in correct place) I also resize the image to 1000px
> width (also tried other sizes)
> 
> It works quite well when the light is excellent, but it is really hard
> to recognize correctly when the lighting isn't perfect.
> 
> The last digit of isbn number is a control sum.
> 
> What can I do to make it work better? Is there any way to say
> tesserect to recognize text only in given regular expression? Maybe i
> should do something with image first?
> 
> Sample images, that are not recognized correctly:
> http://img412.imageshack.us/i/img0367si.jpg/
> http://img264.imageshack.us/i/img0361d.jpg/
> 
> At the moment I am also making pictures at 2x zoom so camera is not so
> close to the object. It gives better results, but it is easier to move
> the camera and take fuzzy image.
> 
> -- 
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to tesseract-ocr
> [email protected].
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en.


-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en.

Reply via email to