http://sourceforge.net/projects/vietocr/files/vietocr/4.0%20Beta/
Version 4.0 Beta (29 July 2014)
- Upgrade to Tesseract 3.03 RC (r1127)
- Upgrade Tess4J library
- Update JNA to v4.1.0
- Update Ghost4J to v0.5.1
- Add support for searchable PDF output in bulk/batch mode

ShreeDevi
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Thu, Oct 23, 2014 at 12:24 PM, ShreeDevi Kumar <[email protected]> wrote:

> Try the .NET wrapper with a newer version of Tesseract.
>
> Invert the image, smooth/blur it, make it greyscale ... I tried with VietOCR.
>
> The output is 'QBCDEFGHIJKL'.
>
> ShreeDevi
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
> On Thu, Oct 23, 2014 at 12:07 PM, <[email protected]> wrote:
>
>> Hello.
>>
>> I have images that contain characters that are made from individual dots,
>> like from a dot matrix printer. I tried various operations on the images
>> (binarization, edge detection, dilation, ...) and was able to make the
>> dots bigger so that they are connected 90% of the time. However, detection
>> is still very bad.
>>
>> This image contains the characters A to L:
>>
>> <https://lh3.googleusercontent.com/-WxgjmUF846M/VEig6eA1FNI/AAAAAAAAAAM/BdQPQPVTUrs/s1600/AL.png>
>>
>> My modified version is:
>>
>> <https://lh5.googleusercontent.com/-TUZSXsiBHJY/VEihDy5RCUI/AAAAAAAAAAU/HmwIkEemSAY/s1600/AL2.png>
>>
>> After recognition, Tesseract (3.02, using the .NET wrapper) gives me the
>> characters "FJBEDEFEHIJKL" for the standard English language. Only the last
>> 5 characters are right; the rest are wrong. Do you know of a way to improve
>> recognition besides training a new font for this special case?
>> Tesseract works quite well for other projects I have, so I would love a
>> solution that does not rely on a special font if possible.

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduU0z2Mkyjxwe1dxXTGKjB9BSxqcN%3D_p95CgBxPm%3DFJkaQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.
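
For anyone wanting to try the preprocessing suggested in the reply above (greyscale, invert, blur), here is a minimal sketch in plain Python on nested lists. It is not what VietOCR does internally. The function names are hypothetical, the greyscale weights (the common 0.299/0.587/0.114 luma weights) and the 3x3 mean filter are assumptions standing in for whatever smoothing was actually used, and the order of the steps is also a guess. In practice you would do this with an imaging library and then pass the result to Tesseract.

```python
def to_greyscale(rgb_image):
    """Convert rows of (r, g, b) tuples into rows of 0-255 grey levels.

    The luma weights are an assumption; the thread does not specify any.
    """
    return [
        [round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
        for row in rgb_image
    ]


def invert(grey_image):
    """Swap light and dark: 0 becomes 255 and vice versa."""
    return [[255 - pixel for pixel in row] for row in grey_image]


def box_blur(grey_image):
    """3x3 mean filter; border pixels average over the neighbours that exist.

    Blurring smears isolated dots into their neighbours, which may help
    dot-matrix glyphs look more like solid strokes.
    """
    height = len(grey_image)
    width = len(grey_image[0]) if height else 0
    blurred = []
    for y in range(height):
        out_row = []
        for x in range(width):
            total = 0
            count = 0
            for ny in range(max(0, y - 1), min(height, y + 2)):
                for nx in range(max(0, x - 1), min(width, x + 2)):
                    total += grey_image[ny][nx]
                    count += 1
            out_row.append(total // count)
        blurred.append(out_row)
    return blurred


def preprocess(rgb_image):
    """Greyscale, then invert, then blur (order is an assumption)."""
    return box_blur(invert(to_greyscale(rgb_image)))
```

Whether this helps on the dot-matrix sample is something to check against the actual image; the blur radius in particular would likely need tuning to the dot spacing.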

