just wanted to follow up I wrote some simple code to preprocess the image because I realized I will be doing basically the same image every time so its foolish to try and use Tesseracts binaziration technique which was designed for a different and more general purpose. So basically I just turned every pixel white that wasnt a pixel that contained part of a letter, and when I send that to tesseract I get flawless output with the language data I trained. Thanks so much for the replies Paul and Nick, I learned a lot and it put me in the right direction! cheers!
-- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/d3c3ceff-d632-46a6-81ed-7625d1ddce2e%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.