Hello, I am having some trouble getting Tesseract to recognize individual characters. Whenever I think I have overcome actual errors, I get the line "Too few characters. Skipping this page"
Because I am using Tess4J I have been struggling to find out exactly what all of the different options you can set for Tesseract actually are. Would anyone be able to tell me if there is a way to set it to not limit the minimum number of characters on a page? Also, I am trying to get Tesseract to recognise characters from chemical elements (example attached.) Will Tesseract be able to ignore the structure and just pick up on the characters? Basically any advice as to what would be a good way to go about this would be helpful! Even if I should look at training Tesseract or creating a word list with the chemical elements or something? Thanks a lot! Chris -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/484898bc-71e4-44ed-8327-e731a8100c0d%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

