[tesseract-ocr] Too few characters. Skipping this page

Chris Nevin Sat, 19 Apr 2014 11:21:07 -0700

Hello,

 I am having some trouble getting Tesseract to recognize individual 
characters. Whenever I think I have overcome actual errors, I get the line 
"Too few characters. Skipping this page"


Because I am using Tess4J I have been struggling to find out exactly what 
all of the different options you can set for Tesseract actually are. Would 
anyone be able to tell me if there is a way to set it to not limit the 
minimum number of characters on a page?

Also, I am trying to get Tesseract to recognise characters from chemical 
elements (example attached.) Will Tesseract be able to ignore the structure 
and just pick up on the characters?

Basically any advice as to what would be a good way to go about this would 
be helpful! Even if I should look at training Tesseract or creating a word 
list with the chemical elements or something?

Thanks a lot!

   Chris

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/484898bc-71e4-44ed-8327-e731a8100c0d%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[tesseract-ocr] Too few characters. Skipping this page

Reply via email to