Helping Tesseract recognize scientific vocabulary

Kyle Jensen Sun, 22 Feb 2009 19:59:30 -0800

Hi,

I'm really frustrated and I hope someone can help me out.  I'm using
Tesseract and OCRopus to OCR scientific documents that have a very
particular vocabulary.  However, I've noticed that Tesseract doesn't
always recognize the words in these documents well.


I would like to bias Tesseract toward recognizing words that are
frequently used in my documents.  However, my additions to the
"eng.user-words" files seem to have no effect.

Can anybody recommend a method for training Tesseract to better
recognize my particular vocabulary?  Thanks so much for your help!

Sincerely,
Kyle
