Hi, I'm really frustrated and I hope someone can help me out. I'm using Tesseract and OCRopus to OCR scientific documents that have a very specialized vocabulary. However, I've noticed that Tesseract doesn't always perform well on these documents.
I would like to bias Tesseract toward recognizing words that are frequently used in my documents. However, my additions to the "eng.user-words" file seem to have no effect. Can anybody recommend a method for training Tesseract to better recognize my particular vocabulary?

Thanks so much for your help!

Sincerely,
Kyle

