Ocropus dictionary and tesseract

Dieselkopf Mon, 20 Oct 2008 01:14:01 -0700

Hi there.

I'm new to Ocropus and currently playing around with it. I thought it
was using a dictionary and OpenFST as a language model. However,
Ocropus sometimes reads a "c" for and "e". It recognises "thc" instead
of "the" and "largc" instead of "large". Shouldn't the dictionary take
care of such ambiguous characters? Also, if I run tesseract on the
same text the character is being recognised correctly. Can someone
explain to me what's going on here? Thanks.


Chris
--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"ocropus" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to [EMAIL PROTECTED]
For more options, visit this group at 
http://groups.google.com/group/ocropus?hl=en
-~----------~----~----~----~------~----~------~--~---

Ocropus dictionary and tesseract

Reply via email to