Re: [Matterhorn-users] OCR Text Tuneup

Christopher Brooks Thu, 19 May 2011 21:10:29 -0700

Hi Hank,

Don't know for sure, but there was a dictionary issue that was fixed
in the upcoming 1.1.1 branch:


http://opencast.jira.com/browse/MH-7618

Might be related,

Chris

On Thu, 19 May 2011 15:14:53
-0700 (PDT) Hank Magnuski <[email protected]> wrote:

> 
> I'm trying out the OCR-->text function and I'm getting about 25%
> recognizable words and 75% gibberish.
> 
> My dictionaries were registered into the database and I see the
> tables have about 20K entries.
> 
> Any hints on debugging this? I'm using the default workflow.
> 
> There are a lot of words missing, too. Does the 3rd party tools
> package produce reasonable quality text extraction?
> 
> Hank
> _______________________________________________
> Matterhorn-users mailing list
> [email protected]
> http://lists.opencastproject.org/mailman/listinfo/matterhorn-users

_______________________________________________
Matterhorn-users mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn-users

Re: [Matterhorn-users] OCR Text Tuneup

Reply via email to