Hi Hank,

Don't know for sure, but there was a dictionary issue that was fixed in the upcoming 1.1.1 branch:

http://opencast.jira.com/browse/MH-7618

Might be related,
Chris

On Thu, 19 May 2011 15:14:53 -0700 (PDT) Hank Magnuski <[email protected]> wrote:

> I'm trying out the OCR-->text function and I'm getting about 25%
> recognizable words and 75% gibberish.
>
> My dictionaries were registered into the database and I see the
> tables have about 20K entries.
>
> Any hints on debugging this? I'm using the default workflow.
>
> There are a lot of words missing, too. Does the 3rd party tools
> package produce reasonable quality text extraction?
>
> Hank
> _______________________________________________
> Matterhorn-users mailing list
> [email protected]
> http://lists.opencastproject.org/mailman/listinfo/matterhorn-users
