The text segments also seem scrambled in order - the text listed does not match
the slide shown on the left side.
Hank
On Thu, 19 May 2011, Hank Magnuski wrote:
I'm trying out the OCR-->text function and I'm getting about 25% recognizable
words and 75% gibberish.
My dictionaries were registered into the database and I see the tables have
about 20K entries.
Any hints on debugging this? I'm using the default workflow.
There are a lot of words missing, too. Does the 3rd party tools package
produce reasonable quality text extraction?
_______________________________________________
Matterhorn-users mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn-users