The text segments also seem scrambled in order - the text listed does not match the slide shown on the left side.

Hank

On Thu, 19 May 2011, Hank Magnuski wrote:


I'm trying out the OCR-->text function and I'm getting about 25% recognizable words and 75% gibberish.

My dictionaries were registered into the database and I see the tables have about 20K entries.

Any hints on debugging this? I'm using the default workflow.

There are a lot of words missing, too. Does the 3rd party tools package produce reasonable quality text extraction?
_______________________________________________
Matterhorn-users mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn-users

Reply via email to