Re: [Matterhorn-users] OCR Text Tuneup - more

Hank Magnuski Thu, 19 May 2011 17:04:10 -0700

The text segments also seem scrambled in order - the text listed does not match the slide shown on the left side.


Hank

On Thu, 19 May 2011, Hank Magnuski wrote:

I'm trying out the OCR-->text function and I'm getting about 25% recognizable words and 75% gibberish.
My dictionaries were registered into the database and I see the tables have about 20K entries.
Any hints on debugging this? I'm using the default workflow.
There are a lot of words missing, too. Does the 3rd party tools package produce reasonable quality text extraction?

_______________________________________________
Matterhorn-users mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn-users
