Hi,

I'm having some issues with text extraction on our fresh 1.2 installation.
I keep getting the following error:

2011-10-13 13:03:31 WARN (TextAnalyzerServiceImpl:229) - Error extracting text from http://ic**.ugent.be:8080/files/collection/composer/550.tif
java.lang.IllegalArgumentException: The text cannot be empty
        at org.opencastproject.metadata.mpeg7.TextualImpl.<init>(TextualImpl.java:81)
        at org.opencastproject.textanalyzer.impl.TextAnalyzerServiceImpl.analyze(TextAnalyzerServiceImpl.java:324)
        at org.opencastproject.textanalyzer.impl.TextAnalyzerServiceImpl.extract(TextAnalyzerServiceImpl.java:194)
        at org.opencastproject.textanalyzer.impl.TextAnalyzerServiceImpl.process(TextAnalyzerServiceImpl.java:253)
        at org.opencastproject.job.api.AbstractJobProducer$JobRunner.call(AbstractJobProducer.java:184)
        at org.opencastproject.job.api.AbstractJobProducer$JobRunner.call(AbstractJobProducer.java:156)
        at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
        at java.util.concurrent.FutureTask.run(FutureTask.java:138)
        at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
        at java.lang.Thread.run(Thread.java:662)

This error is repeated a number of times in the log. Text extraction does not fail for every image, only for some, but as a result the recording
ends up with the status failed and the following error:

org.opencastproject.workflow.api.WorkflowOperationException: org.opencastproject.workflow.api.WorkflowOperationException: Text extraction failed on images from http://ic**.ugent.be:8080/files/mediapackage/5952f751-e8f9-41e5-b55d-7002ca31a67b/8fd9ca3d-cfbc-429a-a035-2ddcbf608412/logica_trimmed.avi

These are tests with manually uploaded files. Could that be a factor in why it fails?

Thanks

Kristof Keppens
_______________________________________________
Matterhorn-users mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn-users
