Hi Andreas,
I didn't investigate it any further, hoping it would be resolved in the
next version. I haven't had the chance to test text extraction in the
1.3 release yet, however. If it isn't resolved in 1.3, we'll
investigate further and try to fix it.
Kristof
On 02/18/2012 08:13 PM, [email protected] wrote:
Hi Kristof,
Allow me a short question: did you get any further in investigating
why most of the images are grey, with only the occasional correct slide?
We are seeing the same thing here (I only just noticed it). Do you know
an easy trick to make this work? Did you look into it more deeply (so I
don't have to ;) )?
Regards, Andreas
Kristof Keppens wrote on Tue, 22 Nov 2011, subject "Re:
[Matterhorn-users]...":
Hi,
We are getting further with the setup of our Matterhorn
infrastructure; so far most things work and we are almost ready
to launch the 1.2 version. However, the problem with the text
extraction is still there and I haven't found a solution yet. I
did find the reason why the text extraction fails: the TIFF file
generated for text extraction is, most of the time, a blank image,
solid grey and always the same file size. Once in a while a correct
TIFF file is generated, and the text extraction then works fine.
I don't see a clear connection between the successful TIFF files and
the failed ones (roughly 1 in 10 TIFF files is correct).
Has anyone else experienced this problem and found a solution?
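In case it helps anyone compare notes, here is a minimal sketch of how
the blank frames could be told apart from the good ones. It is only an
illustration under assumptions, not Matterhorn code: the class name
BlankFrameCheck and the method isUniform are made up for this example,
and it works on an image that has already been decoded into a
BufferedImage (how you decode the TIFF depends on your Java version and
the ImageIO plugins you have installed). The main method uses synthetic
images in place of real extracted frames.

```java
import java.awt.image.BufferedImage;

/** Hypothetical helper (not part of Matterhorn): flags single-colour frames. */
final class BlankFrameCheck {

    /** Returns true when every pixel equals the top-left pixel. */
    static boolean isUniform(BufferedImage image) {
        int first = image.getRGB(0, 0);
        for (int y = 0; y < image.getHeight(); y++) {
            for (int x = 0; x < image.getWidth(); x++) {
                if (image.getRGB(x, y) != first) {
                    return false;
                }
            }
        }
        return true;
    }

    /** Builds a synthetic test image filled with one RGB colour. */
    static BufferedImage solid(int width, int height, int rgb) {
        BufferedImage image = new BufferedImage(width, height, BufferedImage.TYPE_INT_RGB);
        for (int y = 0; y < height; y++) {
            for (int x = 0; x < width; x++) {
                image.setRGB(x, y, rgb);
            }
        }
        return image;
    }

    public static void main(String[] args) {
        // Stand-in for a failed frame: solid grey.
        BufferedImage blank = solid(64, 48, 0x808080);

        // Stand-in for a good frame: grey background with one white pixel.
        BufferedImage slide = solid(64, 48, 0x808080);
        slide.setRGB(10, 10, 0xFFFFFF);

        System.out.println("blank frame uniform: " + isUniform(blank));
        System.out.println("slide frame uniform: " + isUniform(slide));
    }
}
```

Running a check like this over the generated TIFF files would at least
give an exact count of blank versus usable frames, which might make it
easier to spot what the successful ones have in common.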
Thanks
Kristof Keppens
Ghent University
On 2011-10-13 14:56, Kristof Keppens wrote:
Hi,
I'm having some issues with the text extraction with our fresh 1.2
installation.
I keep getting the following error:
2011-10-13 13:03:31 WARN (TextAnalyzerServiceImpl:229) - Error extracting text from http://ic**.ugent.be:8080/files/collection/composer/550.tif
java.lang.IllegalArgumentException: The text cannot be empty
    at org.opencastproject.metadata.mpeg7.TextualImpl.<init>(TextualImpl.java:81)
    at org.opencastproject.textanalyzer.impl.TextAnalyzerServiceImpl.analyze(TextAnalyzerServiceImpl.java:324)
    at org.opencastproject.textanalyzer.impl.TextAnalyzerServiceImpl.extract(TextAnalyzerServiceImpl.java:194)
    at org.opencastproject.textanalyzer.impl.TextAnalyzerServiceImpl.process(TextAnalyzerServiceImpl.java:253)
    at org.opencastproject.job.api.AbstractJobProducer$JobRunner.call(AbstractJobProducer.java:184)
    at org.opencastproject.job.api.AbstractJobProducer$JobRunner.call(AbstractJobProducer.java:156)
    at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
    at java.util.concurrent.FutureTask.run(FutureTask.java:138)
    at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
    at java.lang.Thread.run(Thread.java:662)
This error is repeated a number of times in the log. The text
extraction does not fail for every image, only for some, but as a
result the recording ends up with the status "failed" and the
following error:
org.opencastproject.workflow.api.WorkflowOperationException: org.opencastproject.workflow.api.WorkflowOperationException: Text extraction failed on images from http://ic**.ugent.be:8080/files/mediapackage/5952f751-e8f9-41e5-b55d-7002ca31a67b/8fd9ca3d-cfbc-429a-a035-2ddcbf608412/logica_trimmed.avi
These are tests with manually uploaded files; I'm not sure whether that
could be a factor in the failures.
Thanks
Kristof Keppens
_______________________________________________
Matterhorn-users mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn-users
-----------------------
[email protected]
01/58801 DW 41523
mobil: 0664/60 588 4523
TU Wien
DVR-Nummer: 0005886
-----------------------