You can also check whether fail-on-error is set to true for the text extraction operation. That would cause the problem you describe (we have encountered the same issue). Setting fail-on-error to false has the same effect as removing these lines, except that you will still get some text extraction, although in our experience the text extraction only rarely succeeds. So you might want to consider skipping it completely.
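
As a rough way to check that setting without opening each file by hand, here is a minimal Python sketch. It assumes the workflow definition is well-formed XML and that the operation id is "extract-text", as in the fragment quoted further down; the function name is made up for illustration and is not part of Matterhorn.

```python
import xml.etree.ElementTree as ET


def fail_on_error_values(xml_text, operation_id="extract-text"):
    """Return the fail-on-error attribute of every matching <operation>.

    Hypothetical helper: it only reads the XML text passed in and
    changes nothing on disk.
    """
    root = ET.fromstring(xml_text)
    return [
        element.get("fail-on-error")
        for element in root.iter()
        # Compare the local tag name so namespaced files are handled too.
        if element.tag.rsplit("}", 1)[-1] == "operation"
        and element.get("id") == operation_id
    ]
```

Calling it on the contents of each file under $FELIX_HOME/conf/workflow/ should show which definitions still have the operation set to "true". An empty list means the file has no such operation.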

Kristof

On 2011-12-08 12:18, Dr Leslaw Zieleznik wrote:
Reza,

You probably need to create your own dictionary?

Below are instructions on how to disable text extraction, recently published on 
the list:

**************************************************

Non-authoritative answer:

in $FELIX_HOME/conf/workflow/*.xml
remove the following lines:
------------------------------------------
    <!-- Run text analysis -->
    <operation
      id="extract-text"
      fail-on-error="false"
      exception-handler-workflow="error"
      description="Extracting text from presentation segments">
      <configurations>
        <configuration key="source-flavor">presentation/trimmed</configuration>
        <configuration key="source-tags"></configuration>
        <configuration key="target-tags">engage</configuration>
      </configurations>
    </operation>
-------------------------------------------
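
If several workflow files need the same change, the removal could be scripted. The following is only a sketch under stated assumptions: the files are well-formed XML, the operation is identified by id="extract-text" as shown above, and the function name is hypothetical. Note that ElementTree discards XML comments and may alter whitespace, so back up the original files first.

```python
import xml.etree.ElementTree as ET


def remove_operation(xml_text, operation_id="extract-text"):
    """Return xml_text with every <operation id=operation_id> removed.

    Hypothetical helper; comments in the input are dropped by the
    default ElementTree parser.
    """
    root = ET.fromstring(xml_text)
    # Keep a default namespace (if any) unprefixed when serialising.
    if root.tag.startswith("{"):
        ET.register_namespace("", root.tag[1:].split("}", 1)[0])
    for parent in list(root.iter()):
        for child in list(parent):
            # Compare the local tag name so namespaced files are handled too.
            local_name = child.tag.rsplit("}", 1)[-1]
            if local_name == "operation" and child.get("id") == operation_id:
                parent.remove(child)
    return ET.tostring(root, encoding="unicode")
```

The function works on text rather than on files, so the result can be inspected before anything is written back to $FELIX_HOME/conf/workflow/.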

Leslaw

On 8 Dec 2011, at 11:08, VISIONAIRE-Reza Toghraee wrote:

Dear Dr Leslaw


Thank you very much for your reply.
I had enabled the audio and attached a mic as well. But today I realized that 
the audio mode was set to Line. I changed it to MIC and now, at least on the 
preview CGI of the MCA, I can hear the voice as well.

Currently I'm struggling with the OCR. Whenever I make a recording, it 
fails during Text Extraction. Is there any way to disable Text 
Extraction in the workflow?

Thank you
Reza



From: [email protected] 
[mailto:[email protected]] On Behalf Of Dr Leslaw 
Zieleznik
Sent: Thursday, December 08, 2011 1:00 PM
To: Matterhorn Users
Subject: Re: [Matterhorn-users] MattherHorn 1.2 and Epiphan Capture Appliance 
Updates and Issues -- New User

Hi Reza,

0- till now, after recording more than 10 times, I couldn't manage to have
the complete result of recording (Video + VGA + Audio) in Engage.

You need to enable the audio and connect the microphone; it will then start 
recording all three streams.

1- sometimes when recording has to be finished, the server still shows that
the MCA is in Capturing state. I don't understand why the status is not
being updated.

This is a known bug: the MCA sits in the capture state (the workflow shows 
PAUSE) for about 35 min and then does the ingestion.

In summary the device is capturing nicely if you take the above into account - 
see also below.
There is more information about the device's behaviour on the Matterhorn Users list.

3- most of the time, when the MCA ingests the file to the Core and the
Core starts digesting, it fails in the 'Text Extraction' phase. Is there
any way to skip the text extraction phase while digesting?

I have no problem with this; it is only that the OCR is not very accurate.

Calendar Polling Interval: 1 minute
Agent State Push Interval: 60 seconds
Ingest Interval: 2 minutes   (I changed it from 60 minutes to 2 minutes to
speed up the ingest)

I am using a similar setup: 2, 60, 5.

Best,
Leslaw





_______________________________________________
Matterhorn-users mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn-users
