Reza,
You probably need to create your own dictionary?
Below is instruction of how to disable text extraction, recently published on
the list:
**************************************************
Non-authorative answer:
in $FELIX_HOME/conf/workflow/*.xml
remove the following lines:
------------------------------------------
<!-- Run text analysis -->
<operation
id="extract-text"
fail-on-error="false"
exception-handler-workflow="error"
description="Extracting text from presentation segments">
<configurations>
<configuration key="source-flavor">presentation/trimmed</configuration>
<configuration key="source-tags"></configuration>
<configuration key="target-tags">engage</configuration>
</configurations>
</operation>
-------------------------------------------
Leslaw
On 8 Dec 2011, at 11:08, VISIONAIRE-Reza Toghraee wrote:
> Dear Dr Leslaw
>
>
> Thank you very much for your reply.
> I had enabled the Audio and attached a mic as well. But today I realized that
> the Audio mode was set on Line. I changed it to MIC and now at least on
> preview CGI of MCA, I can hear the voice as well.
>
> Currently Im suffering from the OCR. Whenever Im making a recording it is
> failing during the Text Extraction. Is there any way to disable the Text
> Extraction from the workflow?
>
> Thank you
> Reza
>
>
>
> From: [email protected]
> [mailto:[email protected]] On Behalf Of Dr Leslaw
> Zieleznik
> Sent: Thursday, December 08, 2011 1:00 PM
> To: Matterhorn Users
> Subject: Re: [Matterhorn-users] MattherHorn 1.2 and Epiphan Capture Appliance
> Updates and Issues -- New User
>
> Hi Reza,
>
> 0- till now, after recording more than 10 times, I couldn't manage to have
> the complete result of recording (Video + VGA + Audio) in Engage.
>
> You need to enable the audio and connect the microphone, it will then start
> recording all three streams.
>
> 1- sometimes when recording has to be finished, the server still shows that
> the MCA is in Capturing state. I don't understand why the status is not
> being updated.
>
> This is know bug, MCA sits in the capture state (the workflow is showing
> PAUSE) for about 35min and then will do the ingestion.
>
> In summary the device is capturing nicely if you take the above into account
> - see also below.
> There is more information about the device behave on the Matterhorn Users
> list.
>
> 3- almost most of the times, when the MCA ingests the file to the Core and
> Core starts digesting, usually it fails in 'Text Extraction" Phase. Is there
> any way to skip the text extraction phase while digesting?
>
> I have no problem with this, only the text OCR is not very accurate.
>
> Calendar Polling Interval: 1 minuet
> Agent State Push Interval: 60 seconds
> Ingest Interval : 2 minutes (I changed it from 60 minutes to 2 minute to
> speedup the ingest)
>
> I am using a similar setup: 2, 60, 5.
>
> Best,
> Leslaw
>
_______________________________________________
Matterhorn-users mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn-users