Re: [Matterhorn-users] OCR options

Tobias Wunden Tue, 03 Jul 2012 07:46:38 -0700

Hi,

if you know what's on the slides and when, the best strategy would be to add an 
MPEG-7 catalog just as the OCR service is doing, containing the time segments 
and the text within each segment. Take a look a the classes


- VideoSegmenterWorkflowOperationHandler and
- TextAnalysisWorkflowOperationHandler

to see what is happening. If you provide your own catalogs and get the 
flavors/tags right, you can easily implement your own algorithm or way of 
providing segments and texts.

Tobias

On 29.06.2012, at 21:03, Бисер Миланов <[email protected]> wrote:

> Greetings,
> 
> I was wondering do we have control over the OCR that is being done on the 
> slides, apart from inserting new dictionaries? For instance, if I already 
> know what is on the slides, I don't even need OCR in the first place -- I 
> would just fill the information out. To be more precise, can I edit the 
> strings that are included in the media package that are generated from the 
> OCR?
> 
> Regards,
> Biser Milanov
> _______________________________________________
> Matterhorn-users mailing list
> [email protected]
> http://lists.opencastproject.org/mailman/listinfo/matterhorn-users

_______________________________________________
Matterhorn-users mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn-users

Re: [Matterhorn-users] OCR options

Reply via email to