John and Matterhorn workflow fans,

The fix John submitted publishes dublincore to new distributions that go through a full workflow. We are trying to adapt the fix with a short workflow applied to episodes, see below, to publish dublincore to already published distributions. The short workflow tags dublincore/episode, dublincore/series and the mpeg-7/text flavors for publish.

The problem is that a bad mpeg-7 file is being sent by the workflow. The Admin interface, Inspect Workflow -> Mediapackage, has the mpeg-7/text location to a good file located on the Episode server. But, that good file is not being accessed during the short workflow. The path to an empty file on the Admin server is accessed instead.

My hunch is that a temporary mpeg-7 was created on the Admin during it's first workflow. Somehow, the file was not archived but the link was.

Course of action...
1) Figure out how admin knows about a good link, but the bad link takes over in the workflow.
2) Figure out how the good link can be tagged and sent in publish step.
3) If it's really necessary to have the mpeg-7 on the Admin, ensure the mpeg-7 file is archived on the Admin for future workflows. It doesn't help the current problem.

I am grateful for any insights, suggestions, or alternative approaches before I dig deeper.

Best Regards,
Karen

More Background

In the Matterhorn Admin's Inspect Workflow -> Mediapackage there are good links to the the appropriate xml files. The mpeg-7/text link is to the Engage server, not the Admin server where the dublincore files are.

During the workflow's publish step, the Engage server chokes with a bad link to the mpeg-7 file. Engage can not parse the empty xml in the link sent, so it logs the error and leaves the workflow hanging. The link sent was to an empty file on the Admin server, not the good link to the mpeg-7 file on the Engage server.

"2012-09-25 16:43:50 WARN (StaticMetadataServiceDublinCoreImpl:293) - Unable to load metadata from catalog 'http://admin...edu/episode/archive/mediapackage/12a663c2-78c1-4ace-bc95-f60fec00ee4b/49037340-33e9-43f8-8a9d-7dba29d0cbef/0' 2012-09-25 16:43:50 DEBUG (SolrIndexManager:383) - Looking for mpeg-7 catalogs containing segment texts ...."

--------------------------------------------
1) Admin Tool Inspect Workflow -> Mediapackage  links
-------------------------------------
dublincore/episode
http://admin...edu/files/mediapackage/12a663c2-78c1-4ace-bc95-f60fec00ee4b/49037340-33e9-43f8-8a9d-7dba29d0cbef/dublincore.xml

dublincore/series
http://admin...edu/files/mediapackage/12a663c2-78c1-4ace-bc95-f60fec00ee4b/43807a83-849c-45c7-8c22-02bbd2e693f5/dublincore.xml

mpeg-7/text
http://engage...edu/static/12a663c2-78c1-4ace-bc95-f60fec00ee4b/catalog-4/slidetext.xml
[Observe, mpeg-7 xml link is to the engage server, not the admin.]

---------------------------------------------------------
2) Workflow Applied to Episodes to try get their dublin core over to search:
-----------------------------------------------------------

<?xml version="1.0" encoding="UTF-8"?>
<definition xmlns="http://workflow.opencastproject.org";>

  <!-- Description -->
  <id>DCE-fix-dublin-core-try-3</id>
  <title>DCE: Workaround to add dublin core to already published</title>
  <description>
    Tag source flavors dublincore/* and mpeg-7 for target publish.
    Publish to search index.
  </description>

  <!-- Operations -->
  <operations>
<operation id="tag" description="Tagging dublin core catalogs for publishing">
      <configurations>
<configuration key="source-flavors">mpeg-7/text,dublincore/*</configuration>
        <configuration key="target-tags">publish</configuration>
      </configurations>
    </operation>

    <!-- Publish to engage search index -->
<operation id="publish" fail-on-error="true" exception-handler-workflow="error" description="Published">
      <configurations>
        <configuration key="source-tags">publish</configuration>
      </configurations>
    </operation>
  </operations>
</definition>

-------------------------------------------
3) Engage Error stacktrace in matterhorn.log "
----------------------------------
2012-09-25 16:43:50 DEBUG (WorkspaceImpl:249) - http://admin1.mh-dev.harvard.edu/episode/archive/mediapackage/12a663c2-78c1-4ace-bc9
5-f60fec00ee4b/49037340-33e9-43f8-8a9d-7dba29d0cbef/0 has not been modified.
2012-09-25 16:43:50 WARN (DublinCoreCatalogImpl$DublinCoreParser:843) - Fatal error parsing dublincore catalog: Premature end of file. 2012-09-25 16:43:50 WARN (StaticMetadataServiceDublinCoreImpl:293) - Unable to load metadata from catalog 'http://admin...edu/episode/archive/mediapackage/12a663c2-78c1-4ace-bc95-f60fec00ee4b/49037340-33e9-43f8-8a9d-7dba29d0cbef/0' 2012-09-25 16:43:50 DEBUG (SolrIndexManager:383) - Looking for mpeg-7 catalogs containing segment texts 2012-09-25 16:43:50 DEBUG (WorkspaceImpl:249) - http://admin...edu/episode/archive/mediapackage/12a663c2-78c1-4ace-bc9
5-f60fec00ee4b/catalog-4/0 has not been modified.
2012-09-25 16:43:50 WARN (Mpeg7Parser:397) - Fatal error while parsing mpeg-7 catalog: Premature end of file. 2012-09-25 16:43:50 WARN (SearchRestService:107) - org.apache.solr.client.solrj.SolrServerException: java.lang.IllegalStateExceptio
n: Unable to load mpeg-7 catalog data:Premature end of file.
org.opencastproject.search.api.SearchException: org.apache.solr.client.solrj.SolrServerException: java.lang.IllegalStateException: U
nable to load mpeg-7 catalog data:Premature end of file.
at org.opencastproject.search.impl.SearchServiceImpl.add(SearchServiceImpl.java:374) at org.opencastproject.search.endpoint.SearchRestService.add(SearchRestService.java:104)
        at sun.reflect.GeneratedMethodAccessor195.invoke(Unknown Source)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:616)
at org.apache.cxf.service.invoker.AbstractInvoker.performInvocation(AbstractInvoker.java:173) at org.apache.cxf.service.invoker.AbstractInvoker.invoke(AbstractInvoker.java:89)
        ...

_______________________________________________
Matterhorn mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn


To unsubscribe please email
[email protected]
_______________________________________________

Reply via email to