John and Matterhorn workflow fans,
The fix John submitted publishes dublincore to new distributions that go
through a full workflow. We are trying to adapt the fix with a short
workflow applied to episodes, see below, to publish dublincore to
already published distributions. The short workflow tags
dublincore/episode, dublincore/series and the mpeg-7/text flavors for
publish.
The problem is that a bad mpeg-7 file is being sent by the workflow. The
Admin interface, Inspect Workflow -> Mediapackage, has the mpeg-7/text
location to a good file located on the Episode server. But, that good
file is not being accessed during the short workflow. The path to an
empty file on the Admin server is accessed instead.
My hunch is that a temporary mpeg-7 was created on the Admin during it's
first workflow. Somehow, the file was not archived but the link was.
Course of action...
1) Figure out how admin knows about a good link, but the bad link takes
over in the workflow.
2) Figure out how the good link can be tagged and sent in publish step.
3) If it's really necessary to have the mpeg-7 on the Admin, ensure the
mpeg-7 file is archived on the Admin for future workflows. It doesn't
help the current problem.
I am grateful for any insights, suggestions, or alternative approaches
before I dig deeper.
Best Regards,
Karen
More Background
In the Matterhorn Admin's Inspect Workflow -> Mediapackage there are
good links to the the appropriate xml files. The mpeg-7/text link is to
the Engage server, not the Admin server where the dublincore files are.
During the workflow's publish step, the Engage server chokes with a bad
link to the mpeg-7 file. Engage can not parse the empty xml in the link
sent, so it logs the error and leaves the workflow hanging. The link
sent was to an empty file on the Admin server, not the good link to the
mpeg-7 file on the Engage server.
"2012-09-25 16:43:50 WARN (StaticMetadataServiceDublinCoreImpl:293) -
Unable to load metadata from catalog
'http://admin...edu/episode/archive/mediapackage/12a663c2-78c1-4ace-bc95-f60fec00ee4b/49037340-33e9-43f8-8a9d-7dba29d0cbef/0'
2012-09-25 16:43:50 DEBUG (SolrIndexManager:383) - Looking for mpeg-7
catalogs containing segment texts ...."
--------------------------------------------
1) Admin Tool Inspect Workflow -> Mediapackage links
-------------------------------------
dublincore/episode
http://admin...edu/files/mediapackage/12a663c2-78c1-4ace-bc95-f60fec00ee4b/49037340-33e9-43f8-8a9d-7dba29d0cbef/dublincore.xml
dublincore/series
http://admin...edu/files/mediapackage/12a663c2-78c1-4ace-bc95-f60fec00ee4b/43807a83-849c-45c7-8c22-02bbd2e693f5/dublincore.xml
mpeg-7/text
http://engage...edu/static/12a663c2-78c1-4ace-bc95-f60fec00ee4b/catalog-4/slidetext.xml
[Observe, mpeg-7 xml link is to the engage server, not the admin.]
---------------------------------------------------------
2) Workflow Applied to Episodes to try get their dublin core over to search:
-----------------------------------------------------------
<?xml version="1.0" encoding="UTF-8"?>
<definition xmlns="http://workflow.opencastproject.org">
<!-- Description -->
<id>DCE-fix-dublin-core-try-3</id>
<title>DCE: Workaround to add dublin core to already published</title>
<description>
Tag source flavors dublincore/* and mpeg-7 for target publish.
Publish to search index.
</description>
<!-- Operations -->
<operations>
<operation id="tag" description="Tagging dublin core catalogs for
publishing">
<configurations>
<configuration
key="source-flavors">mpeg-7/text,dublincore/*</configuration>
<configuration key="target-tags">publish</configuration>
</configurations>
</operation>
<!-- Publish to engage search index -->
<operation id="publish" fail-on-error="true"
exception-handler-workflow="error" description="Published">
<configurations>
<configuration key="source-tags">publish</configuration>
</configurations>
</operation>
</operations>
</definition>
-------------------------------------------
3) Engage Error stacktrace in matterhorn.log "
----------------------------------
2012-09-25 16:43:50 DEBUG (WorkspaceImpl:249) -
http://admin1.mh-dev.harvard.edu/episode/archive/mediapackage/12a663c2-78c1-4ace-bc9
5-f60fec00ee4b/49037340-33e9-43f8-8a9d-7dba29d0cbef/0 has not been modified.
2012-09-25 16:43:50 WARN (DublinCoreCatalogImpl$DublinCoreParser:843) -
Fatal error parsing dublincore catalog: Premature end of file.
2012-09-25 16:43:50 WARN (StaticMetadataServiceDublinCoreImpl:293) -
Unable to load metadata from catalog
'http://admin...edu/episode/archive/mediapackage/12a663c2-78c1-4ace-bc95-f60fec00ee4b/49037340-33e9-43f8-8a9d-7dba29d0cbef/0'
2012-09-25 16:43:50 DEBUG (SolrIndexManager:383) - Looking for mpeg-7
catalogs containing segment texts
2012-09-25 16:43:50 DEBUG (WorkspaceImpl:249) -
http://admin...edu/episode/archive/mediapackage/12a663c2-78c1-4ace-bc9
5-f60fec00ee4b/catalog-4/0 has not been modified.
2012-09-25 16:43:50 WARN (Mpeg7Parser:397) - Fatal error while parsing
mpeg-7 catalog: Premature end of file.
2012-09-25 16:43:50 WARN (SearchRestService:107) -
org.apache.solr.client.solrj.SolrServerException:
java.lang.IllegalStateExceptio
n: Unable to load mpeg-7 catalog data:Premature end of file.
org.opencastproject.search.api.SearchException:
org.apache.solr.client.solrj.SolrServerException:
java.lang.IllegalStateException: U
nable to load mpeg-7 catalog data:Premature end of file.
at
org.opencastproject.search.impl.SearchServiceImpl.add(SearchServiceImpl.java:374)
at
org.opencastproject.search.endpoint.SearchRestService.add(SearchRestService.java:104)
at sun.reflect.GeneratedMethodAccessor195.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:616)
at
org.apache.cxf.service.invoker.AbstractInvoker.performInvocation(AbstractInvoker.java:173)
at
org.apache.cxf.service.invoker.AbstractInvoker.invoke(AbstractInvoker.java:89)
...
_______________________________________________
Matterhorn mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn
To unsubscribe please email
[email protected]
_______________________________________________