hi tobias,

thanks for the reply!

On 17/12/12 17:44, Tobias Wunden wrote:
we are using a distributed install of matterhorn 1.3.1.

i'm trying to make sure that only necessary files are left behind so as not to 
fill up our disks too quickly! i think the only files we want to keep are the 
original ingested files (such as the avi uploaded by our epiphan capture agents 
for example).

you also want to keep the original dublin core catalogs.

ah yes, good point.

i'm assuming that files left behind in the shared work directory are not used 
at all and everything is served from the distribution directories?

If everything is configured in the right way, then that assumption is true. If 
you want to make sure, just go to your media module (Engage) and use the REST 
endpoint at /search/docs to take a look at a random MediaPackage XML. If the 
XML does not include any URLs pointing to the working file repository (that 
would be /files/...), then you are free to delete whatever you deem unnecessary.

the only thing that points to /files is the acl. but this doesn't seem to get cleaned up anyway.

the archive operation of the workflow doesn't appear to do anything (other than 
add a message in the log saying the media package is archived) but i can't find 
any documentation saying what it is supposed to do so i'm not sure what is 
supposed to happen. can anybody explain it's purpose?

In 1.3, the archive operation simply adds the mediapackage to to the archive's 
search index. Be sure to realize that in most configurations, using the archive 
will require you to keep archived files in the working file repository or use 
the archive operation in a way to only archive files that are reachable through 
the distribution channels.

1.4 will bring a much enhanced archive that is storing its own copies of the 
content you decide to archive, so the working file repositoy is back to what it 
should be: a repository for keeping working files during processing time.

ok i think i understand. so there is really no need to use the archive operation for now? if we do include it should it be placed after the cleanup?

is there an operation you can add to the workflow to delete all the files from 
the shared workspace?

The "cleanup" operation is what you want to look at.

aha! that does the trick!

i added the following to our (epiphan specific) workflow and it seems to be working nicely:

    <operation
      id="cleanup"
      fail-on-error="false"
      description="Cleaning up">
      <configurations>
<configuration key="preserve-flavors">multitrack/source,dublincore/*</configuration>
      </configurations>
    </operation>

many thanks,

paul.

--
Paul Pettit
Multimedia Development
University of Sussex
+44 (0) 1273 87 3223
_______________________________________________
Matterhorn-users mailing list
[email protected]
http://lists.opencastproject.org/mailman/listinfo/matterhorn-users

Reply via email to