Tobias, I don't quite follow. How are those mediapackages invalid? When mediapackage elements are cleaned up, they are effectively taken out from the manifests and so on, so I don't know why there should be "pointers to files and catalogs that do not exist anymore".
I'm particularly against keeping intermediate "work" files, which are created in the middle steps of the workflow but never get distributed. Those files are a consequence of the specific implementation of the workflow and, should another workflow be run, they should be re-created as needed. After the workflow ends, in my view they're just garbage (which accounts for the fact that the name given to the operation that gets rid of those files is "cleanup"). If somehow the cleanup operation is not correctly deleting all the references to the deleted files, then the question is not skipping the cleanup operation alltogether, because the need to save disk space is still there, and it's critical in most cases. The right way to go is fixing the cleanup operation, or whichever processes that are failing to update the broken references. If the problem is that the distributed files are not kept, then it's a question of changing the default workflow and tell the "cleanup" operation to keep those files also. As an adopter institution, the disk space consumed by Matterhorn (by our media content in general) is a critical issue. I won't vote on this until knowing about those "broken references" better, but the cleanup operation makes the disk comsumption more efficient, and in general I'm against removing it completely from the default workflow. Regards Rubén 2012/6/19 Christopher Brooks <[email protected]> > We already have a bit of an issue with disk space in MH, so we would > end up turning this back on right away. We archive outside of the > system atm. > > To me it's no big deal to have it on or off by default, so 0 from me, > > Chris > > On Tue, 19 Jun 2012 19:24:24 +0200 > Tobias Wunden <[email protected]> wrote: > > > As part of our default workflow, we enforce the "cleanup" operation, > > which is removing all files referenced by a single media package from > > the file system in order to save space and prevent disks from filling > > up. So far for the pros of making this the default behavior. > > > > On the downside, mediapackages get added to the various indexes in > > Matterhorn (workflow, episode, search) that can't be used for further > > (or repeated) processing since they are basically invalid, having > > pointers to files and catalogs that don't exist anymore. This will be > > true until we have a working archive in place which unfortunately > > does note seem doable until 1.5. > > > > In order to make sure people make an active choice to have their > > original media and catalogs removed, I #propose to comment out that > > operation and add documentation right into the workflow that explains > > what happens if a user choses to enable it. But if they don't they > > are still by default maintaining a working installation with valid > > media packages. > > > > I am especially looking for feedback from adopters. > > > > Tobias > > _______________________________________________ > > Matterhorn mailing list > > [email protected] > > http://lists.opencastproject.org/mailman/listinfo/matterhorn > > > > > > To unsubscribe please email > > [email protected] > > _______________________________________________ > > > > -- > Christopher Brooks, BSc, MSc > ARIES Laboratory, University of Saskatchewan > > Web: http://www.cs.usask.ca/~cab938 > Phone: 1.306.966.1442 > Mail: Advanced Research in Intelligent Educational Systems Laboratory > Department of Computer Science > University of Saskatchewan > 176 Thorvaldson Building > 110 Science Place > Saskatoon, SK > S7N 5C9 > _______________________________________________ > Matterhorn mailing list > [email protected] > http://lists.opencastproject.org/mailman/listinfo/matterhorn > > > To unsubscribe please email > [email protected] > _______________________________________________ >
_______________________________________________ Matterhorn mailing list [email protected] http://lists.opencastproject.org/mailman/listinfo/matterhorn To unsubscribe please email [email protected] _______________________________________________
