Adam--

To your Akubra question: We've been working on an Akubra architecture utilizing 
multiple filesystems multiplexed together in a binary tree developed from a 
custom URI hash calculation, which we're putting into production this month. 
The relevance is that we're very interested in bringing Akubra's Quiescing 
Storage design into play for HSM (our IT folks currently support a Quantum 
StorNext instance which we use for workflow but without strong integration with 
the repository architecture), and would be interested in discussing partnership 
with another institution to do just that.

An alternative would be to employ a rules-based storage architecture like 
iRODS. An iRODS-Fedora integration is available now, although I believe it is 
under redesign for Akubra.

I'm thinking that in any naive implementation of your first two designs, 
applications that request datastreams from Fedora will have to be aware that 
responses from an HTTP request to a Fedora Web service may take as long to 
appear as the HSM takes to retrieve material. If your HSM is like ours, that is 
often several minutes and must involve a checksum operation for surety. This 
implies some sophisticated behavior on the part of requesting applications that 
must present some unusual workflow to an end user. ("Click here and wait ten 
minutes for download to begin"?) Of course, your HSM may be better than ours 
and be able to respond more immediately with more immediately useful 
information. If it is and can, those designs make more sense. (And I envy you 
your HSM!)

---
A. Soroka
Online Library Environment
the University of Virginia Library




On Mar 21, 2011, at 4:56 PM, Adam Wead wrote:

> Dear all,
> 
> I'm currently mulling options as to how to implement a storage solution for 
> our fedora repository.  For backup and disaster recovery, I'm using Tivoli 
> Storage Manager, with a SAN and LTO tape environment.  I also have a Tivoli 
> add-on called Space Management which acts as a hierarchical storage manager 
> and moves files from disk to tape while leaving a portion of the file on the 
> disk.  We'll be holding a lot of uncompressed video files which won't change 
> (ever) and won't be accessed very often.  Possible strategies include:
> 
> 1. Place Fedora's objects directory atop an HSM-managed filesystem, exported 
> to the fedora server via NFS.  Tweak HSM policies so that large files are 
> moved out to tape, but Fedora can still "see" that they're there.  Smaller 
> derivative files remain on disk for faster access.  Advantages: Fedora just 
> gets a filesystem.  Everything else is pushed back to the storage layer.  
> Drawbacks/unknowns: complexity, performance via NFS and HSM, unknowns, others?
> 
> 2. Keep the large files away from Fedora as externally-referenced objects 
> sitting on an HSM filesystem; they're pulled from tape when needed.  
> Derivative and datastream data stays in the repository.  Advantages: simpler 
> implementation.  Drawbacks: more complicated ingest putting derivative files 
> in one place and preservation files in another, maintaining link between the 
> object reference and its actual file.
> 
> 3. Akubra?  The above two examples assume the default low-level storage 
> method in versions prior to 3.4.  I haven't used Akubra and don't know what 
> it's able to do in this regard, or if it's a better option to use with #1 or 
> #2 instead of the default objects directory.
> 
> From what I've seen on the wiki, HSM directly with Fedora is on the wishlist, 
> but not ready.  Is anyone out there using Fedora with HSM in some way, 
> doesn't have to be IBM specifically, and could offer suggestions, advice, 
> caveats?
> 
> thanks in advance,
> 
> ...adam
> ____________________________________________
> Adam Wead
> Systems and Digital Collections Librarian
> Rock and Roll Hall of Fame and Museum
> 216.515.1960 (t)
> 215.515.1964 (f)
> ------------------------------------------------------------------------------
> Enable your software for Intel(R) Active Management Technology to meet the
> growing manageability and security demands of your customers. Businesses
> are taking advantage of Intel(R) vPro (TM) technology - will your software 
> be a part of the solution? Download the Intel(R) Manageability Checker 
> today! 
> http://p.sf.net/sfu/intel-dev2devmar_______________________________________________
> Fedora-commons-users mailing list
> Fedora-commons-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users


------------------------------------------------------------------------------
Enable your software for Intel(R) Active Management Technology to meet the
growing manageability and security demands of your customers. Businesses
are taking advantage of Intel(R) vPro (TM) technology - will your software 
be a part of the solution? Download the Intel(R) Manageability Checker 
today! http://p.sf.net/sfu/intel-dev2devmar
_______________________________________________
Fedora-commons-users mailing list
Fedora-commons-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/fedora-commons-users

Reply via email to