We keep large objects as E datastreams (on the file system), and in BagIt.

We've just been putting up with the wait while the checksums are
validated for the E datastreams, which is agonizingly slow on SAMFS.
Asger's suggestion would be really useful in those contexts.  Perhaps even
better would be a hook for file system data that understands BagIt...
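
To make the BagIt side concrete, here is a rough sketch of what checking
fixity outside Fedora could look like. It is not a Fedora hook or any
existing API; the function names file_digest and verify_bag are made up
for illustration. It assumes a bag with a manifest-md5.txt at its root
whose lines have the form "checksum path", and it ignores details such as
percent-encoded paths and tag manifests.

```python
import hashlib
from pathlib import Path


def file_digest(path, algorithm="md5", chunk_size=1024 * 1024):
    """Hash a file in fixed-size chunks so a large video is never read into memory at once."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_bag(bag_dir, algorithm="md5"):
    """Return (relative_path, problem) pairs for payload files that fail the manifest check.

    Hypothetical helper: assumes manifest-<algorithm>.txt sits in the bag root.
    """
    bag_dir = Path(bag_dir)
    manifest = bag_dir / f"manifest-{algorithm}.txt"
    problems = []
    for line in manifest.read_text(encoding="utf-8").splitlines():
        if not line.strip():
            continue  # skip blank lines
        # Each entry is "<checksum><whitespace><path relative to the bag root>".
        expected, rel_path = line.split(None, 1)
        rel_path = rel_path.strip()
        target = bag_dir / rel_path
        if not target.is_file():
            problems.append((rel_path, "missing"))
        elif file_digest(target, algorithm) != expected.lower():
            problems.append((rel_path, "checksum mismatch"))
    return problems
```

Something like verify_bag("/path/to/bag") could run from cron on its own
schedule, so the slow reads happen outside of any Fedora request. An empty
list means every file listed in the manifest matched.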

On Tue, Oct 25, 2011 at 4:55 PM, Chris Wilper <cwil...@duraspace.org> wrote:
> Hi Adam,
>
> I suspect you were running up against this problem, reported by Asger
> a while ago:
>
> https://jira.duraspace.org/browse/FCREPO-752
>
> Your approach sounds pretty reasonable to me. Using "E" datastreams is
> often done for repositories dealing with very large files for a
> variety of reasons. And storing such datastreams in a BagIt structure
> seems like a good idea on its own. Perhaps at some point when
> FCREPO-752 is fixed, it will be more practical to *also* store fixity
> info inside the Fedora objects.
>
> I'm also interested in other people's opinions and experiences with
> this kind of approach.
>
> - Chris
>
> On Tue, Oct 25, 2011 at 10:56 AM, Adam Wead <amsterda...@gmail.com> wrote:
>> Hi all,
>>
>> I'd like to throw out my latest strategy for storing external video files in
>> Fedora and have you all pick at it, see if I'm doing something dumb.
>>
>> My first attempt used external datastreams (type E) with checksums.  This
>> worked fine in test scenarios but I ran into problems with large video
>> files.  I would get timeouts from the Ruby code that I was using.  I'm
>> developing a Hydra head to ingest the video.  I was going to start digging
>> in and figure out what was going on, but a weekend off made me think it
>> might be better (and easier) to decouple these two things a bit more.
>>
>> The next idea I have is to use external datastreams but ditch Fedora's
>> checksums.   I didn't have any problems when I tried this last week with
>> large files, so I know it will work.  Obviously, I need the checksums
>> somewhere, so I'm considering adopting a BagIt structure on the filesystem
>> to store all the video files, or groups of video files, and then point
>> Fedora to the location of each file.
>>
>> Has anyone tried such a scenario with or without success?  I could always
>> put a checksum file right alongside the video file and forgo the BagIt
>> option, but I like the idea of having two independent but functional
>> systems: on one side, Fedora, which tracks the locations of the files and
>> their metadata, and on the other, BagIt, which tracks the binary data.
>>
>> As a matter of policy, I would use Fedora's checksums for data stored within
>> Fedora, which we don't have at the moment, but I anticipate using for images
>> and other smaller sized files.
>>
>> Any thoughts are appreciated.
>>
>> thanks,
>>
>> ...adam
>>
>> ____________________________________________
>> Adam Wead
>> Systems and Digital Collections Librarian
>> Rock and Roll Hall of Fame and Museum
>> 216.515.1960 (t)
>> 215.515.1964 (f)
>>
>> ------------------------------------------------------------------------------
>> The demand for IT networking professionals continues to grow, and the
>> demand for specialized networking skills is growing even more rapidly.
>> Take a complimentary Learning@Cisco Self-Assessment and learn
>> about Cisco certifications, training, and career opportunities.
>> http://p.sf.net/sfu/cisco-dev2dev
>> _______________________________________________
>> Fedora-commons-users mailing list
>> Fedora-commons-users@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users
>>
>>
>
