Tim Donohue created DS-1382:
-------------------------------

             Summary: AIP Backup & Restore functionality should not duplicate 
unchanged files across Item Versions
                 Key: DS-1382
                 URL: https://jira.duraspace.org/browse/DS-1382
             Project: DSpace
          Issue Type: Improvement
          Components: DSpace API
    Affects Versions: 3.0
            Reporter: Tim Donohue
            Priority: Major


The DSpace 3.0 model for storing Item Versions in AIPs is to generate a 
*separate* AIP for each version of the Item. 

Suppose you have an Item "123/45" with old versions "123/45.1" and "123/45.2". 
To export all versions, you'd need to export a total of 3 AIPs (123-45.zip, 
123-45.1.zip and 123-45.2.zip), one for each version. 

Although this may sound reasonable, it can lead to "ballooning storage costs" 
as you version Items. Since 3 AIPs are generated in the above example, each of 
the 3 AIPs must duplicate all content files within it. So, if the size of the 
initial AIP is 100KB, after 10 versions, you may be storing around 
10x100KB=~1MB of content, much of it actually duplicative in nature. A few ways 
around this issue would be to either:
       (a) store AIPs as "unzipped" (so they could link to the same content 
files & avoid some content duplication), OR
       (b) generate a single AIP zip package which describes all versions of 
the Item (again that way you could avoid content file duplication). This single 
AIP zip package could either describe all versions in a single METS file, or 
potentially include a separate METS file for each version.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_d2d_nov
_______________________________________________
Dspace-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-devel

Reply via email to