Hi Alan,

On 1/16/2014 9:10 AM, Alan Orth wrote:
> Hi,
>
> I've got a development instance where we uploaded a few hundred items
> (in one community and several collections).  Our editors spent some time
> manually uploading bit streams to many of these items.  Now I want to
> migrate the community and its hierarchy to the production instance.  We
> can't use the CSV via "Export Metadata" because of the bit streams, so
> I've been looking at using AIP, ie:
>
> dspace packager -s -a -t AIP -e [email protected] -p 10568/0 33474.zip
>
> This works great, but the resulting items now have two of each of the
> following fields:
>      - dc.date.accessioned
>      - dc.date.available
>      - dc.identifier.uri
>
> I can't figure out a work flow that doesn't produce this effect...

These three fields are unfortunately auto-generated by DSpace whenever 
you treat an AIP as a submission information package (SIP), which is 
what the -s option. Essentially, the '-s' option assumes this is new 
content, so DSpace defines these fields as:
     * dc.date.accessioned - the date this new content was added to DSpace
     * dc.date.available - the date this new content became available in 
DSpace (i.e. finished approval workflow)
     * dc.identifier.uri - the assigned Handle for this object

For your situation, you may need to consider some metadata related 
questions.

* Does your development instance assign proper Handles?  If not, then 
you *need* Production to assign a new dc.identifier.uri.  This may mean 
that you'll have to unfortunately do some post-metadata cleanup (perhaps 
via the Bulk Metadata Editor) of the invalid "development" handles in 
the dc.identifier.uri fields. DSpace never overwrites or removes 
existing metadata.

* Do you want the "date.accessioned" and "date.available" fields to be 
set to the dates the Item was added to *development* or to *production*? 
If the latter, again, you may unfortunately need to do some 
post-metadata cleanup, as DSpace specifically *never* removes/overwrites 
existing metadata fields.


Depending on your setup/answers to your questions, there are three 
possible AIP import options I can see:

1a) Use "Restore/Replace" option instead (-r) when migrating to Production.

If you treat this as an AIP "restoration" then DSpace will skip creating 
"date.accessioned", "date.available" and "identifier.uri" fields and 
assume that the provided values in the AIPs are correct (as it assumes 
you are restoring a set of deleted objects).  WARNING: If the 
'dc.identifier.uri' in the AIP does NOT correspond to a valid Handle, 
then you will end up with invalid Handles in Production! (See next option.)

More on Restore/Replace:
https://wiki.duraspace.org/display/DSDOC4x/AIP+Backup+and+Restore#AIPBackupandRestore-Restoring/ReplacingusingAIP(s)

1b) When using "Restore/Replace", you may want/need to override some of 
the default options. For example, restoration will always assume the 
'dc.identifier.uri' is a valid Handle (so a new Handle will not be 
assigned). Restoration will also always attempt to restore an object 
under the *specified* parent object in the AIP -- so, this means if a 
Collection was under a Community with ID "123456789/1" in your 
development instance, then it will be restored under a Community of the 
*same ID* in Production

Luckily, these defaults can be overridden. See the 'ignoreHandle' and 
'ignoreParent' Advanced options documented here:

https://wiki.duraspace.org/display/DSDOC4x/AIP+Backup+and+Restore#AIPBackupandRestore-AdditionalPackagerOptions

2) The other option is to still use Submission (-s) option, but use one 
or more of the Advanced options (in 1b) to tweak the defaults.

I know this is a lot of info, but hopefully it gives you some ideas to 
go on.

- Tim

------------------------------------------------------------------------------
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments & Everything In Between.
Get a Quote or Start a Free Trial Today. 
http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Reply via email to