Hi Michael,

First off, I'll admit, I know next-to-nothing about the Digital Commons 
export format.

In case others on this list are in a similar scenario, could you 
describe the format of the Digital Commons data export?  You said that 
it's a large Zip archive. What is the format of files or directory 
structure in that archive?

If we know a bit more about the export format, we may be able to give 
some extra tips to getting that data into DSpace.

However, in general, here are some of the ways to bulk import content 
into DSpace:

1. You can import content in bulk using the DSpace Simple Archive Format 
(essentially a directory per item): 
https://wiki.duraspace.org/display/DSDOC18/Importing+and+Exporting+Items+via+Simple+Archive+Format

2. You can import content in bulk using the (METS-based) AIP packages:
https://wiki.duraspace.org/display/DSDOC18/AIP+Backup+and+Restore

3. DSpace also supports SWORD (v1 and v2), which can be used as a way to 
submit content packages for ingestion. These are again METS-based 
packages, but store their metadata in a different format (EPrints DC XML).
https://wiki.duraspace.org/display/DSDOC18/SWORDv1+Server
https://wiki.duraspace.org/display/DSDOC18/SWORDv2+Server

In addition, there are also some third-party tools that have been 
written to help with getting data ready for bulk ingest into DSpace.

For example, there is a "SAFBuilder" tool helps you build valid DSpace 
Simple Archive folders from a Spreadsheet (CSV) and a set of content files:

https://github.com/peterdietz/SAFBuilder

The output of that SAFBuilder program can then be imported into DSpace 
by following the link under Option #1 listed above.

So, these are just a few generic options for getting content into DSpace.

However, it's very possible that someone else on this list has actually 
performed a full migration from Digital Commons to DSpace, and may have 
further hints or tips for you.

- Tim


On 5/29/2012 12:33 AM, MICHAEL QUINTON Smith wrote:
> Hi
> Our library is busy migrating from a hosted Digital Commons repository
> to an on-campus DSpace server setup on RedHat Linux. We have a 10 gig
> zipped archive with our data. We had unzipped the archive.
> My question: do we need to split this file into individual components
> before proceeding with ingesting into DSpace. Is there anyone in the
> community that had done a migration from Digital Commons to DSpace that
> can give some advice?
> Regards
>
>
> ------------------------------------------------------------------------------------------------
> Disclaimer
> This e-mail transmission contains confidential information,
> which is the property of the sender.
> The information in this e-mail or attachments thereto is
> intended for the attention and use only of the addressee.
> Should you have received this email in error, please delete
> and destroy it and any attachments thereto immediately.
> Under no circumstances will the Cape Peninsula University of Technology
> or the sender
> of this email be liable to any party for any direct, indirect,
> special or other consequential damages for any use of this e-mail.
> For the detailed e-mail disclaimer please refer to
> http://www.cput.ac.za/email.php or call *+27 (0)21 460 3911*
> ------------------------------------------------------------------------------------------------
>
>
>
> ------------------------------------------------------------------------------
> Live Security Virtual Conference
> Exclusive live event will cover all the ways today's security and
> threat landscape has changed and how IT managers can respond. Discussions
> will include endpoint security, mobile security and the latest in malware
> threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
>
>
>
> _______________________________________________
> DSpace-tech mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dspace-tech

------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to