Hi Michael,

I'm copying back in the dspace-tech listserv, as others may also be able to assist. Plus, keeping these discussions on the list can help others in the future, who may be searching the list archives for an answer to the same question.

Unfortunately, I'm not entirely sure what is in each file in the "DKArchive" folder. For example, what are the "00" and "01" files you mentioned? Or are those folders? Also, what format of metadata (e.g. Dublin Core, MODS/METS) is in the "metadata.xml" file?

At a glance, this DKArchive folder looks similar in structure to the DSpace Simple Archive Format described here:
https://wiki.duraspace.org/display/DSDOC18/Importing+and+Exporting+Items+via+Simple+Archive+Format

However, it doesn't look like it's exactly the same format.

The reason you got errors when you attempted to import this Zip file using the AIP tools is that this DKArchive Zip file does not seem to be a valid DSpace AIP. The DSpace AIP format is METS-based and described here:
https://wiki.duraspace.org/display/DSDOC18/DSpace+AIP+Format

Essentially, your next step should be to determine how best to transform this DKArchive into a valid DSpace import format. Currently, it looks like it's *closest* to the Simple Archive Format (as described above), but it all depends on the format of the metadata stored in the "metadata.xml" file. It's possible you'd need to transform that "metadata.xml" file into Dublin Core metadata (if it isn't already).

Sorry I cannot be of more immediate help. Again, I'm just not familiar with this export format. Hopefully there are ways you can easily transform it into one of the valid DSpace import formats.

- Tim

On 5/30/2012 10:05 AM, MICHAEL QUINTON Smith wrote:
> Hi Tim
> Thank you for your response. The format of the file is:
> DKArchive folder
> ----- folder name
> -------- 00
> -------- metadata.xml
> -------- content.pdf
> -------- 01
> -------- metadata.xml
> -------- some content.pdf
>
> I am a librarian and one of the web developers is assisting me. Please
> find attached the error log that was generated when we tried to unzip
> the archive. Thanks for the URL references; I will follow them up.
> Regards
>
> >>> Tim Donohue <[email protected]> 2012/05/29 04:51 PM >>>
> Hi Michael,
>
> First off, I'll admit, I know next-to-nothing about the Digital Commons
> export format.
>
> In case others on this list are in a similar scenario, could you
> describe the format of the Digital Commons data export? You said that
> it's a large Zip archive. What is the format of files or directory
> structure in that archive?
>
> If we know a bit more about the export format, we may be able to give
> some extra tips on getting that data into DSpace.
>
> However, in general, here are some of the ways to bulk import content
> into DSpace:
>
> 1. You can import content in bulk using the DSpace Simple Archive Format
> (essentially a directory per item):
> https://wiki.duraspace.org/display/DSDOC18/Importing+and+Exporting+Items+via+Simple+Archive+Format
>
> 2. You can import content in bulk using the (METS-based) AIP packages:
> https://wiki.duraspace.org/display/DSDOC18/AIP+Backup+and+Restore
>
> 3. DSpace also supports SWORD (v1 and v2), which can be used as a way to
> submit content packages for ingestion. These are again METS-based
> packages, but store their metadata in a different format (EPrints DC XML).
> https://wiki.duraspace.org/display/DSDOC18/SWORDv1+Server
> https://wiki.duraspace.org/display/DSDOC18/SWORDv2+Server
>
> In addition, there are also some third-party tools that have been
> written to help with getting data ready for bulk ingest into DSpace.
>
> For example, there is a "SAFBuilder" tool that helps you build valid DSpace
> Simple Archive folders from a spreadsheet (CSV) and a set of content files:
>
> https://github.com/peterdietz/SAFBuilder
>
> The output of that SAFBuilder program can then be imported into DSpace
> by following the link under Option #1 listed above.
>
> So, these are just a few generic options for getting content into DSpace.
>
> However, it's very possible that someone else on this list has actually
> performed a full migration from Digital Commons to DSpace, and may have
> further hints or tips for you.
>
> - Tim
>
>
> On 5/29/2012 12:33 AM, MICHAEL QUINTON Smith wrote:
> > Hi
> > Our library is busy migrating from a hosted Digital Commons repository
> > to an on-campus DSpace server set up on RedHat Linux. We have a 10 gig
> > zipped archive with our data. We have unzipped the archive.
> > My question: do we need to split this file into individual components
> > before proceeding with ingesting into DSpace? Is there anyone in the
> > community who has done a migration from Digital Commons to DSpace and
> > can give some advice?
> > Regards
> >
> > ------------------------------------------------------------------------
> > Disclaimer
> > This e-mail transmission contains confidential information,
> > which is the property of the sender.
> > The information in this e-mail or attachments thereto is
> > intended for the attention and use only of the addressee.
> > Should you have received this email in error, please delete
> > and destroy it and any attachments thereto immediately.
> > Under no circumstances will the Cape Peninsula University of Technology
> > or the sender
> > of this email be liable to any party for any direct, indirect,
> > special or other consequential damages for any use of this e-mail.
> > For the detailed e-mail disclaimer please refer to
> > http://www.cput.ac.za/email.php or call *+27 (0)21 460 3911*
> > ------------------------------------------------------------------------
> >
> > _______________________________________________
> > DSpace-tech mailing list
> > [email protected]
> > https://lists.sourceforge.net/lists/listinfo/dspace-tech
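
As a rough illustration of the transformation discussed in this thread, here is a minimal Python sketch that copies a DKArchive-style folder into a Simple Archive Format layout (one directory per item, each holding a "dublin_core.xml" file, a "contents" file listing the bitstreams, and the content files themselves). It rests on several assumptions that the thread does not confirm: that "00" and "01" are folders, that each one holds a "metadata.xml" plus its content files, and that you are pointing the script at the level directly above them. Because the format of "metadata.xml" is unknown, the sketch does not parse it; instead it takes a hypothetical `extract_fields` callback that you would have to write yourself to return (element, qualifier, value) triples. The names `build_saf` and `write_dublin_core` are likewise made up for this example.

```python
import shutil
import xml.etree.ElementTree as ET
from pathlib import Path


def write_dublin_core(fields, target):
    """Write (element, qualifier, value) triples as a SAF dublin_core.xml file."""
    root = ET.Element("dublin_core")
    for element, qualifier, value in fields:
        # SAF uses qualifier="none" for unqualified Dublin Core elements.
        dcvalue = ET.SubElement(
            root, "dcvalue", element=element, qualifier=qualifier or "none"
        )
        dcvalue.text = value
    ET.ElementTree(root).write(str(target), encoding="utf-8", xml_declaration=True)


def build_saf(source_dir, dest_dir, extract_fields):
    """Copy each item folder under source_dir into a SAF item folder in dest_dir.

    ASSUMPTION: every item folder (e.g. "00", "01") contains a metadata.xml
    and one or more content files. extract_fields is a hypothetical,
    user-supplied function mapping the metadata.xml path to a list of
    (element, qualifier, value) triples.
    """
    source, dest = Path(source_dir), Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    created = []
    for item in sorted(p for p in source.iterdir() if p.is_dir()):
        metadata = item / "metadata.xml"
        if not metadata.is_file():
            continue  # skip folders that do not look like items
        out = dest / f"item_{len(created):04d}"
        out.mkdir(exist_ok=True)
        write_dublin_core(extract_fields(metadata), out / "dublin_core.xml")
        # Copy every non-metadata file and list it in the "contents" file.
        names = []
        for f in sorted(item.iterdir()):
            if f.is_file() and f.name != "metadata.xml":
                shutil.copy2(f, out / f.name)
                names.append(f.name)
        (out / "contents").write_text(
            "".join(name + "\n" for name in names), encoding="utf-8"
        )
        created.append(out)
    return created
```

You would call something like `build_saf("DKArchive/folder name", "saf_out", my_extract_fields)`, where `my_extract_fields` is your own function, and then import the resulting directory by following the Simple Archive Format documentation linked above. Testing on a handful of items first is probably wise, since the real metadata may need more mapping work than this sketch suggests.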

