Hi Zen, We have done this migration recently, taking a standalone subject-based EPrints repository and importing it into our DSpace instance, where it now lives as one community in a campus-wide IR.
We have some scripts on Github [1], but these were not created with the intention that they would be anything more than a one-off solution to our particular problem, so I'm not sure how much help they would be to you. I suppose it depends on the requirements of your migration project. Our approach was to do what Tim described, essentially following the Extract-Transform-Load pattern. (1) For extraction, for the most part we just pulled data from the Dublin Core representation of the metadata that was exposed by the existing EPrints instance. If I remember correctly there was just one thing we needed to pull from the database directly, and that was easy to integrate by matching on the EPrints ID. (2) Next, we wrote some scripts that performed a custom crosswalk of this Dublin Core metadata to the metadata format required by our DSpace. This was pretty straightforward because DSpace also uses DC metadata, though there were things that needed to be mapped over (specifically the type vocabulary did not match up exactly between the two versions of DC). We also used this opportunity to bring the data into closer alignment with the standards and practices of our IR. One of the challenges for us was that the existing EPrints repository had included many external links (without the original files), and a large number of those links were broken. We ended up updating the links that were permanent redirects but excluding broken links from our import package unless the object also had a copy of the binary. (3) Finally, we had scripts to assemble a package in Simple Archive Format for import to DSpace. If any of this is of interest I'm happy to answer specific questions. Best, Josh Westgard University of Maryland Libraries [1] https://github.com/jwestgard/eprints2dspace On Friday, February 7, 2020 at 4:56:47 AM UTC-5, zen zenitram wrote: > > good day > > Is the way to migrate eprints to dspace and get all the data that stored > in eprints? > > > > Thank you! > -- All messages to this mailing list should adhere to the DuraSpace Code of Conduct: https://duraspace.org/about/policies/code-of-conduct/ --- You received this message because you are subscribed to the Google Groups "DSpace Technical Support" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/dspace-tech/3c4eaef4-f492-40fa-8764-3ec154f81f6b%40googlegroups.com.
