Re: [sword-app-tech] Documentation on packaging formats
Hi Mark, If a publisher or other service provider asks Does your repository support SWORD? I'd like to be able to tell them, Yes, in these formats that we all understand, instead of having to explain to the publisher how I want the content packaged up, or take the content in whatever format the publisher provides and figure out how to get it into the target repository. The package formats supported/accepted by the Server are listed in the Service Document provided by the server when a client first connects to it. If the client sends an unsupported package type, the server should reply with an error message stating that the that package type is not supported. Deciding which packages to support and creating packages that comply with the standards is the real problem I think. For example, I think the only way to reliably create DSpace METS SIP files is via the export function in DSpace. As for BagIt, I used the Bagger utility provided by the Library of Congress: http://sourceforge.net/projects/loc-xferutils/files/loc-bagger/ and built my DSpace ingester according to the BagIt specifications. I hope this helps! Marco - Original Message - Hi Mark, I have been using SWORDv2 with DSpace. SWORD is just a transfer protocol, it doesn't really matter what type of package you send with it, as long as the receiving SWORD server understands how to handle it. I used DSpace METS SIP, simple zip files, and binary files because the DSpace SWORDv2 server implementation supported those packages. Then, we wanted to try BagIt, so I wrote a so called ingester to let the SWORDv2 DSpace server handle BagIt packages. And since I was at it, I also made one for DataBankBagIt packages, which are not in the SWORD documentation (and also have a different namespace, http://dataflow.ox.ac.uk/package/DataBankBagIt). You can define your own package format if you want to. You can read about our work on SWORD on the blog of our project, Sustainable Management of Digital Music Research Data: http://rdm.c4dm.eecs.qmul.ac.uk/sword-tools http://rdm.c4dm.eecs.qmul.ac.uk/datastage-and-dspace Good luck! Best regards Marco -- Marco Fabiani Postdoctoral Research Assistant Centre for Digital Music School of Electronic Engineering and Computer Science Queen Mary, University of London Mile End Road, London E1 4NS, UK On 23 May 2012, at 21:40, Mark Jordan wrote: Hi, Sorry for this second n00b question to the list in less than a few weeks. Is there any public documentation on SWORD2 packaging formats? The profile uses the DSpace METS SIP and BagIt as examples. I assume that the DSpace packaging format is the one described at https://wiki.duraspace.org/display/DSPACE/DSpaceMETSSIPProfile, but is this actually the case? Does a BagIt profile actually exist or is it just used as an example? Our most immediate use case is that we am exploring using SWORD2 to move theses from our thesis management system to our Drupal-based IR. There is a SWORD1 server module for Drupal but not a SWORD2 server. I'd like to make the SWORD2 server as generic as possible in terms of deposit but am unclear on what the common packaging formats are and how they are documented. Thanks, Mark -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ sword-app-tech mailing list sword-app-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/sword-app-tech -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___ sword-app-tech mailing list sword-app-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/sword-app-tech
Re: [sword-app-tech] SWORD 2 and DSpace
Hi Richard, I having been testing the latest version of the Python libraries from your fork on bitbucket. I added 5 issues to the project, and indicated the causes. I did small changes on my local copy and everything works now, but I am not sure if my solutions are correct or just workarounds, so I prefer to leave it to you. In particular, I don't know if errors in parsing the service document and the deposit receipts are caused by the DSpace Sword2 server sending incorrect information, or by the python client. Cheers Marco On 29 Mar 2012, at 16:26, Marco Fabiani wrote: Hi Richard, I patched this one: https://bitbucket.org/beno/python-sword2 . I will test my code with your version as well, and apply my changes (if needed) to see if everything works. I will let you know. Cheers Marco On 29 Mar 2012, at 16:20, Richard Jones wrote: Hi Marco, Which version of the python-sword2 library have you patched? I have done a large iteration on it recently (not yet formally released, but soon) at: https://bitbucket.org/richardjones/python-sword2 But if your patch is for that one, or still applicable, I'd be happy to have it. You could post it to the bitbucket issue tracker for the project. Cheers, Richard On 29 March 2012 16:14, Marco Fabiani marco.fabi...@eecs.qmul.ac.uk wrote: Hi Richard, I created an issue on JIRA as Stuart suggested (https://jira.duraspace.org/browse/DS-1149). I have never used JIRA before, so I'm not quite sure how to submit a patch, but I will give it a try. On a similar subject, I also had to slightly change the python-sword2 module to make it work with edit-media. Should I submit these changes as well? Cheers Marco On 29 Mar 2012, at 16:09, Richard Jones wrote: That's brilliant, thanks for picking that up. I will apologise in advance that I probably won't do anything about this until after Easter, but it is on my list ... Cheers, Richard On 29 March 2012 15:39, LEWIS Stuart stuart.le...@ed.ac.uk wrote: Hi Marco, Thanks - submitting a patch to DSpace via JIRA would be great! - https://jira.duraspace.org/browse/DS Many thanks, Stuart -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. -Original Message- From: Marco Fabiani [mailto:marco.fabi...@eecs.qmul.ac.uk] Sent: 29 March 2012 15:37 To: Richard Jones Cc: LEWIS Stuart; sword-app-tech@lists.sourceforge.net Subject: Re: [sword-app-tech] SWORD 2 and DSpace Hi Richard and Stuart, I was looking at the BinaryContentIngester code to try to make my own ingester and I found the ORIGINAL bundle duplication bug: Interesting - that looks like a bug with the DSpace implementation (ORIGINAL bundle duplication). I have some time scheduled to work on this implementation over the next month to six weeks, so will look for this and try to put in a fix. Also, I'll look into whether the content type can be put into the bitstream format field. In BinaryContentIngester, line 138: Bundle original = null; is assigned but never used because at lines 148: Bitstream bs = item.createSingleBitstream(deposit.getInputStream()); which creates a new bundle disregarding the original bundle. I this code should solve the problem, and also add the bitstream format field: Bitstream bs = original.createBitstream(deposit.getInputStream()); BitstreamFormat format = this.getFormat(context,deposit.getFilename()); bs.setFormat(format); At least from my short testing, this works. Should I submit this as an official bug to DSpace? Cheers Marco -- Richard Jones, Founder, Cottage Labs t: @richard_d_jones, @cottagelabs w: http://cottagelabs.com -- Richard Jones, Founder, Cottage Labs t: @richard_d_jones, @cottagelabs w: http://cottagelabs.com -- This SF email is sponsosred by: Try Windows Azure free for 90 days Click Here http://p.sf.net/sfu/sfd2d-msazure___ sword-app-tech mailing list sword-app-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/sword-app-tech -- Better than sec? Nothing is better than sec when it comes to monitoring Big Data applications. Try Boundary one-second resolution app monitoring today. Free. http://p.sf.net/sfu/Boundary-dev2dev___ sword-app-tech mailing list sword-app-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/sword-app-tech
Re: [sword-app-tech] SWORD 2 and DSpace
Hi Richard and Stuart, I was looking at the BinaryContentIngester code to try to make my own ingester and I found the ORIGINAL bundle duplication bug: Interesting - that looks like a bug with the DSpace implementation (ORIGINAL bundle duplication). I have some time scheduled to work on this implementation over the next month to six weeks, so will look for this and try to put in a fix. Also, I'll look into whether the content type can be put into the bitstream format field. In BinaryContentIngester, line 138: Bundle original = null; is assigned but never used because at lines 148: Bitstream bs = item.createSingleBitstream(deposit.getInputStream()); which creates a new bundle disregarding the original bundle. I this code should solve the problem, and also add the bitstream format field: Bitstream bs = original.createBitstream(deposit.getInputStream()); BitstreamFormat format = this.getFormat(context,deposit.getFilename()); bs.setFormat(format); At least from my short testing, this works. Should I submit this as an official bug to DSpace? Cheers Marco -- This SF email is sponsosred by: Try Windows Azure free for 90 days Click Here http://p.sf.net/sfu/sfd2d-msazure ___ sword-app-tech mailing list sword-app-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/sword-app-tech