Re: [Dspace-tech] Producing mets.xml for SWORD
Mika, You may also want to try out the METS Java API (or other tools) available at http://www.loc.gov/standards/mets/mets-tools.html to see what suits you best. Scott. Date: Fri, 5 Jun 2009 14:37:33 -0400 From: Larry Stone l...@mit.edu Subject: Re: [Dspace-tech] Producing mets.xml for SWORD To: mikan.d.dspace listmail mikan.dsp...@gmail.com Cc: Dspace Tech dspace-tech@lists.sourceforge.net Message-ID: f22e2167-c9ec-4299-a7e6-2297effb0...@mit.edu Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes The Harvard METS toolkit gives you an convenient (though DOM-like) API for the METS data model, takes care of some of the housekeeping, and it has logic to prevent schema mistakes: http://hul.harvard.edu/mets/ There's also a toolkit that helps you build a whole submission package, including the METS. It ought to work with SWORD too; see: http://wiki.dspace.org/index.php/DSpace_SIP_Toolkit (If you develop any tricks or procedures for using it with SWORD, please add them to that wiki page!) -- Larry On Jun 5, 2009, at 5:25 AM, mikan.d.dspace listmail wrote: Im hoping to use SWORD in order to deposit items to DSpace. What would be the best way to produce mets-metadata for the items to be submitted? Should I write the entire metadata file myself or is there templates / frameworks available for this? Thanks, Mika -- Crystal Reports - New Free Runtime and 30 Day Trial Check out the new simplified licensing option that enables unlimited royalty-free distribution of the report engine for externally facing server and web deployment. http://p.sf.net/sfu/businessobjects ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] PDF Thumbnails, installed, but caused a lot of, trouble!
Ruzbeh, Assuming the table names haven't changed from 1.4.x and there is no bitstream management in 1.5.x, you could probably do something like: update bitstream set deleted=true where name like '%.pdf.jpg'; and then run $DSPACE/bin/cleanup to get rid of the thumb files in the assetstore You might also need to also do: delete from bundle2bitstream where bitstream_id in (select bitstream_id from bitstream where name like '%.pdf.jpg'); as I'm not sure whether the cleanup job will do this or whether this is done when a bitstream is manually deleted. Alternatively you could just write a quick java program to delete these using the API which could be safer than relying on my memory! Scott. -- Message: 1 Date: Sun, 5 Apr 2009 15:51:25 -0700 (PDT) From: Ruzbeh ruzbe...@yahoo.com Subject: [Dspace-tech] PDF Thumbnails, installed, but caused a lot of trouble! To: DSpace-tech@lists.sourceforge.net Message-ID: 409153.6872...@web38808.mail.mud.yahoo.com Content-Type: text/plain; charset=us-ascii Hi everybody I installed the mediafilter available athttp://sts.anu.edu.au/drs/downloads/index.php on 1.5.0. But during the bin/filter-media operations, handling a few items threw exceptions. The exception was caused because ImageMagick Could not convert some pdfs with special sets of fonts into jpeg! It seemed non-vital at first, but now, when I enable the browse list to show thumbnails, the jsp page throws an IOException with Format Not Supported message, while performing the method of InputStream.read() which causes a jspexception and that causes and the page remains only half-rendered. 1) I would appreciate it if somebody helped me with this issue. I tried to manually, force the filter-media to handle those bad handled items, but I didn't succeed! And then I tried to pin down every bad handled item with filter-media, in which I failed too! 2)I tried to manually delete those bitstreams generated by the filtermedia, but there are at least 400 pdf items in my database!!! Any help or suggestion regarding the removal of some bitstreams from items? any procedure who checks all the bitstreams could prove handy, because all the newly generated bitstreams end up in .pdf.jpg. Roozbeh Ebrahimi, BSc of pure Mathematics at Sharif University of Tech. -- This SF.net email is sponsored by: High Quality Requirements in a Collaborative Environment. Download a free trial of Rational Requirements Composer Now! http://p.sf.net/sfu/www-ibm-com ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] citations, journals, volumes, issues, , articles and dublin core
Hi Mark, In a DSpace context I think a major problem is lack of agreement on, or default implementation for a storage model for various classes of content (typically aggregations). If this could be achieved then value-add services could be far more easily connected - dissemination services such as Manakin (flexible themes, theme re-use, them choice, continuous improvement of targetted themes)and greater interoperability in a broader context. The technology part is solved. The standards and the tools are (mostly) there. The consistency isn't. For example, in a DSpace context there are three journal models I know of: the one we developed under APSR; TDL's journal display; and FORESITE as well as probably variants on just using DC for container relations you mention below, and who knows what Fedora or other platforms do. There are probably others. I personally don't think much progress will be made in the repository value-add service space (i.e. anything other than storage and file-level access) without content and interchange models. I'm not saying there is just one (right) way to package or store journals, but if there are more than two or three I think we just get stuck in a set of unsustainable proprietary implementations which require an equal number of unsustainable proprietary solutions in order to migrate or exchange content *in a meaningful form* over time. Fedora appear to be looking at this space in its new Content Model Architecture, but from a cursory look they are again just providing tools to achieve (a large set of unsustainable?) implementations rather than work on reference implementations for more complex object classes such as journals or other aggregations. Given the joint nature of DSpace and Fedora development and the new OAI-ORE space with its aggregations and re-use and exchange scope the technologies are available to address this but it still needs some content/metadata modelling to provide consistency for value-add service providers. Some of the work we have recently been doing involves developing METS Profiles for particular content classes which allows at least consistency in the packaging of materials for deposit to a repository (or some target process). We expect this work to continue over the coming years as we have been funded to being development of an Australian Data Commons, and this is unachievable without addressing the kind of problem you describe in you mail below. It's not a fast process. Scott. Message: 3 Date: Tue, 24 Feb 2009 09:34:20 -0500 From: Mark H. Wood mw...@iupui.edu Subject: Re: [Dspace-tech] citations, journals, volumes, issues, articlesand dublin core To: dspace-tech@lists.sourceforge.net Message-ID: 20090224143420.ga21...@iupui.edu Content-Type: text/plain; charset=us-ascii May I suggest that we should never, never, never! get used to being shocked and surprised by some of the aspects of digital libraries, but rather to the warm feeling of having done something about them. It seems to me that metadata support for journal articles, while fundamental, is the least of our worries in this area. People have been cataloging journal articles for a long time, and it shouldn't be that hard to map established practice to a mechanical representation. More urgently, I feel, we need to think a bit about how we ought to go forward. I see two roads: o adapt repository software (such as DSpace) to permit the behaviors needed in representing paper journals online o develop the most appropriate ways of organizing journal articles for online presentation, and support those ways People are used to seeing papaer journals as *ordered* collections of articles (buttered in-between with features and announcements and similar matter) as issues within *ordered* collections of issues as volumes. To go this route, we need to generalize a bit so that we can create new subclasses of containment in addition to communities and collections and items. Order hasn't received enough attention in the design of DSpace. But is that really the most useful way of organizing journal articles? It's important to keep the association with the journal brand, and it's valuable to preserve the binding to time, but I don't think that people read volumes or journals; even if they habitually read from cover to cover, what they read is articles and features and whatnot. Is there really a need for volume and issue to be structural elements at all? If we simply tag items with these attributes as metadata, and provide top-quality tools for searching on these attributes (as well as others), won't that serve? If we have a submission process that can deal efficiently with repetitive metadata when submitting items which are related, won't that be sufficient? How do readers, and catalogers, *want* to do their work? How close can we reasonably get? We have a new medium to work with, and we should be
Re: [Dspace-tech] How to order the articles within an issue
Hi Andrew, Your suggested approach using collections for issues and (sub-)community for journals is the approach we took. Unfortunately the hosted journal that intended to go into production didn't eventuate so it isn't implemented in our production repository but there is a screencast (http://www.apsr.edu.au/ore/) and a more recent presentation (http://www.apsr.edu.au/open_access_publishing/yeadon%20ojs-sword.pdf) you can look at which helps illustrate the approach we took. For ordering the items within an issue an OAI-ORE aggregation is generated by a DSpace journal METS ingestion module. We've done a little more work on this late last year, but the above links give you an idea of the process. If you want further info, feel free to contact me off list, as I don't think we've posted the lastest source anywhere. The downside to all these different approaches for journal storage in DSpace is that Manakin themes can't be shared, but I think to achieve theme sharing beyond simple objects needs some co-ordinated DSpace storage modelling of compound objects and community agreement which is no small task. Good luck! Scott. -- Message: 3 Date: Thu, 22 Jan 2009 15:35:18 -0600 From: Scott Phillips scott.a.phill...@gmail.com Subject: Re: [Dspace-tech] How to order the articles within an issue To: Diggory Mark mdigg...@gmail.com Cc: dspace-tech@lists.sourceforge.net dspace-tech@lists.sourceforge.net Message-ID: 6254f291-71aa-451d-864a-9d18a2505...@gmail.com Content-Type: text/plain; charset=us-ascii You may want to look at a Journal collection we've done. It doesn't completely address the problem you're asking about but is in the same sphere, and i second what Mark said, it is likely best to not use communities and collections in that manor. http://repositories.tdl.org/tdl/handle/2249.1/5065 Here is the theme used to generate that view is located here: http://repositories.tdl.org/tdl/themes/TDL/Periodicals/sitemap.xmap http://repositories.tdl.org/tdl/themes/TDL/Periodicals/Periodicals.xsl The trick we used is to add xml describing journal issues in the metadata of the collection, then the theme knows to interpret this small bit of data to build a browsable list of journal issues. Each of those provide links to a masked search page for individual articles, plus a list to an item that contains the full pdf for the journal. Here is the current bit of XML on the collection: issues xmlns:tdl=http://www.tdl.org/NS/tdl; xmlns=http://www.tdl.org/NS/tdl issue vol=66 num=3amp;4 year=2008 name= handle=2249.1/5643/ issue vol=66 num=1amp;2 year=2008 name= handle=2249.1/5546/ issue vol=65 num=3amp;4 year=2007 name= handle=2249.1/5496/ issue vol=65 num=1amp;2 year=2007 name= handle=2249.1/5438/ issue vol=64 num=3amp;4 year=2006 name= handle=2249.1/5439/ issue vol=64 num=1amp;2 year=2006 name= handle=2249.1/5440/ issue vol=63 num=3amp;4 year=2005 name= handle=2249.1/5441/ issue vol=63 num=1amp;2 year=2005 name= handle=2249.1/5442/ issue vol=62 num=3amp;4 year=2004 name= handle=2249.1/5443/ issue vol=62 num=1amp;2 year=2004 name= handle=2249.1/5444/ issue vol=61 num=3amp;4 year=2003 name= handle=2249.1/5445/ issue vol=61 num=1amp;2 year=2003 name= handle=2249.1/5446/ issue vol=60 num=3amp;4 year=2002 name= handle=2249.1/5447/ issue vol=60 num=1amp;2 year=2002 name= handle=2249.1/5448/ /issues Scott-- On Jan 22, 2009, at 2:08 PM, Diggory Mark wrote: Andrew, I might recommend avoiding using Collections for Issues, mostly because then your going to end up with lots of the them, and that tends to make the interface both difficult to navigate and maintain. Another possibility to consider, instead, add the issue detail either to the Item directly, or possibly create separate Items to hold the issue metadata (relating them to one another in dc.relation.isPartOf/hasPart fields). Thus: Society = Community Journal = Collection Issue and its Errata: Held in Issue Item Article and its Errata: Held in Article Item You might then customize your presentation and search to Group Articles under Issues. -Mark On Jan 22, 2009, at 1:41 AM, Andrew Marlow wrote: I am trying to set up a DSpace that models societies, journals, issues and articles. Each article is an item. The collection that contains the items is an issue. The journal that publishes the issue(s) is a sub-community, and the society that publishes the journal(s) is a community. My question is, how do I make it so that the collection orders the items the way I want? An issue has a particular order for the articles because it relates to the physical printed version. That's the order I want for the collection also. I think collections are ordered by publish date (please correct me if I'm wrong about that). That's not good enough for me since an issue has a
Re: [Dspace-tech] DSpace-tech Digest, Vol 28, Issue 57
Ameer, It's old, but you can find a mediafilter at http://sts.anu.edu.au/drs/downloads/index.php, requires ImageMagick Scott. Message: 3 Date: Mon, 25 Aug 2008 14:27:55 -0400 From: d space [EMAIL PROTECTED] Subject: [Dspace-tech] How to view preview/thumbnails of pdf documents To: dspace-tech@lists.sourceforge.net Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=iso-8859-1 HI All, I have Dspace 1.4.2 installed on my server and I am able to view thumbnail images of JPEG files but would like to know if there is any way of presenting thumbnail/preview images of PDF documents. Does anyone have any idea of how to enable this? Best Regards, Ameer - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] APSR software
Hi All, With APSR project winding down some proof-of-concept work has been posted on the APSR website for anyone who wants to take a look. This work is in the repository interoperability space. Rather than repeat the web site blurb refer to the follow web pages: Australian METS Profile: http://www.apsr.edu.au/nla-mets/index.htm Repository Submission Service: http://www.apsr.edu.au/submission_service/index.htm Manakin Theme Editor (also comes bundled with Submission Service): http://www.apsr.edu.au/dissemination_service/index.htm Note that if experimenting with the Manakin theme editor I'd suggest pointing it at an image collection, as the conference and journal collections have specific requirements as to how these materials are stored. Some of the code from this will be incorporated in Fez 2.0 and is applicable to any repository which can generate DRI-compliant XML. While we can't commit to further development of the latter two software packages (all was developed under funded project monies), we will be attempting to build on work delivered under APSR in future projects. I'm happy to take any queries off-list regarding these packages and further development of them. Scott. - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] importing a collection of files with identical names (from different directories)
Rowan Brownlee wrote: That's very useful information. Thanks Scott. The metadata that I'm importing from a spreadsheet contains latitude longitutude information which I'm using to generate kml files (for import to dspace along with the other data files). I'm not familiar with manakin but I'm assuming that at some stage we might use it to provide a customised user interface to the collection that would include display of google-map-like features. Yes, Leo does this for bidwern but uses Manakin (cocoon) to generate the kml on the fly rather than archive it. From what you say, it sounds like it could be useful to include path information in the metadata. Do you have any thoughts on best options for doing this? E.g. would I add a repeating custom metadata text field containing an entry for each file? You could do that, it depends what the metadata represents as to how you map it e.g. does it represent subject, discipline, etc or is it fairly arbitrary. Also - I'm not familiar with how dspace arranges file storage. Would the file path metadata entry be a replication of the entry as recorded in the contents file? The path used in the contents file would be stored as metadata with each bitstream. Each bitstream however is renamed by DSpace and stored in the DSpace assetstore, so the file path, once in DSpace only exists as metadata. Bye rowan - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now http://get.splunk.com/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] stand-alone Handle Server
Hi John, This won't help much now, but one of the things I want to look at next year is using DSpace with the PILIN software (http://www.arrow.edu.au/PILIN). PILIN software is essentially a wrapper around the handle server code and can supposedly be run as a local handle server for minting, updating and removing handles. The work Jim R is doing on the identifier code looks like it will be of great help to this. Scott. Date: Thu, 23 Aug 2007 14:54:55 -0400 From: John Davison [EMAIL PROTECTED] Subject: [Dspace-tech] stand-alone Handle Server To: dspace-tech@lists.sourceforge.net Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=us-ascii Hello All, I'm investigating the use of a stand-alone remote Handle Server instead of the one bundled with DSpace. Robert Tansley and the China Digital Museum Project released a patch on SourceForge (Enable Use Of Remote Handle Server: https://sourceforge.net/tracker/index.php?func=detailaid=1272731group_id=1 9984atid=319984) but the file was last updated on 9-28-2005. Its summary lists several outstanding issues, including batch handle generation for bulk import and suffix sequencing. I'm curious if this is the currently accepted solution, or if there are other, more recent options available. Thanks in advance, John Davison Ohio Library Information Network - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now http://get.splunk.com/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] Item Display as per the Order of Ingestion
Hi Jayan, You'd need to make modifications to handle the ordering somehow through metadata (e.g. ensuring they're in order by adding 01, 02 etc to the titles) or some db table to handle ordering and couple that with a custom jsp tag. It all gets a bit messy though and we've found this is better done through Manakin. In a couple of weeks we're deploying some software packages which among other things include a DSpace package ingester, Manakin theme and OJS-to-METS stylesheets for submitting and handling OJS journals in DSpace. After some production testing we'll be making this available via Sourceforge as part of APSR. At this stage we're looking at sometime late Sept. In this instance the journal ordering is maintained via a manifest object (RDF file). Scott. Date: Fri, 3 Aug 2007 14:42:52 +0800 From: Jayan Chirayath Kurian [EMAIL PROTECTED] Subject: [Dspace-tech] Item Display as per the Order of Ingestion To: dspace-tech@lists.sourceforge.net Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=us-ascii Hi! In a Journal we have sequential list of items. Is there a way to display items in a DSpace collection according to the sequential order in which they appear in the journal? By default dspace items are displayed alphabetically taking into consideration the title metadata. Please suggest. Thanks, Jayan -- next part -- An HTML attachment was scrubbed... -- - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now http://get.splunk.com/ -- ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech End of DSpace-tech Digest, Vol 16, Issue 7 ** - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now http://get.splunk.com/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] RAMP project beta software
Hi All, FYI - a beta release of the mura software developed under the RAMP project has been made available at http://drama.ramp.org.au/cgi-bin/trac.cgi. While the GUI component is Fedora related, the authors suggest that the other modules are applicable to repositories in general. However they stipulate that this is only true where the authorisation and authentication mechanisms are not tightly coupled withint the repository core, which is not currently true in DSpace but presumably where we want to end up. Anyone looking into the auth* management for DSpace at the moment may want to have a look (it's on my todo list, but that's a long list!). Scott. - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now http://get.splunk.com/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] [Dspace-devel] Implicit authentication methods
Hi Mark, I got around it for now just be re-processing the request and seems to work fine (see http://sourceforge.net/mailarchive/message.php?msg_name=46786482.1020808%40anu.edu.au for details), maybe that's a quick fix that may help. I'm not sure that's a real fix though. The throwing of the exception from the Bitstream class wrestles away control of the request flow so I guess that's where it should properly be fixed up. I haven't really had time to do further investigation as yet. I suspect this was coded before the boolean test for authorisation was available or for some other reason such as automated login not being part of the initial implementation. Scott. Mark Diggory wrote: Scott, We observe this bug in our production systems and need to find the fix as well. We have an implicit MIT Certificate Authentication thats failing to work directly against bitstreams because of it. -Mark On Jun 18, 2007, at 2:20 AM, Scott Yeadon wrote: OK, it's not null, the authentication succeeds, but since the code drops down into the AuthorizationException handling of the DSpaceServlet.java, there is no redirection (implicit authentication) so it bombs out. Is this a bug? Scott. Scott Yeadon wrote: Hi All, I have a stackable authentication class which obtains credentials from a request. When I access a protected bitstream, the auth class works (credentials are parsed and all ok) however the context user is still null although setCurrentUser() is called before returning the SUCCESS status. The flow is: - attempt to access protected bitstream - AuhorizeAction happens, fails, throws AuthorizeException - Authenticate.startAuthentication occurs and my class in invoked with success - After return from startAuthentication the currentUser is still null (even though explicitly being set in the authenticate method), so the request fails with access denied to user 0 (default value when getCurrentUser() is null). As my class is implicit authentication there is no redirection either. I don't understand why this is null, the log message shows: 2007-06-18 14:31:47,032 INFO org.dspace.app.webui.servlet.DSpaceServlet @ myuser:session_id=AB51545186B04E419B63AD9FF140C7BF:ip_addr=150.203.2.97:authorize_error:org.dspace.authorize.AuthorizeException: Authorization denied for action READ on BITSTREAM:32978 by user 0 showing that the myuser user is the current user (at least in the logHeader!) but not in the context object. Anyone done this before? My code is almost identical to the X509 auth, the only real difference being where the credentials are taken from. There are no problems with the credentials or request itself, just something I'm missing with the auth flow I suspect. Scott. - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ Dspace-devel mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/dspace-devel ~ Mark R. Diggory - DSpace Systems Manager MIT Libraries, Systems and Technology Services Massachusetts Institute of Technology Office: E25-131 Phone: (617) 253-1096 - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] Implicit authentication methods
Hi All, I have a stackable authentication class which obtains credentials from a request. When I access a protected bitstream, the auth class works (credentials are parsed and all ok) however the context user is still null although setCurrentUser() is called before returning the SUCCESS status. The flow is: - attempt to access protected bitstream - AuhorizeAction happens, fails, throws AuthorizeException - Authenticate.startAuthentication occurs and my class in invoked with success - After return from startAuthentication the currentUser is still null (even though explicitly being set in the authenticate method), so the request fails with access denied to user 0 (default value when getCurrentUser() is null). As my class is implicit authentication there is no redirection either. I don't understand why this is null, the log message shows: 2007-06-18 14:31:47,032 INFO org.dspace.app.webui.servlet.DSpaceServlet @ myuser:session_id=AB51545186B04E419B63AD9FF140C7BF:ip_addr=150.203.2.97:authorize_error:org.dspace.authorize.AuthorizeException: Authorization denied for action READ on BITSTREAM:32978 by user 0 showing that the myuser user is the current user (at least in the logHeader!) but not in the context object. Anyone done this before? My code is almost identical to the X509 auth, the only real difference being where the credentials are taken from. There are no problems with the credentials or request itself, just something I'm missing with the auth flow I suspect. Scott. - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] [Dspace-general] DSpace `Dublin Core' | Date Issued | Date Range | How to represent
Hi Richard, Hello Scott, Thanks for your note. On Tue, 2007-06-12 at 12:06, Scott Yeadon wrote: Hi Richard, It's up to you how you represent your values, you could use the DCMI Period or something simple such as 1930-1940. We tend to have the latter since that's what our users typically enter. The batch import process won't parse the values, as long as the document is valid XML the values will be accepted. Granted that using something such as: dcvalue element=date qualifier=issued1964-1970/dcvalue in one's `dublin_core.xml' file seems practical and expedient, on my system at least -- DSpace-1.4.0 -- such an approach breaks DSpace's `Browse by Title', `Browse by Date', and the offending item's `Brief View'. This is the reason I first asked the lists for details of how one should _correctly_ represent a date range in the DSpace `dublin_core.xml' file. Using `1964-1970' and so on simply does not seem to work. It's likely that this is because the default metadata display is not able to render date ranges properly. In your DSpace config file put the following entry: webui.itemdisplay.default = dc.title, dc.title.alternative, dc.contributor.*, dc.subject, dc.date.issued, dc.publisher, dc.identifier.citation, dc.relation.ispartofseries, dc.description.abstract, dc.description, dc.identifier.govdoc, dc.identifier.uri(link), dc.identifier.isbn, dc.identifier.issn, dc.identifier.ismn, dc.identifier The date.issued field is by default formatted to a date (see ItemTag.java for the hardcoded list) using the dc.date.issued(date) field display text. Removing the (date) part of this will stop any special rendering taking place. Also, setting: webui.itemlist.columns = dc.date.issued, dc.title, dc.contributor.* in the dspace.cfg file may also resolve your ranges not showing up in the browse page (the default specifies dc.date.issued(date)), so as above removing the rendering rules should fix this) There should also be a section in the docs covering the above config items you could have a look at, since you may also want to customise your simple view and browse columns. I have put together a series of screenshots to indicate the issues: http://indica-et-buddhica.org/sections/repositorium-preview/known-issues/dspace-item-date-ranges As you will see, I am - unhappily - coming to the conclusion that DSpace does not support item date ranges at all. It is also becoming clear that the lack of genuine validation by the item importer can easily lead to the widespread corruption of ones metadata. I hope I am wrong as these would be serious deficiencies. I imagine the reason validation is not included is that DSpace in general doesn't set down value format rules, since every institution is likely to have differing requirements. Having said that, there is no easy plugin to set your own rules, you would have to modify the import process to support this if you needed it, or pre-process your material prior to loading to apply any rules you need enforced. Hope that helps. Scott. Best regards, Richard Mahoney Scott. Message: 3 Date: Fri, 08 Jun 2007 12:08:16 +1200 From: Richard MAHONEY [EMAIL PROTECTED] Subject: [Dspace-general] DSpace `Dublin Core' | Date Issued | Date Range | How to represent To: DSpace Tech dspace-tech@lists.sourceforge.net,DSpace General [EMAIL PROTECTED] Message-ID: [EMAIL PROTECTED] Content-Type: text/plain Dear List Members, I am in the process or preparing material for bulk import and have again encountered and issue that I was inclined to gloss over last time it arose: the format of the DSpace Dublin Core Date Elements, Qualifiers, and particularly, the Values. What exactly is the required Value format and is it configurable? Simple date Values such as the following present no difficulty: dcvalue element=date qualifier=issued1970/dcvalue The trouble for me -- and this situation would arise often for many projects -- is how to correctly represent date ranges, for e.g., date issued, 1964 to 1970. Which Value format should should be used to represent a date range in DSpace DC? Some DSpace version of the W3C-DTF/ISO 8601 scheme? http://dublincore.org/documents/2000/07/28/dcmi-period/ Best regards, Richard Mahoney - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] chinese search
You need to set your search.analyzer = org.apache.lucene.analysis.cn.ChineseAnalyzer in the dspace.cfg file and run index-all. Note that we use org.apache.lucene.analysis.standard.StandardAnalyzer which seems to handle a mix of Chinese and English quite well. Scott. Date: Tue, 17 Apr 2007 16:21:31 +0800 (CST) From: =?GBK?B?zMbV1+f5?= [EMAIL PROTECTED] Subject: [Dspace-tech] chinese search To: dspace-tech@lists.sourceforge.net Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=gbk hi, My DSpace version is 1.4.1, there is a search configuration in the file dsapce.cfg, I let the chinese Analyzer available ,but it doesn't work, who knows how to make chinese search available? [EMAIL PROTECTED] thank you! Shanghai JiaoTong University Library Evan -- next part -- An HTML attachment was scrubbed... -- - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] dspace repositories: exposing MPEG-21 DIDL metadata using OAI-PMH?
Hi Alan, Configure your oaicat.properties with: Crosswalks.didl=org.dspace.app.oai.DIDLCrosswalk Uncomment the |oai.didl.maxresponse| item in |dspace.cfg | That's all you need to do (plus restart Tomcat) to expose DIDL packages, using metadataPrefix=didl Verify the Crosswalk is activated by accessing a URL such as |http://mydspace/dspace-oai/request?verb=ListRecordsmetadataPrefix=didl | There is a section in the Configuration section of the 1.4.1 document that covers this (under the Activating Additional OAI-PMH Crosswalks section cid:part1.03080902.06090706@anu.edu.au) that you should also take a look at. Scott. Date: Mon, 19 Mar 2007 17:26:16 -0700 From: Cornish, Alan K [EMAIL PROTECTED] Subject: [Dspace-tech] dspace repositories: exposing MPEG-21 DIDL metadatausing OAI-PMH? To: dspace-tech@lists.sourceforge.net Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=us-ascii My site recently upgraded its DSpace instances to 1.4.1. The DSpace 1.4 release info (URL http://mailman.mit.edu/pipermail/dspace-general/2006-July/001064.html) indicates that it's possible to expose METS and MPEG-21 DIDL DIPs via OAI-PMH. At my site, we have been testing both mets and didl. Retrieving mets records using OAI-PMH verbs has been working well. However, I've been unable to retrieve records using didl. I'm interested in communicating with staff or techs at a DSpace site that has successfully exposed DSpace didl metadata using OAI-PMH. What steps were needed to expose records in this extended format? Any info would be appreciated, thanks! Al Cornish Head, Library Systems Washington State University Libraries Pullman, WA 99164-5610 509.335.1895 (voice) [EMAIL PROTECTED] (e-mail) -- next part -- An HTML attachment was scrubbed... -- Message: 4 Date: Tue, 20 Mar 2007 13:08:12 +0100 From: [EMAIL PROTECTED] Subject: [Dspace-tech] Cataloguing web resources To: dspace-tech@lists.sourceforge.net Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=ISO-8859-1; format=flowed Hallo, I would like to know if it is possible with DSpace to catalogue web resources (links to web pages or other digital resources). Essentially I need to catalogue URLs without having to upload files. I know that an instrument dedicated to this purpose is Scout Portal Toolkit (http://scout.wisc.edu/Projects/SPT/), but I'm interested also in classifying other documents too. Anyone knows how or if it's possible with DSpace? Thanks, Mauro -- Message: 5 Date: Tue, 20 Mar 2007 12:41:54 + From: orlando carvalho [EMAIL PROTECTED] Subject: [Dspace-tech] using SRW client on Dspace To: dspace-tech@lists.sourceforge.net Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=iso-8859-1 Hi there! We put an SRW client working properly! ## import java.net.*; import org.apache.axis.*; import gov.loc.www.zing.srw.SearchRetrieveRequestType; import gov.loc.www.zing.srw.SearchRetrieveResponseType; import gov.loc.www.zing.srw.interfaces.SRWPort; import gov.loc.www.zing.srw.srw_sample_service.SRWSampleServiceLocator; /* * Created on 8/Mar/2007 * * To change the template for this generated file go to * Windowgt;Preferencesgt;Javagt;Code Generationgt;Code and Comments */ //response.getEchoedSearchRetrieveRequest().getQuery() /** * @author dspace * * To change the template for this generated type comment go to * Windowgt;Preferencesgt;Javagt;Code Generationgt;Code and Comments */ public class cliente_java { public static void main(String[] args) { try{ SRWSampleServiceLocator service = new SRWSampleServiceLocator(); URL url = new URL(http://localhost:8080/SRW/search/DSpace;); SRWPort port = service.getSRW(url); SearchRetrieveRequestType request = new SearchRetrieveRequestType(); request.setQuery(computer); SearchRetrieveResponseType response = port.searchRetrieveOperation(request); //Object i = response.getResultSetId(); System.out.println(Resultados:+ response.getEchoedSearchRetrieveRequest().getQuery()); } catch (Exception e){ e.printStackTrace(); } } } ## It returns: Results:1(this is the number of records finded) Results:name (this is the author or the title of the item) Our question is, how can we change this code to the answer to our request returns the full description of the items stored on Dspace? Thanks for the help. - Take Surveys. Earn Cash. Influence the Future of IT Join
Re: [Dspace-tech] DSpace-tech Digest, Vol 10, Issue 45
Hamish, Check out http://www.mail-archive.com/dspace-tech@lists.sourceforge.net/msg00182.html, it's supposed to be an optional dependency but isn't mentioned in the PDFBox docs. Scott. Date: Mon, 19 Feb 2007 15:51:35 +1030 From: Brett, Hamish [EMAIL PROTECTED] Subject: [Dspace-tech] Filter-media error To: dspace-tech@lists.sourceforge.net Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=us-ascii Hi, Ever since upgrading to 1.4.1 when I run filter-media the following error ERROR filtering, skipping bitstream #1584 java.io.IOException: Invalid header signature; read 290763650945099227, expected -2226271756974174256 java.io.IOException: Invalid header signature; read 290763650945099227, expected -2226271756974174256 at org.apache.poi.poifs.storage.HeaderBlockReader.init(HeaderBlockReader. java:88) at org.apache.poi.poifs.filesystem.POIFSFileSystem.init(POIFSFileSystem.j ava:83) at org.textmining.text.extraction.WordExtractor.extractText(WordExtractor.j ava:48) at org.dspace.app.mediafilter.WordFilter.getDestinationStream(WordFilter.ja va:97) at org.dspace.app.mediafilter.MediaFilter.processBitstream(MediaFilter.java :155) at org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilte rManager.java:327) at org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterMana ger.java:296) at org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilt erManager.java:266) at org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems(Media FilterManager.java:234) at org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.ja va:185) SKIPPED: bitstream 997 because '2000 J Thermal Stress (One-sided repair).PDF.txt' already exists SKIPPED: bitstream 2833 because 'iRoom at DSTO.pdf.txt' already exists SKIPPED: bitstream 2835 because 'Network Enabled Warfare4.pdf.txt' already exists SKIPPED: bitstream 2837 because 'DORC99-Lin-Zhang.PDF.txt' already exists SKIPPED: bitstream 2839 because 'icota98.pdf.txt' already exists Exception in thread main java.lang.NoClassDefFoundError: org/bouncycastle/jce/provider/BouncyCastleProvider at org.pdfbox.pdmodel.PDDocument.openProtection(PDDocument.java:905) at org.pdfbox.pdmodel.PDDocument.decrypt(PDDocument.java:489) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:204) at org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:149) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java :110) at org.dspace.app.mediafilter.MediaFilter.processBitstream(MediaFilter.java :155) at org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilte rManager.java:327) at org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterMana ger.java:296) at org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilt erManager.java:266) at org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems(Media FilterManager.java:234) at org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.ja va:185) Any ideas? Thanks Hamish -- next part -- An HTML attachment was scrubbed... -- - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] DSpace not indexing MS Powerpoint files?
Pan, You'll need to write your own media filter class to handle the extraction of text from PowerPoint files as ppt text extraction isn't currently supported by the default set of media filters. Hopefully someone may have already done this and will share, but if not you'll have to write your own using OpenOffice or some other means. Scott. Date: Wed, 31 Jan 2007 15:07:21 -0800 From: Pan Family [EMAIL PROTECTED] Subject: [Dspace-tech] DSpace not indexing MS Powerpoint files? To: dspace-tech@lists.sourceforge.net Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=iso-8859-1 Hi, I submitted a MS ppt file to my collection, but filter-media does not want to index this ppt file. I tried to shut down the database (PostgreSQL) and restarted it, and ran filter-media several times, but it did not help. I made sure that this ppt file is indeed in the collection by openning it using View/Open. I have no problem indexing MS Word, text, html, or pdf files. Do I need to do anything special for ppt files? Thanks a lot! -Pan - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier. Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech