Re: [Dspace-tech] Producing mets.xml for SWORD

2009-06-09 Thread Scott Yeadon
Mika,

You may also want to try out the METS Java API (or other tools) available at  
http://www.loc.gov/standards/mets/mets-tools.html to see what suits you best.

Scott.


Date: Fri, 5 Jun 2009 14:37:33 -0400
From: Larry Stone l...@mit.edu
Subject: Re: [Dspace-tech] Producing mets.xml for SWORD
To: mikan.d.dspace listmail mikan.dsp...@gmail.com
Cc: Dspace Tech dspace-tech@lists.sourceforge.net
Message-ID: f22e2167-c9ec-4299-a7e6-2297effb0...@mit.edu
Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes

The Harvard METS toolkit gives you an convenient (though DOM-like) API  
for the METS data model, takes care of some of the housekeeping, and  
it has logic to prevent schema mistakes:

http://hul.harvard.edu/mets/

There's also a toolkit that helps you build a whole submission  
package, including the METS. It ought to work with SWORD too; see:
http://wiki.dspace.org/index.php/DSpace_SIP_Toolkit

(If you develop any tricks or procedures for using it with SWORD,  
please add them to that wiki page!)

  -- Larry

On Jun 5, 2009, at 5:25 AM, mikan.d.dspace listmail wrote:


  Im hoping to use SWORD in order to deposit items to DSpace. What would
  be the best way to produce mets-metadata for the items to be
  submitted? Should I write the entire metadata file myself or is there
  templates / frameworks available for this?
 
  Thanks,
  Mika
   

--
Crystal Reports - New Free Runtime and 30 Day Trial
Check out the new simplified licensing option that enables unlimited
royalty-free distribution of the report engine for externally facing 
server and web deployment.
http://p.sf.net/sfu/businessobjects
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] PDF Thumbnails, installed, but caused a lot of, trouble!

2009-04-06 Thread Scott Yeadon
Ruzbeh,

Assuming the table names haven't changed from 1.4.x and there is no 
bitstream management in 1.5.x, you could probably do something like:

update bitstream set deleted=true where name like '%.pdf.jpg';
and then run $DSPACE/bin/cleanup to get rid of the thumb files in the 
assetstore

You might also need to also do:
delete from bundle2bitstream where bitstream_id in (select bitstream_id 
from bitstream where name like '%.pdf.jpg');

as I'm not sure whether the cleanup job will do this or whether this is 
done when a bitstream is manually deleted.

Alternatively you could just write a quick java program to delete these 
using the API which could be safer than relying on my memory!

Scott.
 --

 Message: 1
 Date: Sun, 5 Apr 2009 15:51:25 -0700 (PDT)
 From: Ruzbeh ruzbe...@yahoo.com
 Subject: [Dspace-tech] PDF Thumbnails, installed, but caused a lot of
   trouble!
 To: DSpace-tech@lists.sourceforge.net
 Message-ID: 409153.6872...@web38808.mail.mud.yahoo.com
 Content-Type: text/plain; charset=us-ascii

 Hi everybody

 I installed the mediafilter available 
 athttp://sts.anu.edu.au/drs/downloads/index.php on 1.5.0.
 But during the bin/filter-media operations, handling a few items threw 
 exceptions.
 The exception was caused because ImageMagick Could not convert some pdfs with
 special sets of fonts into jpeg! It seemed non-vital at
 first, but now, when I enable the browse list to show thumbnails, the jsp 
 page throws an IOException
 with Format Not  Supported message, while performing the method of 
 InputStream.read()
 which causes a jspexception and that causes
 and the page remains only half-rendered. 

 1) I would appreciate it if somebody helped me with this issue. I tried to 
 manually, force the filter-media
 to handle those bad handled items, but I didn't succeed! And then I tried to 
 pin down every bad handled item with
 filter-media, in which I failed too!
 2)I tried to manually delete those bitstreams generated by the 
 filtermedia, but there are at least 400 pdf items in my database!!! Any help 
 or suggestion regarding 
 the removal of some bitstreams from items? any procedure who checks all the 
 bitstreams could 
 prove handy, because all the newly generated bitstreams end up in .pdf.jpg. 

   

  Roozbeh Ebrahimi,
 BSc of  pure Mathematics at
 Sharif University of Tech.
   
   


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] citations, journals, volumes, issues, , articles and dublin core

2009-02-24 Thread Scott Yeadon
Hi Mark,

In a DSpace context I think a major problem is lack of agreement on, or 
default implementation for a storage model for various classes of 
content (typically aggregations). If this could be achieved then 
value-add services could be far more easily connected - dissemination 
services such as Manakin (flexible themes, theme re-use, them choice, 
continuous improvement of targetted themes)and greater interoperability 
in a broader context. The technology part is solved. The standards and 
the tools are (mostly) there. The consistency isn't. For example, in a 
DSpace context there are three journal models I know of: the one we 
developed under APSR; TDL's journal display; and FORESITE as well as 
probably variants on just using DC for container relations you mention 
below, and who knows what Fedora or other platforms do. There are 
probably others. I personally don't think much progress will be made in 
the repository value-add service space (i.e. anything other than storage 
and file-level access) without content and interchange models. I'm not 
saying there is just one (right) way to package or store journals, but 
if there are more than two or three I think we just get stuck in a set 
of unsustainable proprietary implementations which require an equal 
number of unsustainable proprietary solutions in order to migrate or 
exchange content *in a meaningful form* over time.

Fedora appear to be looking at this space in its new Content Model 
Architecture, but from a cursory look they are again just providing 
tools to achieve (a large set of unsustainable?) implementations rather 
than work on reference implementations for more complex object classes 
such as journals or other aggregations. Given the joint nature of DSpace 
and Fedora development and the new OAI-ORE space with its aggregations 
and re-use and exchange scope the technologies are available to address 
this but it still needs some content/metadata modelling to provide 
consistency for value-add service providers.

Some of the work we have recently been doing involves developing METS 
Profiles for particular content classes which allows at least 
consistency in the packaging of materials for deposit to a repository 
(or some target process). We expect this work to continue over the 
coming years as we have been funded to being development of an 
Australian Data Commons, and this is unachievable without addressing the 
kind of problem you describe in you mail below. It's not a fast process.

Scott.

 Message: 3
 Date: Tue, 24 Feb 2009 09:34:20 -0500
 From: Mark H. Wood mw...@iupui.edu
 Subject: Re: [Dspace-tech] citations, journals, volumes, issues,
   articlesand dublin core
 To: dspace-tech@lists.sourceforge.net
 Message-ID: 20090224143420.ga21...@iupui.edu
 Content-Type: text/plain; charset=us-ascii

 May I suggest that we should never, never, never! get used to being
 shocked and surprised by some of the aspects of digital libraries, but
 rather to the warm feeling of having done something about them.

 It seems to me that metadata support for journal articles, while
 fundamental, is the least of our worries in this area.  People have
 been cataloging journal articles for a long time, and it shouldn't be
 that hard to map established practice to a mechanical representation.

 More urgently, I feel, we need to think a bit about how we ought to
 go forward.  I see two roads:

 o  adapt repository software (such as DSpace) to permit the behaviors
needed in representing paper journals online

 o  develop the most appropriate ways of organizing journal articles
for online presentation, and support those ways

 People are used to seeing papaer journals as *ordered* collections of
 articles (buttered in-between with features and announcements and
 similar matter) as issues within *ordered* collections of issues as
 volumes.  To go this route, we need to generalize a bit so that we can
 create new subclasses of containment in addition to communities and
 collections and items.  Order hasn't received enough attention in the
 design of DSpace.

 But is that really the most useful way of organizing journal articles?
 It's important to keep the association with the journal brand, and
 it's valuable to preserve the binding to time, but I don't think that
 people read volumes or journals; even if they habitually read from
 cover to cover, what they read is articles and features and whatnot.
 Is there really a need for volume and issue to be structural elements
 at all?  If we simply tag items with these attributes as metadata, and
 provide top-quality tools for searching on these attributes (as well
 as others), won't that serve?  If we have a submission process that
 can deal efficiently with repetitive metadata when submitting items
 which are related, won't that be sufficient?  How do readers, and
 catalogers, *want* to do their work?  How close can we reasonably get?

 We have a new medium to work with, and we should be 

Re: [Dspace-tech] How to order the articles within an issue

2009-01-22 Thread Scott Yeadon
Hi Andrew,

Your suggested approach using collections for issues and (sub-)community 
for journals is the approach we took. Unfortunately the hosted journal 
that intended to go into production didn't eventuate so it isn't 
implemented in our production repository but there is a screencast 
(http://www.apsr.edu.au/ore/) and a more recent presentation 
(http://www.apsr.edu.au/open_access_publishing/yeadon%20ojs-sword.pdf) 
you can look at which helps illustrate the approach we took. For 
ordering the items within an issue an OAI-ORE aggregation is generated 
by a DSpace journal METS ingestion module. We've done a little more work 
on this late last year, but the above links give you an idea of the 
process. If you want further info, feel free to contact me off list, as 
I don't think we've posted the lastest source anywhere.

The downside to all these different approaches for journal storage in 
DSpace is that Manakin themes can't be shared, but I think to achieve 
theme sharing beyond simple objects needs some co-ordinated DSpace 
storage modelling of compound objects and community agreement which is 
no small task.

Good luck!

Scott.
 -- Message: 3 Date: Thu, 22 Jan 2009 
 15:35:18 -0600 From: Scott Phillips scott.a.phill...@gmail.com 
 Subject: Re: [Dspace-tech] How to order the articles within an issue 
 To: Diggory Mark mdigg...@gmail.com Cc: 
 dspace-tech@lists.sourceforge.net 
 dspace-tech@lists.sourceforge.net Message-ID: 
 6254f291-71aa-451d-864a-9d18a2505...@gmail.com Content-Type: 
 text/plain; charset=us-ascii You may want to look at a Journal 
 collection we've done. It doesn't completely address the problem 
 you're asking about but is in the same sphere, and i second what Mark 
 said, it is likely best to not use communities and collections in that 
 manor. http://repositories.tdl.org/tdl/handle/2249.1/5065 Here is the 
 theme used to generate that view is located here: 
 http://repositories.tdl.org/tdl/themes/TDL/Periodicals/sitemap.xmap 
 http://repositories.tdl.org/tdl/themes/TDL/Periodicals/Periodicals.xsl 
 The trick we used is to add xml describing journal issues in the 
 metadata of the collection, then the theme knows to interpret this 
 small bit of data to build a browsable list of journal issues. Each of 
 those provide links to a masked search page for individual articles, 
 plus a list to an item that contains the full pdf for the journal. 
 Here is the current bit of XML on the collection: issues 
 xmlns:tdl=http://www.tdl.org/NS/tdl; xmlns=http://www.tdl.org/NS/tdl 
  issue vol=66 num=3amp;4 year=2008 name= 
 handle=2249.1/5643/ issue vol=66 num=1amp;2 year=2008 
 name= handle=2249.1/5546/ issue vol=65 num=3amp;4 
 year=2007 name= handle=2249.1/5496/ issue vol=65 
 num=1amp;2 year=2007 name= handle=2249.1/5438/ issue 
 vol=64 num=3amp;4 year=2006 name= handle=2249.1/5439/ 
 issue vol=64 num=1amp;2 year=2006 name= 
 handle=2249.1/5440/ issue vol=63 num=3amp;4 year=2005 
 name= handle=2249.1/5441/ issue vol=63 num=1amp;2 
 year=2005 name= handle=2249.1/5442/ issue vol=62 
 num=3amp;4 year=2004 name= handle=2249.1/5443/ issue 
 vol=62 num=1amp;2 year=2004 name= handle=2249.1/5444/ 
 issue vol=61 num=3amp;4 year=2003 name= 
 handle=2249.1/5445/ issue vol=61 num=1amp;2 year=2003 
 name= handle=2249.1/5446/ issue vol=60 num=3amp;4 
 year=2002 name= handle=2249.1/5447/ issue vol=60 
 num=1amp;2 year=2002 name= handle=2249.1/5448/ /issues 
 Scott-- On Jan 22, 2009, at 2:08 PM, Diggory Mark wrote:
  Andrew,
 
  I might recommend avoiding using Collections for Issues, mostly  
  because then your going to end up with lots of the them, and that  
  tends to make the interface both difficult to navigate and maintain.
 
  Another possibility to consider, instead, add the issue detail  
  either to the Item directly, or possibly create separate Items to  
  hold the issue metadata (relating them to one another in  
  dc.relation.isPartOf/hasPart fields).
 
  Thus:
 
  Society = Community
 
  Journal = Collection
 
  Issue and its Errata: Held in Issue Item
 
  Article and its Errata: Held in  Article Item
 
  You might then customize your presentation and search to Group  
  Articles under Issues.
 
  -Mark
 
  On Jan 22, 2009, at 1:41 AM, Andrew Marlow wrote:
 
   
  I am trying to set up a DSpace that models societies, journals,  
  issues and articles. Each article is an item. The collection that  
  contains the items is an issue. The journal that publishes the  
  issue(s) is a sub-community, and the society that publishes the  
  journal(s) is a community. My question is, how do I make it so that  
  the collection orders the items the way I want? An issue has a  
  particular order for the articles because it relates to the  
  physical printed version. That's the order I want for the  
  collection also. I think collections are ordered by publish date  
  (please correct me if I'm wrong about that). That's not good enough  
  for me since an issue has a 

Re: [Dspace-tech] DSpace-tech Digest, Vol 28, Issue 57

2008-08-25 Thread Scott Yeadon
Ameer,

It's old, but you can find a mediafilter at 
http://sts.anu.edu.au/drs/downloads/index.php, requires ImageMagick

Scott.
 Message: 3
 Date: Mon, 25 Aug 2008 14:27:55 -0400
 From: d space [EMAIL PROTECTED]
 Subject: [Dspace-tech] How to view preview/thumbnails of pdf documents
 To: dspace-tech@lists.sourceforge.net
 Message-ID:
   [EMAIL PROTECTED]
 Content-Type: text/plain; charset=iso-8859-1

 HI All,
  I have Dspace 1.4.2 installed on my server and I am able to view
 thumbnail images of JPEG files but would like to know if there is any way of
 presenting thumbnail/preview images of PDF documents. Does anyone have any
 idea of how to enable this?
 Best Regards,
 Ameer


-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK  win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100url=/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


[Dspace-tech] APSR software

2007-12-20 Thread Scott Yeadon
Hi All,

With APSR project winding down some proof-of-concept work has been 
posted on the APSR website for anyone who wants to take a look. This 
work is in the repository interoperability space. Rather than repeat the 
web site blurb refer to the follow web pages:
Australian METS Profile: http://www.apsr.edu.au/nla-mets/index.htm
Repository Submission Service: 
http://www.apsr.edu.au/submission_service/index.htm
Manakin Theme Editor (also comes bundled with Submission Service): 
http://www.apsr.edu.au/dissemination_service/index.htm
Note that if experimenting with the Manakin theme editor I'd suggest 
pointing it at an image collection, as the conference and journal 
collections have specific requirements as to how these materials are 
stored. Some of the code from this will be incorporated in Fez 2.0 and 
is applicable to any repository which can generate DRI-compliant XML.

While we can't commit to further development of the latter two software 
packages (all was developed under funded project monies), we will be 
attempting to build on work delivered under APSR in future projects.

I'm happy to take any queries off-list regarding these packages and 
further development of them.

Scott.


-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] importing a collection of files with identical names (from different directories)

2007-10-08 Thread Scott Yeadon
Rowan Brownlee wrote:
 That's very useful information.  Thanks Scott.  

 The metadata that I'm importing from a spreadsheet contains latitude 
 longitutude information which I'm using to generate kml files (for
 import to dspace along with the other data files).  I'm not familiar
 with manakin but I'm assuming that at some stage we might use it to
 provide a customised user interface to the collection that would include
 display of google-map-like features.
   
Yes, Leo does this for bidwern but uses Manakin (cocoon) to generate the 
kml on the fly rather than archive it.
 From what you say, it sounds like it could be useful to include path
 information in the metadata.  Do you have any thoughts on best options
 for doing this?  E.g. would I add a repeating custom metadata text field
 containing an entry for each file?
   
You could do that, it depends what the metadata represents as to how you 
map it e.g. does it represent subject, discipline, etc or is it fairly 
arbitrary.
 Also - I'm not familiar with how dspace arranges file storage.  Would
 the file path metadata entry be a replication of the entry as recorded
 in the contents file?
   
The path used in the contents file would be stored as metadata with 
each bitstream. Each bitstream however is renamed by DSpace and stored 
in the DSpace assetstore, so the file path, once in DSpace only exists 
as metadata.
 Bye
 rowan


   


-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now  http://get.splunk.com/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] stand-alone Handle Server

2007-08-23 Thread Scott Yeadon
Hi John,

This won't help much now, but one of the things I want to look at next 
year is using DSpace with the PILIN software 
(http://www.arrow.edu.au/PILIN). PILIN software is essentially a wrapper 
around the handle server code and can supposedly be run as a local 
handle server for minting, updating and removing handles. The work Jim R 
is doing on the identifier code looks like it will be of great help to this.

Scott.


 Date: Thu, 23 Aug 2007 14:54:55 -0400
 From: John Davison [EMAIL PROTECTED]
 Subject: [Dspace-tech] stand-alone Handle Server
 To: dspace-tech@lists.sourceforge.net
 Message-ID: [EMAIL PROTECTED]
 Content-Type: text/plain; charset=us-ascii

 Hello All,

  

 I'm investigating the use of a stand-alone remote Handle Server instead of
 the one bundled with DSpace.

  

 Robert Tansley and the China Digital Museum Project released a patch on
 SourceForge (Enable Use Of Remote Handle Server:
 https://sourceforge.net/tracker/index.php?func=detailaid=1272731group_id=1
 9984atid=319984) but the file was last updated on 9-28-2005. Its summary
 lists several outstanding issues, including batch handle generation for bulk
 import and suffix sequencing. 

  

 I'm curious if this is the currently accepted solution, or if there are
 other, more recent options available.

  

 Thanks in advance,

 John Davison

 Ohio Library  Information Network

  

   


-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now   http://get.splunk.com/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Item Display as per the Order of Ingestion

2007-08-05 Thread Scott Yeadon
Hi Jayan,

You'd need to make modifications to handle the ordering somehow through 
metadata (e.g. ensuring they're in order by adding 01, 02 etc to the 
titles) or some db table to handle ordering and couple that with a 
custom jsp tag. It all gets a bit messy though and we've found this is 
better done through Manakin.

In a couple of weeks we're deploying some software packages which among 
other things include a DSpace package ingester, Manakin theme and 
OJS-to-METS stylesheets for submitting and handling OJS journals in 
DSpace. After some production testing we'll be making this available via 
Sourceforge as part of APSR.  At this stage we're looking at sometime 
late Sept. In this instance the journal ordering is maintained via a 
manifest object (RDF file).

Scott.

 Date: Fri, 3 Aug 2007 14:42:52 +0800
 From: Jayan Chirayath Kurian [EMAIL PROTECTED]
 Subject: [Dspace-tech] Item Display as per the Order of Ingestion
 To: dspace-tech@lists.sourceforge.net
 Message-ID:
   [EMAIL PROTECTED]
   
 Content-Type: text/plain; charset=us-ascii

 Hi! 

  

 In a Journal we have sequential list of items. Is there a way to display
 items in a DSpace collection according to the sequential order in which
 they appear in the journal? By default dspace items are displayed
 alphabetically taking into consideration the title metadata. Please
 suggest.

  

 Thanks,

 Jayan

 -- next part --
 An HTML attachment was scrubbed...

 --

 -
 This SF.net email is sponsored by: Splunk Inc.
 Still grepping through log files to find problems?  Stop.
 Now Search log events and configuration files using AJAX and a browser.
 Download your FREE copy of Splunk now   http://get.splunk.com/

 --

 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


 End of DSpace-tech Digest, Vol 16, Issue 7
 **

   


-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now   http://get.splunk.com/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


[Dspace-tech] RAMP project beta software

2007-07-25 Thread Scott Yeadon
Hi All, FYI - a beta release of the mura software developed under the 
RAMP project has been made available at 
http://drama.ramp.org.au/cgi-bin/trac.cgi. While the GUI component is 
Fedora related, the authors suggest that the other modules are 
applicable to repositories in general. However they stipulate that this 
is only true where the authorisation and authentication mechanisms are 
not tightly coupled withint the repository core, which is not currently 
true in DSpace but presumably where we want to end up. Anyone looking 
into the auth* management for DSpace at the moment may want to have a 
look (it's on my todo list, but that's a long list!). Scott.


-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now   http://get.splunk.com/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] [Dspace-devel] Implicit authentication methods

2007-06-28 Thread Scott Yeadon
Hi Mark,

I got around it for now just be re-processing the request and seems to 
work fine (see
http://sourceforge.net/mailarchive/message.php?msg_name=46786482.1020808%40anu.edu.au
 
for details), maybe that's a quick fix that may help.

I'm not sure that's a real fix though. The throwing of the exception 
from the Bitstream class wrestles away control of the request flow so I 
guess that's where it should properly be fixed up. I haven't really had 
time to do further investigation as yet.

I suspect this was coded before the boolean test for authorisation was 
available or for some other reason such as automated login not being 
part of the initial implementation.

Scott.


Mark Diggory wrote:
 Scott,

 We observe this bug in our production systems and need to find the fix 
 as well.  We have an implicit MIT Certificate Authentication thats 
 failing to work directly against bitstreams because of it.

 -Mark

 On Jun 18, 2007, at 2:20 AM, Scott Yeadon wrote:

 OK, it's not null, the authentication succeeds, but since the code drops
 down into the AuthorizationException handling of the DSpaceServlet.java,
 there is no redirection (implicit authentication) so it bombs out. Is
 this a bug?

 Scott.

 Scott Yeadon wrote:
 Hi All,

 I have a stackable authentication class which obtains credentials from
 a request. When I access a protected bitstream, the auth class works
 (credentials are parsed and all ok) however the context user is still
 null although setCurrentUser() is called before returning the SUCCESS
 status. The flow is:

 - attempt to access protected bitstream
 - AuhorizeAction happens, fails, throws AuthorizeException
 - Authenticate.startAuthentication occurs and my class in invoked with
 success
 - After return from startAuthentication the currentUser is still null
 (even though explicitly being set in the authenticate method), so the
 request fails with access denied to user 0 (default value when
 getCurrentUser() is null). As my class is implicit authentication
 there is no redirection either.

 I don't understand why this is null, the log message shows:
 2007-06-18 14:31:47,032 INFO
 org.dspace.app.webui.servlet.DSpaceServlet @
 myuser:session_id=AB51545186B04E419B63AD9FF140C7BF:ip_addr=150.203.2.97:authorize_error:org.dspace.authorize.AuthorizeException:
  

 Authorization denied for action READ on BITSTREAM:32978 by user 0

 showing that the myuser user is the current user (at least in the
 logHeader!) but not in the context object.

 Anyone done this before? My code is almost identical to the X509 auth,
 the only real difference being where the credentials are taken from.
 There are no problems with the credentials or request itself, just
 something I'm missing with the auth flow I suspect.

 Scott.




 - 

 This SF.net email is sponsored by DB2 Express
 Download DB2 Express C - the FREE version of DB2 express and take
 control of your XML. No limits. Just data. Click to get it now.
 http://sourceforge.net/powerbar/db2/
 ___
 Dspace-devel mailing list
 [EMAIL PROTECTED]
 https://lists.sourceforge.net/lists/listinfo/dspace-devel

 ~
 Mark R. Diggory - DSpace Systems Manager
 MIT Libraries, Systems and Technology Services
 Massachusetts Institute of Technology
 Office: E25-131
 Phone: (617) 253-1096





-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


[Dspace-tech] Implicit authentication methods

2007-06-17 Thread Scott Yeadon
Hi All,

I have a stackable authentication class which obtains credentials from a 
request. When I access a protected bitstream, the auth class works 
(credentials are parsed and all ok) however the context user is still 
null although setCurrentUser() is called before returning the SUCCESS 
status. The flow is:

- attempt to access protected bitstream
- AuhorizeAction happens, fails, throws AuthorizeException
- Authenticate.startAuthentication occurs and my class in invoked with 
success
- After return from startAuthentication the currentUser is still null 
(even though explicitly being set in the authenticate method), so the 
request fails with access denied to user 0 (default value when 
getCurrentUser() is null). As my class is implicit authentication there 
is no redirection either.

I don't understand why this is null, the log message shows:
2007-06-18 14:31:47,032 INFO  org.dspace.app.webui.servlet.DSpaceServlet 
@ 
myuser:session_id=AB51545186B04E419B63AD9FF140C7BF:ip_addr=150.203.2.97:authorize_error:org.dspace.authorize.AuthorizeException:
 
Authorization denied for action READ on BITSTREAM:32978 by user 0

showing that the myuser user is the current user (at least in the 
logHeader!) but not in the context object.

Anyone done this before? My code is almost identical to the X509 auth, 
the only real difference being where the credentials are taken from. 
There are no problems with the credentials or request itself, just 
something I'm missing with the auth flow I suspect.

Scott.


-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] [Dspace-general] DSpace `Dublin Core' | Date Issued | Date Range | How to represent

2007-06-12 Thread Scott Yeadon
Hi Richard,
 Hello Scott,

 Thanks for your note.

 On Tue, 2007-06-12 at 12:06, Scott Yeadon wrote:
   
 Hi Richard,

 It's up to you how you represent your values, you could use the DCMI 
 Period or something simple such as 1930-1940. We tend to have the 
 latter since that's what our users typically enter. The batch import 
 process won't parse the values, as long as the document is valid XML the 
 values will be accepted.
 

 Granted that using something such as:

 dcvalue element=date qualifier=issued1964-1970/dcvalue

 in one's `dublin_core.xml' file seems practical and expedient, on my
 system at least -- DSpace-1.4.0 -- such an approach breaks DSpace's
 `Browse by Title', `Browse by Date', and the offending item's `Brief
 View'.
   
 This is the reason I first asked the lists for details of how one
 should _correctly_ represent a date range in the DSpace
 `dublin_core.xml' file. Using `1964-1970' and so on simply does not
 seem to work.
   
It's likely that this is because the default metadata display is not 
able to render date ranges properly. In your DSpace config file put the 
following entry:
webui.itemdisplay.default = dc.title, dc.title.alternative, 
dc.contributor.*, dc.subject, dc.date.issued, dc.publisher, 
dc.identifier.citation, dc.relation.ispartofseries, 
dc.description.abstract, dc.description, dc.identifier.govdoc, 
dc.identifier.uri(link), dc.identifier.isbn, dc.identifier.issn, 
dc.identifier.ismn, dc.identifier

The date.issued field is by default formatted to a date (see 
ItemTag.java for the hardcoded list) using the dc.date.issued(date) 
field display text. Removing the (date) part of this will stop any 
special rendering taking place.

Also, setting:
webui.itemlist.columns = dc.date.issued, dc.title, dc.contributor.*
in the dspace.cfg file may also resolve your ranges not showing up in 
the browse page (the default specifies dc.date.issued(date)), so as 
above removing the rendering rules should fix this)

There should also be a section in the docs covering the above config 
items you could have a look at, since you may also want to customise 
your simple view and browse columns.

 I have put together a series of screenshots to indicate the issues:

 http://indica-et-buddhica.org/sections/repositorium-preview/known-issues/dspace-item-date-ranges

 As you will see, I am - unhappily - coming to the conclusion that
 DSpace does not support item date ranges at all. It is also becoming
 clear that the lack of genuine validation by the item importer can
 easily lead to the widespread corruption of ones metadata. I hope I am
 wrong as these would be serious deficiencies.
   
I imagine the reason validation is not included is that DSpace in 
general doesn't set down value format rules, since every institution is 
likely to have differing requirements. Having said that, there is no 
easy plugin to set your own rules, you would have to modify the import 
process to support this if you needed it, or pre-process your material 
prior to loading to apply any rules you need enforced.

Hope that helps.

Scott.

 Best regards,

  Richard Mahoney


   
 Scott.
 
 Message: 3
 Date: Fri, 08 Jun 2007 12:08:16 +1200
 From: Richard MAHONEY [EMAIL PROTECTED]
 Subject: [Dspace-general] DSpace `Dublin Core' | Date Issued | Date
 Range | How to represent
 To: DSpace Tech dspace-tech@lists.sourceforge.net,DSpace General
 [EMAIL PROTECTED]
 Message-ID: [EMAIL PROTECTED]
 Content-Type: text/plain

 Dear List Members,

 I am in the process or preparing material for bulk import and have
 again encountered and issue that I was inclined to gloss over last time
 it arose: the format of the DSpace Dublin Core Date Elements,
 Qualifiers, and particularly, the Values.

 What exactly is the required Value format and is it configurable?
 Simple date Values such as the following present no difficulty:

 dcvalue element=date qualifier=issued1970/dcvalue

 The trouble for me -- and this situation would arise often for many
 projects -- is how to correctly represent date ranges, for e.g., date
 issued, 1964 to 1970. Which Value format should should be used to
 represent a date range in DSpace DC? Some DSpace version of the
 W3C-DTF/ISO 8601 scheme?

  http://dublincore.org/documents/2000/07/28/dcmi-period/


 Best regards,

  Richard Mahoney


   
   


-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] chinese search

2007-04-17 Thread Scott Yeadon
You need to set your search.analyzer = 
org.apache.lucene.analysis.cn.ChineseAnalyzer in the dspace.cfg file and run 
index-all. 

Note that we use org.apache.lucene.analysis.standard.StandardAnalyzer which 
seems to handle a mix of Chinese and English quite well.

Scott.


Date: Tue, 17 Apr 2007 16:21:31 +0800 (CST)
From: =?GBK?B?zMbV1+f5?= [EMAIL PROTECTED]
Subject: [Dspace-tech] chinese search
To: dspace-tech@lists.sourceforge.net
Message-ID:
[EMAIL PROTECTED]
Content-Type: text/plain; charset=gbk

hi, My DSpace version is 1.4.1, there is a search configuration in the file 
dsapce.cfg, I let the chinese Analyzer available ,but it doesn't work, who 
knows how to make chinese search available? [EMAIL PROTECTED] thank you! 
Shanghai JiaoTong University Library Evan
-- next part --
An HTML attachment was scrubbed...

--


-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


[Dspace-tech] dspace repositories: exposing MPEG-21 DIDL metadata using OAI-PMH?

2007-03-20 Thread Scott Yeadon
Hi Alan,

Configure your oaicat.properties with:
Crosswalks.didl=org.dspace.app.oai.DIDLCrosswalk

Uncomment the |oai.didl.maxresponse| item in |dspace.cfg
|
That's all you need to do (plus restart Tomcat) to expose DIDL packages, 
using metadataPrefix=didl

Verify the Crosswalk is activated by accessing a URL such as 
|http://mydspace/dspace-oai/request?verb=ListRecordsmetadataPrefix=didl
|
There is a section in the Configuration section of the 1.4.1 document 
that covers this (under the Activating Additional OAI-PMH Crosswalks 
section cid:part1.03080902.06090706@anu.edu.au) that you should also 
take a look at.

Scott.

 Date: Mon, 19 Mar 2007 17:26:16 -0700
 From: Cornish, Alan K [EMAIL PROTECTED]
 Subject: [Dspace-tech] dspace repositories: exposing MPEG-21 DIDL
   metadatausing OAI-PMH?
 To: dspace-tech@lists.sourceforge.net
 Message-ID:
   [EMAIL PROTECTED]
 Content-Type: text/plain; charset=us-ascii

 My site recently upgraded its DSpace instances to 1.4.1.  The DSpace 1.4
 release info (URL
 http://mailman.mit.edu/pipermail/dspace-general/2006-July/001064.html)
 indicates that it's possible to expose METS and MPEG-21 DIDL DIPs via
 OAI-PMH.  At my site, we have been testing both mets and didl.
 Retrieving mets records using OAI-PMH verbs has been working well.
 However, I've been unable to retrieve records using didl.  I'm
 interested in communicating with staff or techs at a DSpace site that
 has successfully exposed DSpace didl metadata using OAI-PMH.  What steps
 were needed to expose records in this extended format?

  

  

 Any info would be appreciated, thanks!

 Al Cornish

 Head, Library Systems

 Washington State University Libraries

 Pullman, WA  99164-5610

  

 509.335.1895 (voice)

 [EMAIL PROTECTED] (e-mail)

  

  

  

  

 

 -- next part --
 An HTML attachment was scrubbed...

 --

 Message: 4
 Date: Tue, 20 Mar 2007 13:08:12 +0100
 From: [EMAIL PROTECTED]
 Subject: [Dspace-tech] Cataloguing web resources
 To: dspace-tech@lists.sourceforge.net
 Message-ID:
   [EMAIL PROTECTED]
 Content-Type: text/plain; charset=ISO-8859-1; format=flowed

 Hallo,
 I would like to know if it is possible with DSpace to catalogue web
 resources (links to web pages or other digital resources). Essentially
 I need to catalogue URLs without having to upload files.
 I know that an instrument dedicated to this purpose is Scout Portal
 Toolkit (http://scout.wisc.edu/Projects/SPT/), but I'm interested also
 in classifying other documents too.
 Anyone knows how or if it's possible with DSpace?

 Thanks,
 Mauro



 --

 Message: 5
 Date: Tue, 20 Mar 2007 12:41:54 +
 From: orlando carvalho [EMAIL PROTECTED]
 Subject: [Dspace-tech] using SRW client on Dspace
 To: dspace-tech@lists.sourceforge.net
 Message-ID:
   [EMAIL PROTECTED]
 Content-Type: text/plain; charset=iso-8859-1

 Hi there!
 We put an SRW client working properly!
 ##
 import java.net.*;
 import org.apache.axis.*;
 import gov.loc.www.zing.srw.SearchRetrieveRequestType;
 import gov.loc.www.zing.srw.SearchRetrieveResponseType;
 import gov.loc.www.zing.srw.interfaces.SRWPort;
 import gov.loc.www.zing.srw.srw_sample_service.SRWSampleServiceLocator;

 /*
  * Created on 8/Mar/2007
  *
  * To change the template for this generated file go to
  * Windowgt;Preferencesgt;Javagt;Code Generationgt;Code and Comments
  */
 //response.getEchoedSearchRetrieveRequest().getQuery()
 /**
  * @author dspace
  *
  * To change the template for this generated type comment go to
  * Windowgt;Preferencesgt;Javagt;Code Generationgt;Code and Comments
  */
 public class cliente_java {

 public static void main(String[] args) {
 try{
 SRWSampleServiceLocator service = new SRWSampleServiceLocator();
 URL url = new URL(http://localhost:8080/SRW/search/DSpace;);
 SRWPort port = service.getSRW(url);
 SearchRetrieveRequestType request = new
 SearchRetrieveRequestType();
 request.setQuery(computer);
 SearchRetrieveResponseType response =
 port.searchRetrieveOperation(request);
 //Object i = response.getResultSetId();
 System.out.println(Resultados:+
 response.getEchoedSearchRetrieveRequest().getQuery());

 }
 catch (Exception e){
 e.printStackTrace();
 }


 }
 }
 ##
 It returns:
Results:1(this is the number of records finded)
Results:name (this is the author or the title of the item)

 Our question is, how can we change this code to the answer to our request
 returns the full description of the items stored on Dspace?
 Thanks for the help.
   


-
Take Surveys. Earn Cash. Influence the Future of IT
Join 

Re: [Dspace-tech] DSpace-tech Digest, Vol 10, Issue 45

2007-02-19 Thread Scott Yeadon
Hamish,

Check out 
http://www.mail-archive.com/dspace-tech@lists.sourceforge.net/msg00182.html, 
it's supposed to be an optional dependency but isn't mentioned in the 
PDFBox docs.

Scott.


Date: Mon, 19 Feb 2007 15:51:35 +1030
From: Brett, Hamish [EMAIL PROTECTED]
Subject: [Dspace-tech] Filter-media error
To: dspace-tech@lists.sourceforge.net
Message-ID:
   [EMAIL PROTECTED]
Content-Type: text/plain; charset=us-ascii

Hi,
 
Ever since upgrading to 1.4.1 when I run filter-media the following
error
 
ERROR filtering, skipping bitstream #1584 java.io.IOException: Invalid
header signature; read 290763650945099227, expected -2226271756974174256
java.io.IOException: Invalid header signature; read 290763650945099227,
expected -2226271756974174256
at
org.apache.poi.poifs.storage.HeaderBlockReader.init(HeaderBlockReader.
java:88)
at
org.apache.poi.poifs.filesystem.POIFSFileSystem.init(POIFSFileSystem.j
ava:83)
at
org.textmining.text.extraction.WordExtractor.extractText(WordExtractor.j
ava:48)
at
org.dspace.app.mediafilter.WordFilter.getDestinationStream(WordFilter.ja
va:97)
at
org.dspace.app.mediafilter.MediaFilter.processBitstream(MediaFilter.java
:155)
at
org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilte
rManager.java:327)
at
org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterMana
ger.java:296)
at
org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilt
erManager.java:266)
at
org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems(Media
FilterManager.java:234)
at
org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.ja
va:185)
SKIPPED: bitstream 997 because '2000 J Thermal Stress (One-sided
repair).PDF.txt' already exists
SKIPPED: bitstream 2833 because 'iRoom at DSTO.pdf.txt' already exists
SKIPPED: bitstream 2835 because 'Network Enabled Warfare4.pdf.txt'
already exists
SKIPPED: bitstream 2837 because 'DORC99-Lin-Zhang.PDF.txt' already
exists
SKIPPED: bitstream 2839 because 'icota98.pdf.txt' already exists
Exception in thread main java.lang.NoClassDefFoundError:
org/bouncycastle/jce/provider/BouncyCastleProvider
at
org.pdfbox.pdmodel.PDDocument.openProtection(PDDocument.java:905)
at org.pdfbox.pdmodel.PDDocument.decrypt(PDDocument.java:489)
at
org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:204)
at
org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:149)
at
org.dspace.app.mediafilter.PDFFilter.getDestinationStream(PDFFilter.java
:110)
at
org.dspace.app.mediafilter.MediaFilter.processBitstream(MediaFilter.java
:155)
at
org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilte
rManager.java:327)
at
org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterMana
ger.java:296)
at
org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilt
erManager.java:266)
at
org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems(Media
FilterManager.java:234)
at
org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.ja
va:185)

Any ideas?
 
Thanks
 
Hamish
-- next part --
An HTML attachment was scrubbed...

--
  



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] DSpace not indexing MS Powerpoint files?

2007-01-31 Thread Scott Yeadon
Pan,

You'll need to write your own media filter class to handle the 
extraction of text from PowerPoint files as ppt text extraction isn't 
currently supported by the default set of media filters. Hopefully 
someone may have already done this and will share, but if not you'll 
have to write your own using OpenOffice or some other means.

Scott.

Date: Wed, 31 Jan 2007 15:07:21 -0800
From: Pan Family [EMAIL PROTECTED]
Subject: [Dspace-tech] DSpace not indexing MS Powerpoint files?
To: dspace-tech@lists.sourceforge.net
Message-ID:
   [EMAIL PROTECTED]
Content-Type: text/plain; charset=iso-8859-1

Hi,

I submitted a MS ppt file to my collection, but filter-media
does not want to index this ppt file.  I tried to shut down
the database (PostgreSQL) and restarted it, and ran
filter-media several times, but it did not help.  I made
sure that this ppt file is indeed in the collection by openning
it using View/Open.

I have no problem indexing MS Word, text, html, or pdf
files.  Do I need to do anything special for ppt files?

Thanks a lot!

-Pan
  



-
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier.
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech