Re: [Dspace-tech] LAC e-theses harvesting of DSpace 3.1
Hi Graham: We ran into this recently as well -- in our case, it turned out that sometime late last year LAC decided that they required a publisher field in the etdms output, and while they were harvesting our theses, their process for generating MARC records for their catalogue silently failed in the absence of that field (or they ignored the errors). Note that in Anthony's example, they seem to have hard-coded the publisher output in their XSL; that's the approach that we're probably going to use as well, just to be expedient. Dan On 9 June 2014 14:28, Graham Faulkner graham.faulk...@uwaterloo.ca wrote: Hi all, I’m wondering if there is someone on the list who is running DSpace 3.1+ and has a collection being properly harvested by Library and Archives Canada’s ETD harvesting program. We recently upgraded from DSpace 1.8 to 3.1 and the LAC harvesting apparently stopped working. Any pointers in the right direction are greatly appreciated. Cheers, Graham - Graham Faulkner University of Waterloo Library -- HPCC Systems Open Source Big Data Platform from LexisNexis Risk Solutions Find What Matters Most in Your Big Data with HPCC Systems Open Source. Fast. Scalable. Simple. Ideal for Dirty Data. Leverages Graph Analysis for Fast Processing Easy Data Exploration http://p.sf.net/sfu/hpccsystems ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette -- Want fast and easy access to all the code in your enterprise? Index and search up to 200,000 lines of code with a free copy of Black Duck Code Sight - the same software that powers the world's largest code search on Ohloh, the Black Duck Open Hub! Try it now. http://p.sf.net/sfu/bds___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
Re: [Dspace-tech] https
On 26/09/2007, [EMAIL PROTECTED] [EMAIL PROTECTED] wrote: Hi All, I would like to use https for my entire dspace website. Could anybody tell me how it could be done? Thank you, Sree A quick note from experience - you can't run the dspace-oai application under https, as https is not a recognized protocol for OAI harvesters. However, we're running everything else under https and there has been no noticeable server load thus far. Roughly speaking, I added the following to our vhost configuration to make this happen, assuming that you want your entire dspace site to be accessible from https://hostname/dspace/ with the exception of your http://hostname/dspace-oai/ application under HTTP: RewriteEngine on RewriteRule ^/(?!dspace-oai.*) https://%{HTTP_HOST}/$1 [L,R] (that's assuming that you have SSL mod_jk integration already functional, per the standard DSpace documentation) -- Dan Scott Laurentian University - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] OutOfMemory errors during large PDF indexing
error stack trace is below: Exception in thread main java.lang.OutOfMemoryError : Java heap space at java.util.HashMap.resize(Unknown Source) at java.util.HashMap.addEntry(Unknown Source) at java.util.HashMap.put (Unknown Source) at org.fontbox.cmap.CMap.addMapping(CMap.java:132) at org.fontbox.cmap.CMapParser.parse(CMapParser.java:153) at org.pdfbox.pdmodel.font.PDFont.parseCmap (PDFont.java: 535) at org.pdfbox.pdmodel.font.PDFont.encode(PDFont.java:387) at org.pdfbox.util.PDFStreamEngine.showString(PDFStreamEngine.java :325) at org.pdfbox.util.operator.ShowText.process (ShowText.java:64) at org.pdfbox.util.PDFStreamEngine.processOperator (PDFStreamEngine.java :452 ) at org.pdfbox.util.PDFStreamEngine.processSubStream (PDFStreamEngine.java:21 5) at org.pdfbox.util.PDFStreamEngine.processStream (PDFStreamEngine.java:174) at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java: 336) at org.pdfbox.util.PDFTextStripper.processPages (PDFTextStripper.java: 259) at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216) at org.pdfbox.util.PDFTextStripper.getText (PDFTextStripper.java:149) at org.dspace.app.mediafilter.PDFFilter.getDestinationStream (PDFFilter.java :114) at org.dspace.app.mediafilter.MediaFilterManager.processBitstream (MediaFilt erManager.java:602) at org.dspace.app.mediafilter.MediaFilterManager.filterBitstream (MediaFilte rManager.java:513) at org.dspace.app.mediafilter.MediaFilterManager.filterItem (MediaFilterMana ger.java :461) at org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem (MediaFilt erManager.java:428) at org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems (Media FilterManager.java:391) at org.dspace.app.mediafilter.MediaFilterManager.main (MediaFilterManager.ja va:342) - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech ~ Mark R. Diggory - DSpace Systems Manager MIT Libraries, Systems and Technology Services Massachusetts Institute of Technology -- Tim Donohue Research Programmer, Illinois Digital Environment for Access to Learning and Scholarship (IDEALS) 135 Grainger Engineering Library University of Illinois at Urbana-Champaign email: [EMAIL PROTECTED] web: http://www.ideals.uiuc.edu phone: (217) 333-4648 fax: (217) 244-7764 - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech -- Dan Scott Laurentian University - This SF.net email is sponsored by: Microsoft
Re: [Dspace-tech] Any definitive guide to setting up handle server?
On 07/09/2007, Mathias Hjelt [EMAIL PROTECTED] wrote: Does anyone have an up-to-date and complete step-by-step guide on how to set up the Handle server with DSpace 1.4.2? If such a guide exists, would't it be a good idea to include it in the DSpace documentation or put it on the Wiki? I'm currently in the process of trying to get this set up and I'm mildly annoyed by the lacking documentation in [dspace-source]/docs/install.html. The first problem I ran into was that make-handle-config gives me Warning: data not encrypted - a quick Google on that led me to http://www.thesesalive.ac.uk/archive/ERAInstallation-1.9.html#_Toc824950 72 which suggested running dsrun net.handle.server.SimpleSetup instead, which is also mentioned in DSpace's install.html. However, any documentation I've seen so far fails to mention that you should NOT put a passphrase on your keys - I did it, and now the handle server won't start. A quick Google on Invalid passphrase - found a thread suggesting that yes it's in the documentation if you read the make-handle-config script.. Helpful indeed, in retrospect. Furthermore, docs/install.html isn't very clear on how to proceed once the above obstacles are out of the way. More Googling - found http://sunsite.utk.edu/diglib/dspace/#handle which makes the important point that one should ignore the official Handle documentation altogether (e.g that about homing your prefix etc). It also gives some useful step-by-step instructions which, in my opinion, could be integrated into docs/install.html. (Install.html implies you should start the Handle server prior to even applying for a prefix, and fails to explain how to put the obtained prefix into the Handle server's config..) So, now that I'm waiting for a reply from CNRI on how to proceed with the passphrase issue (I did not find any solution for removing the passphrase OR for making the handle server startup prompt for it), I'm still not certain that I've got everything else right. Hence my question, does anyone have a complete and up-to-date guide on how to do this properly, without taking the trial-and-error path? Best regards Mathias Hjelt Hi Mathias: I managed to navigate most of the hurdles with Handle, but I agree with you in general about the need for an up to date guide. Additionally, I'd love to see two streams of documentation - one for the up to date handle.jar from CNRI, another for the 2005 version of handle.jar bundled in DSpace. There is a way of removing the pass phrase; it was described in a July 30, 2007 post to the list by Rafa Carreres. I, too, stumbled on this step; the pass phrase removal seems to work. Unfortunately, now that I have removed my passp hrase and completed the other steps in the DSpace install.txt file (including update-handle-prefix), I just get resolution errors from hdl.handle.net, but there's nothing useful in the /dspace/handle-server/error.log or /dspace/log/handle-server.log to help me debug the problem. It's probably a bad sign that I'm not seeing a handle-plugin.log being created, as requested by log4j-handle-plugin.properties. But that's just how my week has gone. -- Dan Scott Laurentian University - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now http://get.splunk.com/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] Authorizing a user to add a file to a document
Hi: I chatted briefly with Rob Tansley about this on IRC, but I thought I would post to the list and see if I'm overlooking something. I'm attempting to grant a user the ability to add more files to a single document. The context is that the document is a book, and the author wants to be able to add chapters as they are completed so that it is an open work in progress. We're running DSpace 1.4.1 at the moment. In the process of trial and error, I added the user to the Collection Admin group for the collection in which the document appears. That wasn't enough; so I gave the user ADD and WRITE permissions on the item Bundle; that wasn't enough. I can't see any further permissions that I can add through the Item Policy editor. When the user tries to add an item, however, they receive an error message that they are not authorized to perform that action. The log file contains the following exception: authorize_error:org.dspace.authorize.AuthorizeException: Authorization denied for action WRITE on BITSTREAM:176 by user 6 But of course, I can't grant permissions to the user on the bitstream in advance because that is simply the next bitstream in sequence - it doesn't exist until the user attempts to upload it. From reading the pertinent code[1], it seems that the only way to add files to a document is if you are a DSpace administrator - and I really don't want to open things up this wide for a single user. Am I missing something obvious here? Or is DSpace not set up to enable this sort of granular authorization scheme? 1. http://dspace.svn.sourceforge.net/viewvc/dspace/trunk/dspace-api/src/main/java/org/dspace/authorize/AuthorizeManager.java?revision=1957view=markup, method authorize(), clause if isAdmin(c) { return true; } Thanks, -- Dan Scott Laurentian University - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] WG: newbie question
Hi Felix: Okay, so 99% of the files in /var/lib/tomcat5/* and /var/cache/tomcat5/* are owned by the dspace user -- that's good -- but you're still running tomcat as the tomcat5 user, so tomcat can't actually read the files -- that's bad. As I pointed out in my previous email, it sounds like you haven't followed step 13 of the wiki (http://wiki.dspace.org/index.php/Installing_DSpace_on_Ubuntu_6.06_%28LTS%29): # Append the following lines to /etc/default/tomcat5 to set the preferences necessary for dspace: TOMCAT5_USER=dspace JDK_DIRS=/usr/lib/jvm/java-1.5.0-sun TOMCAT5_SECURITY=no Please make these changes, stop tomcat, start tomcat, and try again :) Dan On 12/07/07, Felix Mayerhofer [EMAIL PROTECTED] wrote: Hi again, sorry for the delay, here's the output: ps wuax | grep tomcat : tomcat5 28146 0.0 0.1 3728 1188 ?S10:15 0:00 su -p -s /bin/sh tomcat5 -c /usr/sbin/rotatelogs /var/lib/tomcat5/logs/catalina_%F.log 86400 tomcat5 28148 0.0 0.1 6228 1536 ?S10:15 0:00 /usr/sbin/rotatelogs /var/lib/tomcat5/logs/catalina_%F.log 86400 tomcat5 28155 0.0 3.8 258584 30092 ?Sl 10:15 0:11 /usr/lib/jvm/java-6-sun-1.6.0.00/bin/java -Djava.awt.headless=true -Xmx128M -Djava.endorsed.dirs=/usr/share/tomcat5/common/endorsed -classpath /usr/lib/jvm/java-6-sun-1.6.0.00/lib/tools.jar:/usr/share/tomcat5/bin/commons-launcher.jar:/usr/share/tomcat5/bin/commons-logging-api.jar:/usr/share/tomcat5/bin/jmx.jar:/usr/lib/jvm/java-6-sun-1.6.0.00/jre//lib/jcert.jar:/usr/lib/jvm/java-6-sun-1.6.0.00/jre//lib/jnet.jar:/usr/lib/jvm/java-6-sun-1.6.0.00/jre//lib/jsse.jar:/usr/share/tomcat5/bin/bootstrap.jar:/usr/share/tomcat5/bin/commons-logging-api.jar -Dcatalina.base=/var/lib/tomcat5 -Dcatalina.home=/usr/share/tomcat5 -Djava.io.tmpdir=/var/lib/tomcat5/temp org.apache.catalina.startup.Bootstrap start root 10791 0.0 0.1 2920 808 pts/3R+ 20:14 0:00 grep tomcat I put the ls commands in the 2 textfiles, cause the output is just too big. Thanks again for the help!!! greetz -Ursprüngliche Nachricht- Von: Dan Scott [mailto:[EMAIL PROTECTED] Gesendet: Mittwoch, 11. Juli 2007 19:24 An: Felix Mayerhofer Cc: dspace-tech@lists.sourceforge.net Betreff: Re: [Dspace-tech] WG: newbie question Hi Felix: I wrote the Ubuntu install instructions and have confirmed with other people that they work, if followed step by step. If tomcat is running under the tomcat5 user, then you must have missed step 8 (the second step 8; I wasn't able to figure out how to get mediawiki to continue numbering, unfortunately). Assuming that making those changes, stopping, and restarting tomcat5 service doesn't resolve the problem, It might help us help you if you could send the results of the following commands $ ps wuax | grep tomcat $ ls -lR /var/lib/tomcat5 $ ls -lR /var/cache/tomcat5 Thanks, Dan On 11/07/07, Felix Mayerhofer [EMAIL PROTECTED] wrote: -Ursprüngliche Nachricht- Von: Felix Mayerhofer Gesendet: Mittwoch, 11. Juli 2007 13:40 An: 'Graham Triggs' Betreff: AW: [Dspace-tech] newbie question Tomcat5 is running at user tomcat5 Still no change, same error, any other ideas? Thanks for the help! -Ursprüngliche Nachricht- Von: Graham Triggs [mailto:[EMAIL PROTECTED] Gesendet: Mittwoch, 11. Juli 2007 12:37 An: Felix Mayerhofer Cc: dspace-tech@lists.sourceforge.net Betreff: Re: [Dspace-tech] newbie question You changed the user that Tomcat is being launched as, right? ;) If so, then the '/var/lib/tomcat5/work' folder will still have the old tomcat5:root permissions. You will need to 'chown -hR' it for Tomcat to be able to compile the JSP files. G On Wed, 2007-07-11 at 10:44 +0200, Felix Mayerhofer wrote: Hi all! I just got through the turorial Installing Dspace on ubuntu 6.06 Everything worked fine so far but when I try to start the web interface I get an internal server error: HTTP Status 500 - __ typeException report message description The server encountered an internal error () that prevented it from fulfilling this request. exception java.io.FileNotFoundException: /var/lib/tomcat5/work/Catalina/localhost/dspace/org/apache/jsp/index_jsp.java (No such file or directory) java.io.FileOutputStream.open(Native Method) java.io.FileOutputStream.init(FileOutputStream.java:179) java.io.FileOutputStream.init(FileOutputStream.java:70) org.apache.jasper.compiler.Compiler.generateJava(Compiler.java:188) org.apache.jasper.compiler.Compiler.compile(Compiler.java:495) org.apache.jasper.compiler.Compiler.compile(Compiler.java:476) org.apache.jasper.compiler.Compiler.compile(Compiler.java:464
Re: [Dspace-tech] WG: newbie question
Hi Felix: I wrote the Ubuntu install instructions and have confirmed with other people that they work, if followed step by step. If tomcat is running under the tomcat5 user, then you must have missed step 8 (the second step 8; I wasn't able to figure out how to get mediawiki to continue numbering, unfortunately). Assuming that making those changes, stopping, and restarting tomcat5 service doesn't resolve the problem, It might help us help you if you could send the results of the following commands $ ps wuax | grep tomcat $ ls -lR /var/lib/tomcat5 $ ls -lR /var/cache/tomcat5 Thanks, Dan On 11/07/07, Felix Mayerhofer [EMAIL PROTECTED] wrote: -Ursprüngliche Nachricht- Von: Felix Mayerhofer Gesendet: Mittwoch, 11. Juli 2007 13:40 An: 'Graham Triggs' Betreff: AW: [Dspace-tech] newbie question Tomcat5 is running at user tomcat5 Still no change, same error, any other ideas? Thanks for the help! -Ursprüngliche Nachricht- Von: Graham Triggs [mailto:[EMAIL PROTECTED] Gesendet: Mittwoch, 11. Juli 2007 12:37 An: Felix Mayerhofer Cc: dspace-tech@lists.sourceforge.net Betreff: Re: [Dspace-tech] newbie question You changed the user that Tomcat is being launched as, right? ;) If so, then the '/var/lib/tomcat5/work' folder will still have the old tomcat5:root permissions. You will need to 'chown -hR' it for Tomcat to be able to compile the JSP files. G On Wed, 2007-07-11 at 10:44 +0200, Felix Mayerhofer wrote: Hi all! I just got through the turorial Installing Dspace on ubuntu 6.06 Everything worked fine so far but when I try to start the web interface I get an internal server error: HTTP Status 500 - __ typeException report message description The server encountered an internal error () that prevented it from fulfilling this request. exception java.io.FileNotFoundException: /var/lib/tomcat5/work/Catalina/localhost/dspace/org/apache/jsp/index_jsp.java (No such file or directory) java.io.FileOutputStream.open(Native Method) java.io.FileOutputStream.init(FileOutputStream.java:179) java.io.FileOutputStream.init(FileOutputStream.java:70) org.apache.jasper.compiler.Compiler.generateJava(Compiler.java:188) org.apache.jasper.compiler.Compiler.compile(Compiler.java:495) org.apache.jasper.compiler.Compiler.compile(Compiler.java:476) org.apache.jasper.compiler.Compiler.compile(Compiler.java:464) org.apache.jasper.JspCompilationContext.compile(JspCompilationContext.java:511) org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:295) org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:292) org.apache.jasper.servlet.JspServlet.service(JspServlet.java:236) javax.servlet.http.HttpServlet.service(HttpServlet.java:802) note The full stack trace of the root cause is available in the Apache Tomcat/5.0 logs. __ Apache Tomcat/5.0 The directory org/apache/jsp/ doesn't exist on my server, what did I do wrong? Greetz from Austria Felix Mayerhofer - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech This e-mail is confidential and should not be used by anyone who is not the original intended recipient. BioMed Central Limited does not accept liability for any statements made which are clearly the sender's own and not expressly made on behalf of BioMed Central Limited. No contracts may be concluded on behalf of BioMed Central Limited by means of e-mail communication. BioMed Central Limited Registered in England and Wales with registered number 3680030 Registered Office Middlesex House, 34-42 Cleveland Street, London W1T 4LB - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech -- Dan Scott Laurentian University - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2