[Dspace-tech] internal system error
Hi all,

I have a problem when registering a new user: an Internal System Error occurs. I have DSpace 1.5.1 on Windows 2003 Server + Tomcat 5.5.17 + JDK 1.5.0_11. Any ideas?

Thanks,
monic

Mail server log:

An internal server error occurred on http://192.168.25.179:8080/jspui:
Date: 6/12/09 2:33 PM
Session ID: 66BA27F8D140EA4C329B6CA1E6AE554C
-- URL Was: http://192.168.25.179:8080/jspui/register
-- Method: POST
-- Parameters were:
-- password: 123456
-- submit: Register
-- netid: coba
-- email: c...@library.uksw.edu
-- step: 1

Exception:
java.lang.NullPointerException
    at java.util.Hashtable.put(Unknown Source)
    at org.dspace.app.webui.servlet.RegisterServlet.processEnterEmail(RegisterServlet.java:325)
    at org.dspace.app.webui.servlet.RegisterServlet.doDSPost(RegisterServlet.java:202)
    at org.dspace.app.webui.servlet.DSpaceServlet.processRequest(DSpaceServlet.java:147)
    at org.dspace.app.webui.servlet.DSpaceServlet.doPost(DSpaceServlet.java:105)
    at javax.servlet.http.HttpServlet.service(HttpServlet.java:709)
    at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
    at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:252)
    at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:173)
    at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:213)
    at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:178)
    at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:126)
    at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:105)
    at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:107)
    at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:148)
    at org.apache.coyote.http11.Http11AprProcessor.process(Http11AprProcessor.java:833)
    at org.apache.coyote.http11.Http11AprProtocol$Http11ConnectionHandler.process(Http11AprProtocol.java:639)
    at org.apache.tomcat.util.net.AprEndpoint$Worker.run(AprEndpoint.java:1285)
    at java.lang.Thread.run(Unknown Source)

--
Crystal Reports - New Free Runtime and 30 Day Trial
Check out the new simplified licensing option that enables unlimited royalty-free distribution of the report engine for externally facing server and web deployment.
http://p.sf.net/sfu/businessobjects
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] Statistics in Dspace 1.5.2 xmlui version
Dear group,

I need some help with calculating the number of articles in each collection and displaying it next to the collection name on the home page, so that users can easily see whether there is anything in a collection before even clicking on it. For example:

Choose a community to browse its collections.

Academic Services http://spiral.imperial.ac.uk/handle/10044/1/489 [2]
Administrative Services http://spiral.imperial.ac.uk/handle/10044/1/491 [3]
Faculty of Engineering http://spiral.imperial.ac.uk/handle/10044/1/1 [754]
Faculty of Medicine http://spiral.imperial.ac.uk/handle/10044/1/9 [75]
Faculty of Natural Sciences http://spiral.imperial.ac.uk/handle/10044/1/7 [653]
Humanities http://spiral.imperial.ac.uk/handle/10044/1/1197 [17]
Imperial College Business School http://spiral.imperial.ac.uk/handle/10044/1/643 [1216]

Thank you.

Kind regards,
Maitumelo Morupisi
University of Botswana
Re: [Dspace-tech] Statistics in Dspace 1.5.2 xmlui version
In your dspace.cfg, find the line:

webui.strengths.show = false

Change it to true. The item count will then be displayed.

Allen.
HKU Hub Administrator, http://hub.hku.hk
[Dspace-tech] Can we change the layout positions of dspace in jspui
Hi,

Is it possible to change the layout positions in the JSPUI? I want to have the navigation bar on the right-hand side, with the side bar below it. Is that possible?

--
Regards,
Yureshwar Ravuri.
Re: [Dspace-tech] Change Default Layout IN JSPUI
Hi Yureshwar,

It is possible to move the navigation bar to the right-hand side and put the side bar under it. To do that you need to change the layout of the pages:

1. Modify header-default.jsp: comment out the code where the navigation bar is displayed.
2. Set a request variable in header-default.jsp:
   request.setAttribute("navBar", request.getAttribute("dspace.layout.navbar"));
3. Read it back in footer-default.jsp.
4. Then, in footer-default.jsp, add the navigation bar code inside an if condition, just below <%-- Right-hand side bar if appropriate --%>

Best Regards,
Hardik Mishra
Jr. Software Engineer
WebInito Technologies - Ahmedabad - Gujarat (India)
Give a man a fish, and you feed him for a day. Teach a man to catch fish and you feed him for a lifetime.

From: Yureshwar Dspace dspace.yuresh...@gmail.com
To: hardik mishra hardik_h...@yahoo.com
Cc: dspace-tech@lists.sourceforge.net
Sent: Wednesday, June 17, 2009 11:49:06 AM
Subject: Re: Change Default Layout Text IN JSPUI

Thanks, hardik mishra. It really worked. I have one more question: how can I move the navigation bar to the right-hand side, with the side bar under it? Can we change the positions?

On Wed, Jun 17, 2009 at 10:52 AM, hardik mishra hardik_h...@yahoo.com wrote:

| Hi Yureshwar,
|
| Follow these steps:
|
| Go to your DSpace source directory:
| C:\dspace-1.5.2-src-release\dspace-api\src\main\resources\Messages.properties
| Search for jsp.layout.header-default.alt and change it to whatever text you want.
| Then recompile the code with mvn package, copy the new jar file which is created under C:\dspace-1.5.2-src-release\dspace-api\target\dspace-api-1.5.2 to the target directory, and then run with Ant.
|
| Hope this will be helpful.
|
| Best Regards,
| Hardik Mishra

--
Regards,
Yureshwar Ravuri.
[Dspace-tech] Statistics in Dspace 1.5.2 xmlui version
Thank you, but I have set it to true and restarted Tomcat and the counts still do not show. There is this comment in dspace.cfg, which talks about creating a theme for the XMLUI:

# whether to display collection and community strengths
# (This configuration is not used by XMLUI. To show strengths in the
# XMLUI, you just need to create a theme which displays them)
webui.strengths.show = true
Re: [Dspace-tech] adding single item in more than one community
Hi Shali,

An item is owned by one collection, but it is possible to mirror an item to other collections. This is done with the item mapper. An item mapped in this way will be listed within the collection it is mapped to and within that collection's ancestors.

Hope that helps,
Claudia Jürgen

shali Kallumpuram wrote:

| Dear sir,
|
| I am a DSpace user. I need to upload one item to two communities. Is there any method for this?
|
| Thanking you,
| shali
[Dspace-tech] worrying about usage event reporting when the number of articles is very large
I am working on a proprietary digital library whilst at the same time considering how DSpace might have been used to solve the same problems (it won't be, but that's another story). When I consider usage event reporting, there are some concerns that arise when the number of articles and visitors is very large. The current site has around 6 million articles and roughly 20 million hits per day. With these sorts of volumes, weblogs are IMHO not the way to go. There are also problems of scale when using event logs. Basically, a file-based approach is only suitable for small volumes of data.

I considered using an RDBMS, and this does get you further, but unfortunately not far enough. An RDBMS can cope with millions of rows but starts to struggle when you reach tens of millions or hundreds of millions.

Let's do some maths. In these calculations there is a requirement to produce year-to-date (YTD) figures (this is a requirement of COUNTER). I will assume that an RDBMS will calculate the YTD rather than store a running total when the current month is processed. This means that figures for 12 months need to be retained. Of those 20 million hits, some will be for the same article(s). So after article-level aggregation has been performed there will be a maximum of 6 million rows for one day. This is 180 million for one month, 2,160 million for 12 months. Now 2 billion rows seems a bit on the large side to me :-)

One way around this would be to have a table for each month. Thus a table might have to cope with 180 million rows, which is manageable, even though it is large. But to calculate the YTD figures one would need to do a 12-table join. That's a bit unwieldy.

I am beginning to wonder if a different technology would provide better scalability. I have in mind the open source column store, LucidDB. What do people think?

--
Regards,
Andrew M.
http://www.andrewpetermarlow.co.uk
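The row counts in the message above can be checked with a few lines of arithmetic. This is only a back-of-envelope sketch: the class and method names are hypothetical, and the 30-day month and the worst case of every article being hit every day are the assumptions the message itself makes.

```java
// Back-of-envelope check of the row counts quoted in the message.
// Assumptions taken from the message: at most 6 million article rows per day
// after aggregation, 30 days per month, 12 months retained for YTD figures.
final class UsageRowEstimate {
    static final long ARTICLE_ROWS_PER_DAY = 6_000_000L;
    static final int DAYS_PER_MONTH = 30;
    static final int MONTHS_RETAINED = 12;

    /** Worst-case rows in one monthly table. */
    static long rowsPerMonth() {
        return ARTICLE_ROWS_PER_DAY * DAYS_PER_MONTH;
    }

    /** Worst-case rows across the 12 months needed for YTD figures. */
    static long rowsPerYear() {
        return rowsPerMonth() * MONTHS_RETAINED;
    }

    public static void main(String[] args) {
        System.out.println("rows per month: " + rowsPerMonth());
        System.out.println("rows per year:  " + rowsPerYear());
        // The yearly figure no longer fits in a 32-bit int, hence long above.
        System.out.println("fits in int:    " + (rowsPerYear() <= Integer.MAX_VALUE));
    }
}
```

One practical consequence of the yearly figure is that a 32-bit row counter or surrogate key would overflow within a year under these assumptions.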
Re: [Dspace-tech] worrying about usage event reporting when the number of articles is very large
On Wed, Jun 17, 2009 at 03:35:14PM +0100, Andrew Marlow wrote:

| I am working on a proprietary digital library whilst at the same time considering how dspace might have been used to solve the same problems (it won't be, but that's another story). When I consider usage event reporting there are some concerns that arise when the number of articles and visitors is very large. The current site has around 6 million articles and roughly 20 million hits per day. With these sorts of volumes, weblogs are IMHO not the way to go. Also there are problems of scale when using event logs. Basically a file-based approach is only suitable for small volumes of data.

Absolutely. The event sink classes that come with DSpace are samples, not meant for serious production use. The class we use here is built around an RDBMS. (But we haven't even dreamed of 20 million hits/day.)

| I considered using a RDBMS and this does get you further but unfortunately, not far enough. A RDBMS can cope with millions of rows but starts to struggle when you reach tens of millions or hundreds of millions. Let's do some maths. In these calculations, there is a requirement to produce year to date (YTD) figures (this is a requirement of COUNTER). I will assume that a RDBMS system will calculate the YTD, rather than store a running total when the current month is processed. This means that figures for 12 months need to be retained. Of those 20 million hits, some will be for the same article(s). So after article level aggregation has been performed there will be a maximum of 6 million rows for one day. This is 180 million for one month, 2,160 million for 12 months. Now 2 billion rows seems a bit on the large side to me :-)

OLTP systems cope with billions of rows/day giving reasonable performance, but your requirements may be quite different. They probably journal that stuff immediately and post it into tables later, like the guy with the green eyeshade used to do on paper.

| One way around this would be to have a table for each month. Thus a table might have to cope with 180 million rows, which is manageable, even though it is large. But in calculating the YTD figures one would need to do a 12 table join. That's a bit unwieldy.

How quickly do you have to do those calculations? I've been given to understand that another requirement of COUNTER is auditing, so we are not talking subsecond response times here. :-)

Once a body of data (say, last month, or even yesterday) can be considered static, it can be rolled up and the sums dropped into much smaller summary tables for quick grand-totalling. The rollup jobs can run at low priority whenever it's convenient. Six million rows plus up to eleven plus up to thirty doesn't sound quite so daunting. In capturing high-volume data you need to winkle out every opportunity to aggregate.

Have you benchmarked totalling a 2-giga-row column? How long did it take to sum it? How long did it take to add the final row? What did your system load monitoring tools show? If you include a column of texts randomly selected from a fixed set, how well does it perform on subsets of realistic size? Are you better off indexing this title column (and paying the price for index maintenance with each INSERT) or not (and doing sequential scans instead of index scans)? Does it help to place the titles in a moderate-sized lookaside table, look them up, and use the corresponding serial number in detail rows? Can you improve performance with preallocated storage? Can you win by splitting tables up by columns (not forgetting their indices) and putting them in multiple tablespaces on multiple drives? Do you need to move indices to an SSD? (PostgreSQL can't split single tables across tablespaces, but can place indices and tables individually.) Do you really need to go this far?

In high-performance systems you really have to try different ideas and compare their real-world performance; theory gets you only so far.

--
Mark H. Wood, Lead System Programmer mw...@iupui.edu
Friends don't let friends publish revisable-form documents.
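The rollup idea in the reply above can be illustrated with a small in-memory sketch. It is not DSpace code and not the event sink used at IUPUI: the class `UsageRollup` and its methods are hypothetical, and a `Map` stands in for the summary table that a real system would keep in the database.

```java
import java.time.LocalDate;
import java.time.YearMonth;
import java.util.HashMap;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch: fold per-day article counts into per-month summary
// rows, so a year-to-date total reads at most 12 small rows per article.
final class UsageRollup {
    // Stand-in for a summary table keyed by (article, month).
    private final Map<String, Map<YearMonth, Long>> monthly = new HashMap<>();

    /** Adds one day's aggregated hit count for an article to its month. */
    void addDailyCount(String articleId, LocalDate day, long hits) {
        monthly.computeIfAbsent(articleId, k -> new TreeMap<>())
               .merge(YearMonth.from(day), hits, Long::sum);
    }

    /** Sums the monthly rows of the given year up to and including asOf. */
    long yearToDate(String articleId, YearMonth asOf) {
        Map<YearMonth, Long> rows = monthly.get(articleId);
        if (rows == null) {
            return 0L;
        }
        long total = 0L;
        for (Map.Entry<YearMonth, Long> row : rows.entrySet()) {
            YearMonth month = row.getKey();
            if (month.getYear() == asOf.getYear() && !month.isAfter(asOf)) {
                total += row.getValue();
            }
        }
        return total;
    }

    public static void main(String[] args) {
        UsageRollup rollup = new UsageRollup();
        rollup.addDailyCount("article-1", LocalDate.of(2009, 5, 31), 4);
        rollup.addDailyCount("article-1", LocalDate.of(2009, 6, 1), 5);
        System.out.println(rollup.yearToDate("article-1", YearMonth.of(2009, 6)));
    }
}
```

With summaries like these, the year-to-date query no longer touches the detail rows at all, which is the point of running the rollup as a low-priority batch job once a period's data is static.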
[Dspace-tech] Export/import of noncontent objects (Epersons, etc.)?
We're working with a partner who wants to keep a separate test instance with content tracking the (sizable) live repository fairly closely. The requirement that I've been given is to entirely replace the content from live every week or two. Deleting 17,000 items (and over 20,000 bitstreams) is an all-day operation, and then comes the loading phase.

It would save a lot of time if I could export the Community/Collection structure, EPerson and Group objects, registries, and anything else that's not an Item, Bundle, or Bitstream; drop and recreate the database; empty the assetstore and history; reload the noncontent tables; and then begin loading.

So I'm looking at adding export/import for all of those objects, probably to XML. In the case of Community and Collection I guess the best thing would be to just do a single exporter producing the same XML dialect consumed by the existing Community and Collection Structure Importer. Likewise for the registries, it seems. The other classes would need importers built as well as exporters.

Comments? Or is there a smarter way to make a consistent clone of a DSpace instance, with its own Handles, that is writable but doesn't affect the original? (The Handle business, plus the need to quiesce the production site to ensure consistency across database and assetstore, is why I don't just use tar and pg_dump.)

--
Mark H. Wood, Lead System Programmer mw...@iupui.edu
Friends don't let friends publish revisable-form documents.
Re: [Dspace-tech] internal system error
Hi Monic,

Have you got LDAP enabled in dspace.cfg (ldap.enable = true)? If so, what is your setting for 'ldap_provider_url' in dspace.cfg?

The error message suggests that LDAP is enabled, but you have not set your LDAP provider URL.

Thanks,

Stuart Lewis
Digital Services Programmer
Te Tumu Herenga The University of Auckland Library
Auckland Mail Centre, Private Bag 92019, Auckland 1142, New Zealand
Ph: 64 9 373-7599 x81928
http://www.library.auckland.ac.nz/
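The stack trace fails inside `java.util.Hashtable.put`, and a `Hashtable` rejects null keys and null values. The sketch below reproduces that behaviour in isolation and shows one defensive check. It is not the DSpace source: the class, the map key and the `requireSetting` helper are hypothetical, and the assumption is simply that an unset configuration value reaches the table as null.

```java
import java.util.Hashtable;

// Hypothetical sketch, not the RegisterServlet code: shows why an unset
// configuration value can surface as a NullPointerException in Hashtable.put.
final class NullProviderUrlDemo {

    /** Returns true if storing the value in a Hashtable throws NullPointerException. */
    static boolean putThrowsNpe(String value) {
        Hashtable<String, String> env = new Hashtable<>();
        try {
            env.put("provider.url", value); // illustrative key only
            return false;
        } catch (NullPointerException e) {
            return true;
        }
    }

    /** Defensive variant: fail early with a message that names the missing setting. */
    static String requireSetting(String name, String value) {
        if (value == null || value.trim().isEmpty()) {
            throw new IllegalStateException("Missing configuration value: " + name);
        }
        return value;
    }

    public static void main(String[] args) {
        System.out.println("null value throws NPE: " + putThrowsNpe(null));
        System.out.println("set value throws NPE:  " + putThrowsNpe("ldap://ldap.example.invalid"));
    }
}
```

A check like `requireSetting` turns the bare NullPointerException into an error that points at the configuration, which is usually quicker to diagnose than a stack trace.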
Re: [Dspace-tech] DSpace search weirdness
Hi Mika,

To find the path of a file in the assetstore, find your item in the item table, then follow item2bundle, and then bundle2bitstream. Once you have identified the bitstream, look for its 'internal_id'. It will be something like:

98734354183028877080733439721306881639

A file by that name will exist in [dspace]/assetstore/98/73/43/ (the first three pairs of digits give the directories).

Just out of interest: have you run the checksum checker to ensure the files copied perfectly? And have you tried setting your JVM file encoding to UTF-8?

Thanks,

Stuart Lewis
Digital Services Programmer
Te Tumu Herenga The University of Auckland Library
Auckland Mail Centre, Private Bag 92019, Auckland 1142, New Zealand
Ph: 64 9 373-7599 x81928
http://www.library.auckland.ac.nz/

-----Original Message-----
From: mikan.d.dspace listmail [mailto:mikan.dsp...@gmail.com]
Sent: Wednesday, 17 June 2009 7:49 p.m.
To: bill.ander...@library.gatech.edu
Cc: Dspace Tech; Stuart Lewis
Subject: Re: [Dspace-tech] DSpace search weirdness

Hi Bill,

You have got it right. I cloned the new DSpace instance roughly by:

1. First doing a fresh install from the DSpace source.
2. Importing a database dump from the other server (taken with pg_dump; I also tried pg_restore, btw).
3. Creating assetstore.tar.gz on my old server and copying it to the new server.

When I run filter-media or filter-media --force, the extracted text doesn't get the special characters (say ä, ö, å) right, but has a '?' mark instead of them. On my original server everything works fine. And on my new server, new submissions work fine after filter-media.

I just re-ran filter-media -f and no error messages came up. Maybe I should dig into the assetstore to see what the files look like from the command line? How could I find out the assetstore path for a specific item?

Thanks,
Mika

2009/6/16 bill.ander...@library.gatech.edu:

| Correct me if I don't have this right: you had an existing instance of dspace, where search worked properly. You cloned the instance to a new server, and after the transfer, media filter wasn't able to extract full text properly from PDFs with special characters in them. When you re-submit the PDFs to the new instance, media filter (and thus search) works as it should?
|
| It's possible the PDFs were damaged in the transfer. How did you transfer them? I assume you're not seeing any errors in the media filter output, right?
|
| Cheers,
| Bill
|
| Bill Anderson
| Software Developer
| Digital Library Development
| Georgia Tech Library
|
| - mikan.d.dspace listmail mikan.dsp...@gmail.com wrote:
|
| Hi Stuart,
|
| As I mentioned in my earlier post, running filter-media with the --force (-f) switch didn't fix the problem.
|
| -Mika
|
| 2009/6/16 Stuart Lewis s.le...@auckland.ac.nz:
|
| Hi Mika,
|
| Since running filter-media on new items seems OK, have you tried running:
|
| [dspace]/bin/filter-media -f
|
| -f forces all the bitstreams to be re-filtered.
|
| Thanks,
|
| Stuart Lewis
| Digital Services Programmer
| Te Tumu Herenga The University of Auckland Library
| Auckland Mail Centre, Private Bag 92019, Auckland 1142, New Zealand
| Ph: 64 9 373-7599 x81928
| http://www.library.auckland.ac.nz/
|
| -----Original Message-----
| From: mikan.d.dspace listmail [mailto:mikan.dsp...@gmail.com]
| Sent: Tuesday, 16 June 2009 1:05 a.m.
| To: Terrance Davis
| Cc: Dspace Tech
| Subject: Re: [Dspace-tech] DSpace search weirdness
|
| Nope. Server 1 has Debian 5 with Java version 1.6.0_12, and server 2 has RHEL and Java version 1.5.0_18. Could this cause the problem?
|
| Another strange thing I noticed is that if I re-submit the entire item file and then run filter-media, the text is extracted correctly. So, to me it seems that the old data in the transferred assetstore is handled incorrectly. Strange, eh?
|
| -Mika
|
| 2009/6/15 Terrance Davis terrance.da...@utah.edu:
|
| Hi Mika,
|
| Are both systems using the same OS version and the same version of Java?
|
| Best regards,
|
| Terrance
|
| --
| Web Applications Programmer
| Institute for Clean and Secure Energy
| University of Utah
| http://www.ices.utah.edu
|
| On Jun 15, 2009, at 2:01 AM, mikan.d.dspace listmail wrote:
|
| Hi Terrance,
|
| I double-checked the indexes in the configuration and they do match. What I noticed, though, is that the text extracted from PDF files differs, which might be the cause of this problem. It seems that when filter-media extracts the text on the other server, it messes up some special characters, thus making them unsearchable. What might be causing this? Both databases were set to UNICODE when created. Is there some other system setting that might be causing this?
|
| Example of extracted text is below:
|
| Server 1: (correct encoding)
| 3. PUNAISEN
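Two points from Stuart's reply at the top of this thread can be sketched in a few lines: deriving the assetstore location from a bitstream's internal_id, and how characters such as ä, ö and å turn into '?' when text passes through a charset that cannot represent them. This is an illustrative sketch only. The class and method names are hypothetical, the path rule is the one described in the reply (first three pairs of digits), and whether a JVM default encoding is actually the cause of the problem on the new server is an open question in the thread, not something the code establishes.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of DSpace.
final class AssetstoreSketch {

    /**
     * Builds the path below [dspace]/assetstore for an internal_id:
     * the first three pairs of digits are directories, the full id is the file name.
     */
    static String relativePath(String internalId) {
        if (internalId == null || internalId.length() < 6) {
            throw new IllegalArgumentException("internal_id too short: " + internalId);
        }
        return internalId.substring(0, 2) + "/"
             + internalId.substring(2, 4) + "/"
             + internalId.substring(4, 6) + "/"
             + internalId;
    }

    /** Encodes and decodes text with one charset; unmappable characters are replaced. */
    static String roundTrip(String text, Charset charset) {
        return new String(text.getBytes(charset), charset);
    }

    public static void main(String[] args) {
        System.out.println(relativePath("98734354183028877080733439721306881639"));

        // Unicode escapes for a-umlaut, o-umlaut, a-ring keep this source file ASCII-only.
        String special = "\u00e4\u00f6\u00e5";
        System.out.println(roundTrip(special, StandardCharsets.UTF_8).equals(special));
        System.out.println(roundTrip(special, StandardCharsets.US_ASCII));
        System.out.println("JVM default charset: " + Charset.defaultCharset());
    }
}
```

Printing `Charset.defaultCharset()` on both servers is a cheap way to compare their effective encodings before changing any JVM options.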
[Dspace-tech] XMLUI controlled vocabulary
Another request I have is to use Controlled Vocabulary. I found some good instructions for the JSPUI, but we are using the XMLUI. Am I out of luck here? Does the XMLUI use programs from the JSPUI as well?
Re: [Dspace-tech] Export/import of noncontent objects (Epersons, etc.)?
I recommend looking at the old AIP prototype (see http://wiki.dspace.org/index.php/AipPrototype for doc and fossilized code) for a start on this. It extended the METS implementation to cover Communities, Collections, and every aspect of Items, including most administrative metadata. Although it stopped short of fully representing the EPerson, Group and Policy objects, those would be straightforward to add. It does show some of the issues involved in building a copy of an archive from scratch.

Migrating and mirroring content between repositories was one of the use cases for AIPs. *Every* existing interchange mechanism (batch import, packager, etc.) loses *some* details of the Item and its child objects. Only the AIP was complete, and iirc it also had a little bug or two (e.g. bitstream sequence IDs weren't always restored perfectly).

Good luck!

-- Larry