[Dspace-tech] How does wildcard behave in Discovery search filters?
Hi,

I was looking at our Discovery sidebar facet configuration and I noticed we have items using both dc.type as well as dc.type.output fields. Now I'm wondering how the following search filter configuration would behave:

    <value>dc.type.*</value>

Would that capture both dc.type as well as dc.type.output, or only metadata values under the dc.type.* namespace? I'm from a computer science background rather than library science, so the answer is not obvious to me in this context!

In any case, it is my long-term plan to move away from dc.type.output, as it seems we added it in error, and dc.type is standard Dublin Core so that should have preference.

Thanks,
Alan

--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
Re: [Dspace-tech] How does wildcard behave in Discovery search filters?
Ah, indeed! I saw that dc.subject.* in the default discovery.xml but didn't think to test it. Thanks, Bram. :)

Alan

On Thu, Aug 20, 2015 at 3:01 PM Bram Luyten b...@atmire.com wrote:

Hi,

the wildcard filter is already present in the default configuration for dc.subject.*:
https://github.com/DSpace/DSpace/blob/master/dspace/config/spring/api/discovery.xml#L389

Just did a quick test on demo.dspace.org to see how it behaves:
http://demo.dspace.org/xmlui/handle/10673/279

Most terms are in dc.subject (without qualifier) and I added MyWeirdSubject in dc.subject.other. They all show up in the sidebar facet.

rgds
Bram
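For anyone else coming from a programming background, the behaviour Bram observed can be written down as a small predicate. This is only a sketch of the observed result (dc.subject.* picked up both dc.subject and dc.subject.other); the function name is hypothetical and it is not DSpace's actual matching code.

```python
def wildcard_field_matches(pattern, field):
    """Return True if `field` would be covered by a Discovery-style pattern.

    Assumption (from the demo.dspace.org test in this thread): a trailing
    ".*" covers the unqualified field as well as any qualifier under it.
    """
    if pattern.endswith(".*"):
        base = pattern[:-2]  # "dc.type.*" -> "dc.type"
        return field == base or field.startswith(base + ".")
    # Without a wildcard, only the exact field name matches.
    return field == pattern


for field in ("dc.type", "dc.type.output", "dc.typeface", "dc.subject.other"):
    print(field, wildcard_field_matches("dc.type.*", field))
```

Under that assumption, dc.type.* would cover both dc.type and dc.type.output, which is what the original question was asking.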
Re: [Dspace-tech] How to update Discovery index for a specific handle?
Thanks for the responses, both. Actually I think I've just stumbled on a good strategy after considering your approaches and reading the linked discussions.

For reference, in my case I was deleting a metadatavalue record with an invalid dc.date.issued value (here s.f. is Spanish for sin fecha / no date, which of course isn't ISO 8601!). In PostgreSQL:

    # select * from metadatavalue where text_value='s.f.';
    # delete from metadatavalue where text_value='s.f.' and resource_id='57762';
    # select handle from item, handle where handle.resource_id = item.item_id AND item.item_id = '57762';

Then removing the item from the index (using the handle from above) and starting a regular index update:

    $ [dspace]/bin/dspace index-discovery -r 10568/56338
    $ [dspace]/bin/dspace index-discovery

The DSpace logs confirm that the item was removed and then added back to the index, and that could work for a collection or community if I needed it. This is a much more acceptable strategy than waiting a few hours for a full re-index, and now I understand the SQL / Solr relationship better. :)

Thanks for the discussion!

Alan

On Mon, Aug 17, 2015 at 10:27 PM Terry Brady terry.br...@georgetown.edu wrote:

Alan,

If your changes can be scoped to a specific community or collection, we have a process that will re-index a collection/community. We present the user with a form that allows them to select either a community or a collection. We end up with 2 parameters:

- comm or coll
- database id for the community or collection

The following script deletes the discovery entries for that community or collection. It then runs index-discovery, which rebuilds the missing entries.

- https://github.com/Georgetown-University-Libraries/batch-tools/blob/master/bin-src/dspaceBatch.sh#L282-L291

Background thread: http://sourceforge.net/p/dspace/mailman/message/31838611/

Terry

--
Terry Brady
Applications Programmer Analyst
Georgetown University Library Information Technology
https://www.library.georgetown.edu/lit/code
425-298-5498 (Seattle, WA)
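The remove-then-update strategy from this thread can be sketched as a small helper that builds the two commands in order. The install path /opt/dspace and the function name are assumptions for illustration; the `index-discovery -r <handle>` and plain `index-discovery` invocations are the ones quoted above. The sketch only prints the commands, it does not run them.

```python
import shlex


def reindex_commands(dspace_dir, handle):
    """Build the two commands used above to refresh one item in Discovery.

    Assumption: `dspace_dir` is the [dspace] install directory.
    """
    dspace = f"{dspace_dir}/bin/dspace"
    return [
        # 1. Remove the item (by handle) from the Discovery index.
        [dspace, "index-discovery", "-r", handle],
        # 2. Regular update, which adds back whatever is missing.
        [dspace, "index-discovery"],
    ]


for cmd in reindex_commands("/opt/dspace", "10568/56338"):
    print(shlex.join(cmd))
```

To actually run them one could pass each list to `subprocess.run(cmd, check=True)` as the DSpace user, ideally after trying it on a test instance first.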
[Dspace-tech] How to update Discovery index for a specific handle?
Hi,

When I edit items in the web user interface, either by batch CSV upload or one by one in Edit this Item, DSpace updates the Discovery indexes for the item(s) in question. Sometimes I have to make changes in SQL directly, like to delete empty metadatavalues, and as far as I know the only way to capture those changes in the Solr indexes is to perform an `index-discovery -b`, but that takes *hours* for our repository of 50,000 items (despite having an SSD and plenty of RAM on our Linode-powered VPS!).

Is there a way to force the updating of a specific handle's Solr indexes?

Thanks,
Alan
[Dspace-tech] Comparison of DSpace 4 and DSpace 5 PDF thumbnails
Hey, all.

I was just doing some testing of DSpace 5's improved, ImageMagick-powered thumbnailing functionality. Thumbnails are often used in tweets, blog posts, services integrated with our repository, etc, so I wanted to see how they looked compared to DSpace 4's Xpdf-based thumbnails.

I'm sharing my results, just in case people are curious:
https://github.com/ilri/DSpace/issues/128

I tried to pick PDFs heavy in images, text, and others that create interesting issues in thumbnails. Does anyone else have any interesting results to share? We're currently using DSpace 5.1, but it doesn't look like there were any thumbnail-related issues in 5.2 or 5.3.

Regards,

--
Alan Orth
alan.o...@gmail.com
https://alaninkenya.org
https://mjanja.ch
In heaven all the interesting people are missing. -Friedrich Nietzsche
GPG public key ID: 0x8cb0d0acb5cd81ec209c6cdfbd1a0e09c2f836c0
Re: [Dspace-tech] ant update while Tomcat running
For the record, I'm a bit superstitious about this. I'm using webapp folders as opposed to WARs and I always stop Tomcat during `ant update`. :)

Alan

On Wed, Jul 29, 2015 at 11:17 PM, Brown, Jacob j.h.br...@tcu.edu wrote:

In what situations, if ever, is it safe to run `ant update` while Tomcat is running?

I’ve mapped my webapps for Tomcat using “Technique A” (defining a context that points to [dspace]/webapps/[app]). In a development environment, I’ve played around with running `ant update` while Tomcat is running. Most of the time, Tomcat seems to reload the web application appropriately and continues to serve requests. Occasionally I will get errors or a blank page after doing so, which continue until I restart Tomcat (which resolves the issue).

Would using WARs make this safe? That is, if I added the “-Dwars=true” option to `ant update` and updated my Tomcat contexts to point to [dspace]/webapps/[app].war instead, would it be okay to update my webapps while Tomcat is running? Or is there other stuff going on in `ant update` that is potentially going to break Tomcat?

Thanks,
Jacob
Re: [Dspace-tech] How to find item with strange metadata value of |||
Bingo, I found the item, and then got its handle using the item_id:

    dspace=# SELECT * FROM item, handle WHERE handle.resource_id = item.item_id AND item.item_id = '66841';

The item did indeed have empty contents for that field. I deleted it from the Edit this item function in the web interface and now my facets are clean again. ;)

Thanks a bunch,
Alan

On Sun, Aug 2, 2015 at 5:24 AM Kim Shepherd kim.sheph...@gmail.com wrote:

Hi Alan,

that's close, but not quite what I was suggesting: the ||| will not be stored in the database, that's added in when indexing the values to Solr, so I think a SQL query like:

    select item_id from metadatavalue where metadata_field_id=86 and (text_value='' OR text_value IS NULL);

will be closer to what you're after.

If you want to try and find items that might match in Solr, you could try something like:

    curl 'http://localhost:8080/solr/search/select?q=*:*&fq=FIELDNAME_filter:|||'

where FIELDNAME is the name of the field as configured in the discovery spring configuration (e.g. 'subject', 'type', 'author', and so on).

Cheers!
Kim

M: k...@shepherd.nz
T: @kimshepherd
P: +6421883635
0CCB D957 0C35 F5C1 497E CDCF FC4B ABA3 2A1A FAEC
https://keybase.io/kshepherd
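Since the ||| characters are awkward to type safely in a shell, Kim's Solr lookup can also be sketched as a URL builder that takes care of the encoding. The function name is hypothetical, the Solr location and the FIELDNAME_filter naming come from the message above, and wrapping the value in double quotes is my own assumption (|| is also an operator in Solr's query syntax), so treat this as a starting point rather than a verified query.

```python
from urllib.parse import urlencode


def solr_filter_url(solr_core_url, field, value):
    """Build a select URL that filters on a Discovery `<field>_filter` value.

    Assumptions: `solr_core_url` points at the DSpace "search" core, and the
    value is passed as a quoted phrase so that "|||" is not parsed as
    operators.
    """
    params = {
        "q": "*:*",                          # match every document...
        "fq": f'{field}_filter:"{value}"',   # ...then narrow by the filter field
    }
    return f"{solr_core_url}/select?{urlencode(params)}"


url = solr_filter_url("http://localhost:8080/solr/search", "subject", "|||")
print(url)
```

The printed URL can be pasted into curl or a browser on the DSpace host; swap "subject" for whichever facet shows the stray value.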
Re: [Dspace-tech] How to find item with strange metadata value of |||
So I found the metadata_field_id for this metadata item using a known valid value:

    dspace=# select * from metadatavalue where text_value='DRYLAND CEREALS';

... and then I tried to find text values containing several variations of |||, like '\|\|\|', '%|%', '%\|%' etc...

    dspace=# select item_id from metadatavalue where metadata_field_id=86 and text_value='|||';

But I am not getting any hits. Since we figured that CSV export by its nature wouldn't expose this anomaly, I also thought I could simply export and re-import, but DSpace said no changes were detected upon re-import. Hmmm.

Anyone know how I could query Solr directly for this information?

Alan
Re: [Dspace-tech] How to find item with strange metadata value of |||
Thanks for the interesting suggestion, Kim! This is happening in DSpace 4.3. I will try to poke around in the database, as you're probably right that this anomaly wouldn't make it through a CSV export...

Alan

On Thu, Jul 30, 2015 at 1:44 PM Kim Shepherd kim.sheph...@gmail.com wrote:

Hi Alan,

Discovery filters use ||| as a separator between a lowercase value and a display/stored value, e.g. something|||SomeThing

I'm sort of surprised it displays in the sidebar, because acting as the separator, it should be pulled out, but I think the first thing I'd look for is empty or null metadata values in your database (probably easier than CSV because of how metadata export works), for the field in question.

What version is this happening in?

M: k...@shepherd.nz
T: @kimshepherd
P: +6421883635
0CCB D957 0C35 F5C1 497E CDCF FC4B ABA3 2A1A FAEC
https://keybase.io/kshepherd
[Dspace-tech] How to find item with strange metadata value of |||
Hi,

I've got a strange metadata value shown in one of our XMLUI Discovery sidebar facets, and I'm not sure how to find the offending item to fix it. The value is displaying as |||, and I've narrowed down the collection the item belongs to by navigating through my communities and looking at the facet in question. I've exported the metadata as CSV and attempted to find the item using LibreOffice and OpenRefine but nothing strange sticks out...

Does anyone have any suggestions?

Thanks,
Alan
Re: [Dspace-tech] Ciphers in server.xml
Hey, David.

First, don't worry: TLS cipher suites aren't sensitive; any client with a web browser can enumerate them anyway. Here are the cipher suites we use on the web server that does TLS termination in front of our DSpace server:
https://github.com/ilri/rmg-ansible-public/blob/master/group_vars/dspace#L22

Furthermore, those ciphers come straight from Mozilla's excellent TLS Server Security wiki:
https://wiki.mozilla.org/Security/Server_Side_TLS

I'd recommend using the moderate TLS cipher suite from Mozilla's guide. This should appease your security people.

Alan

On Wed, Jul 15, 2015 at 9:59 PM, Hilton Gibson hilton.gib...@gmail.com wrote:

Hi David,

Perhaps this will help:
http://wiki.lib.sun.ac.za/index.php/SUNScholar/Secure_Internet_Connections/S04#SSL_Ciphers_Notes

Cheers
hg

Hilton Gibson
Ubuntu Linux Systems Administrator
Stellenbosch University Library
http://staff.lib.sun.ac.za/~hgibson/docs/cv/cv.html

On 15 July 2015 at 20:54, Schuster, David dschus...@mail.twu.edu wrote:

At the request of our University security team we added several new ciphers to our DSpace Tomcat7 setup yesterday, and now today I received a notification that Google was not able to crawl our site.

I don’t understand this part of security, so are there any suggestions or comments about this? Should I remove them? I hesitate to email the list of ciphers we implemented as I don’t understand that part of networking.

David Schuster
Texas Woman's University
Director of Library Information Technology
Technical Support
Phone: 940-898-3909
dschus...@twu.edu
Re: [Dspace-tech] Vagrant or Docker
My opinion: deploy on a dedicated machine environment (hardware or software). Containers are cool, but I still haven't figured out how they fit into my workflow. Even for simple testing, a VM is an easier concept to grasp, rather than figuring out all the hipster devops Docker jargon, and THEN struggling with the DSpace learning curve. I guess that means I'd recommend Vagrant, because it's more likely to reflect what you'll be running DSpace on in production.

On a related note, I've deployed enough DSpace instances that I automated most of the process using a 'dspace' role in the Ansible playbooks for our infrastructure (see: roles - dspace - tasks - main.yml):
https://github.com/ilri/rmg-ansible-public

It's not turnkey, but gets most of the monotonous setup done (package dependencies, Tomcat templates, git clone of DSpace repo, web server setup, TLS cipher suite configuration, etc).

Alan

On Wed, Jun 24, 2015 at 11:13 AM Hilton Gibson hilton.gib...@gmail.com wrote:

Hi All,

All this talk of automated service provisioning reminded me to ask the list if anyone has considered a juju charm? See: https://jujucharms.com

It would be great if you could:

    # juju dspace

And then all the Linux stuff is done automatically; all that would remain is customisation via the new planned admin GUI.

Cheers
hg

Hilton Gibson
Ubuntu Linux Systems Administrator
Stellenbosch University Library
http://staff.lib.sun.ac.za/~hgibson/docs/cv/cv.html

On 24 June 2015 at 09:43, helix84 heli...@centrum.sk wrote:

Hi Monika,

this will eventually boil down to a generic discussion of VMs vs. containers, which you can look up anywhere. Let me just point out one important consideration: security.

With Vagrant, you get an official way of creating a VM using scripts made by DSpace committers, whom you already trust if you use DSpace. With Docker, one of the advantages is being able to use the Docker Hub Registry of images, which is just code from random people on the internet. The question is, do you want to use these, even in a development environment? Of course, you have the option of creating the whole container from scratch including the OS. You'll lose some of the benefits of Docker this way, but you'll have more certainty that you're not running any malicious code.

The second security consideration is that of container isolation in Linux, which is not particularly strong, but this matters only if you run multiple containers per machine.

I also talked about this briefly with Hardy and he pointed out one benefit of Vagrant: vagrant-share.

Regards,
~~helix84

Compulsory reading: DSpace Mailing List Etiquette
https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
Re: [Dspace-tech] How to search items by metadata using REST API?
That's interesting, Terry. I think I'll just wait until DSpace 5.0 to get the search-by-metadata feature, as 5.0 is already a high priority for us for a number of other reasons as well!

Alan

On Tue, Jun 16, 2015 at 2:52 AM Terry Brady terry.br...@georgetown.edu wrote:

Alan,

The following PR has some code that will search metadata via the API:
https://github.com/DSpace/DSpace/pull/954

I am also on 4.3, so this code is compatible with DSpace 4.3.

Terry

On Thu, Jun 11, 2015 at 5:58 AM, Alan Orth alan.o...@gmail.com wrote:

Ah, it seems the `/rest/items/find-by-metadata-field` endpoint doesn't exist in DSpace 4.x. Neither does the `/rest/login` endpoint. The docs need to make more of a point of saying which versions these are supposed to work in... ;)

Alan

On Thu, Jun 11, 2015 at 12:24 PM Alan Orth alan.o...@gmail.com wrote:

Thanks. It seems my query is well formed but it returns an HTTP 405 Method Not Allowed response. Perhaps DSpace 4.x's REST API isn't sufficient for the find-by-metadata-field query. Or perhaps I need to log in as you illustrated here:
https://github.com/BrunoNZ/dspace-rest-requests

I guess I'll fork that repo and give this a more structured test. I already found some syntax and usability bugs there (mixing sh and bash, not reading the sourced variables, etc).

Alan

On Wed, Jun 10, 2015 at 8:30 PM, Bruno Zanette brunonzane...@gmail.com wrote:

Hey Alan,

Take a look at the comments on the REST API documentation page. I've answered a similar question there:
https://wiki.duraspace.org/display/DSDOC5x/REST+API?focusedCommentId=68068154#comment-68068154

But basically, the request that you have to send is something like this:

    curl -k -4 -H "accept: application/json" -H "Content-Type: application/json" -X POST "https://demo.dspace.org/rest/items/find-by-metadata-field" -d '{"key": "dc.title","value": "Test Webpage","language": "en_US"}'

On Wed, 10 Jun 2015 at 14:08, Alan Orth alan.o...@gmail.com wrote:

Hey,

I'm using DSpace 4.3 and trying to search for items matching certain metadata. The REST API docs on GitHub[0] mention that you can POST to `rest/items/find-by-metadata-field`, but they don't give any more information than that (like how to specify query parameters). I've tried several variations of the following:

    $ http --print Hhb POST 'http://localhost:8080/rest/items/find-by-metadata-field' dc.subject=LACTATION

It always returns with an HTTP 405. What is the proper way to use this? Does the POST require that I log in first or something?

Thanks,

[0] https://github.com/DSpace/DSpace/tree/master/dspace-rest
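For those scripting this rather than using curl, Bruno's request can be sketched with Python's standard library. The endpoint path, headers and JSON keys are taken from his curl example; the helper name is hypothetical, and per this thread the endpoint is not available in DSpace 4.x. The sketch only builds the request so the payload can be inspected before anything is sent.

```python
import json
from urllib.request import Request


def find_by_metadata_request(base_url, key, value, language="en_US"):
    """Build (but do not send) a POST to /rest/items/find-by-metadata-field.

    Assumption: the server accepts the same JSON body as the curl example
    in this thread (key, value, language).
    """
    body = json.dumps({"key": key, "value": value, "language": language})
    return Request(
        f"{base_url}/rest/items/find-by-metadata-field",
        data=body.encode("utf-8"),
        headers={
            "Accept": "application/json",
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = find_by_metadata_request("https://demo.dspace.org", "dc.title", "Test Webpage")
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
```

Passing `req` to `urllib.request.urlopen` would send it; an HTTP 405 in response would suggest the endpoint is missing, as it was on 4.3 here.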
Re: [Dspace-tech] OAI behind a proxy
Hi, Domingo.

It looks like your JSPUI is running on /dspace, so you just need to make sure you proxy to the OAI application as /oai. The OAI app will use the same dspace.hostname parameter from dspace.cfg, just like JSPUI and XMLUI, etc.

Alan

On Tue, Jun 16, 2015 at 9:33 AM Domingo Iglesias digles...@ub.edu wrote:

Hi,

Our dspace box is behind a proxy, so the public name of the repository (diposit.ub.edu) differs from the name of the box dspace is running on, which is not accessible from the internet.

What is the best way to customize the oai module in order to build the response pages with the public repository url instead of the local name?

Thanks in advance!

---
Domingo Iglesias
Universitat de Barcelona
Area de Tecnologies
Re: [Dspace-tech] What's the plan for Browse by functionality?
That's good to hear, Tim. I was mistaken about Browse by still using Lucene. Recently I found some metadata out of date on our DSpace instance, and it turns out I had to re-activate the Lucene indexing cron jobs which I had disabled after upgrading to DSpace 4.x last year. But actually this was for one of our third-party modules, not Browse by. Now to pressure the developer of the third-party module to update to Discovery. ;)

Regards,
Alan

On Fri, Jun 12, 2015 at 5:29 PM Tim Donohue tdono...@duraspace.org wrote:

Hi Alan,

In recent versions of DSpace (4.x and above), Discovery does *both* Browse and Search. So, those Browse By options are now handled by Discovery. These days, if you install DSpace with default options, Lucene is not used at all, and everything runs through Discovery (with Apache Solr) behind it.

The plan going forward is that Discovery will be the search/browse system. The old Lucene / DB-based search and browse system is deprecated and will be removed in the near future.

- Tim

On 6/10/2015 5:22 AM, Alan Orth wrote:

Hi,

Discovery's great but Browse by does some things that we like, such as being able to paginate the results (compare browse[0] with discovery[1])! The problem is that Browse by is terrible because it still uses Lucene, which means we have to maintain parallel lists of indexes for both Lucene and Discovery, and Lucene is a systems admin nightmare because you have to shut down Tomcat for 15 or 20 minutes while you init the indexes after adding new metadata!

I guess I'm curious what the long-term plan is for the Browse by functionality in DSpace?

Thanks,

[0] Browse by: https://cgspace.cgiar.org/browse?type=ilrisubject
[1] Filter by: https://cgspace.cgiar.org/search-filter?field=ilrisubject
Re: [Dspace-tech] URL REST Call to return XML
You need to set the Accept HTTP header to tell the REST API which format you want. From the command line you can do it like this:

$ curl -s -H "Accept: application/xml" http://localhost:8080/rest/communities

I'm not familiar with the DSpace module for Drupal though, so I'm not sure if it allows you to configure request headers...

Alan

On Thu, Jun 11, 2015 at 1:11 PM, Hicham Zahnan h...@aub.edu.lb wrote:

Hi,

I'm using a Drupal module called DSpace that requires a REST URL to return XML. My problem is that when I open the REST URL (e.g. www.abcd.com/rest/communities/6/items) in a browser, I sometimes get JSON and sometimes XML, depending on the browser. When it returns JSON, the module does not work. So is there a way to get only XML through an HTTP REST request? We are using DSpace 5.2.

Thank you,
Hicham
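The Accept header advice above can be wrapped in a small helper so the quoting is not lost when the command is copied around. This is a minimal sketch: REST_BASE and both function names are made up for illustration, and the dry-run variant only prints the command so the quoting can be checked without a running DSpace.

```shell
#!/bin/sh
# Assumed placeholder; point this at your own DSpace REST webapp.
REST_BASE="${REST_BASE:-http://localhost:8080/rest}"

# Request a REST resource and ask explicitly for XML.
# $1 is the path below /rest, e.g. "communities".
rest_get_xml() {
    curl -s -H "Accept: application/xml" "${REST_BASE}/$1"
}

# Print the equivalent command instead of running it.
rest_get_xml_dry_run() {
    printf 'curl -s -H "Accept: application/xml" "%s/%s"\n' "$REST_BASE" "$1"
}
```

Calling `rest_get_xml communities/6/items` would then fetch the resource from the original question as XML, provided the client sending the request really is the one setting the header.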
Re: [Dspace-tech] How to search items by metadata using REST API?
Thanks. It seems my query is well formed, but it returns an HTTP 405 Method Not Allowed response. Perhaps DSpace 4.x's REST API doesn't support the find-by-metadata-field query. Or perhaps I need to log in as you illustrated here:

https://github.com/BrunoNZ/dspace-rest-requests

I guess I'll fork that repo and give this a more structured test. I already found some syntax and usability bugs there (mixing sh and bash, not reading the sourced variables, etc.).

Alan

On Wed, Jun 10, 2015 at 8:30 PM, Bruno Zanette brunonzane...@gmail.com wrote:

Hey Alan,

Take a look at the comments on the REST API documentation page. I've answered a similar question there.

https://wiki.duraspace.org/display/DSDOC5x/REST+API?focusedCommentId=68068154#comment-68068154

But basically, the request that you have to send is something like this:

curl -k -4 -H "accept: application/json" -H "Content-Type: application/json" -X POST "https://demo.dspace.org/rest/items/find-by-metadata-field" -d '{"key": "dc.title","value": "Test Webpage","language": "en_US"}'

On Wed, Jun 10, 2015 at 14:08, Alan Orth alan.o...@gmail.com wrote:

Hey,

I'm using DSpace 4.3 and trying to search for items matching certain metadata. The REST API docs on GitHub[0] mention that you can POST to `rest/items/find-by-metadata-field`, but they don't give any more information than that (like how to specify query parameters). I've tried several variations of the following:

$ http --print Hhb POST 'http://localhost:8080/rest/items/find-by-metadata-field' dc.subject=LACTATION

It always returns an HTTP 405. What is the proper way to use this? Does the POST require that I log in first or something?

Thanks,

[0] https://github.com/DSpace/DSpace/tree/master/dspace-rest

Alan Orth
Re: [Dspace-tech] How to search items by metadata using REST API?
Ah, it seems the `/rest/items/find-by-metadata-field` endpoint doesn't exist in DSpace 4.x. Neither does the `/rest/login` endpoint. The docs need to make more of a point of saying which versions these are supposed to work in... ;)

Alan

On Thu, Jun 11, 2015 at 12:24 PM Alan Orth alan.o...@gmail.com wrote:

Thanks. It seems my query is well formed, but it returns an HTTP 405 Method Not Allowed response. Perhaps DSpace 4.x's REST API doesn't support the find-by-metadata-field query. Or perhaps I need to log in as you illustrated here:

https://github.com/BrunoNZ/dspace-rest-requests

I guess I'll fork that repo and give this a more structured test. I already found some syntax and usability bugs there (mixing sh and bash, not reading the sourced variables, etc.).

Alan

On Wed, Jun 10, 2015 at 8:30 PM, Bruno Zanette brunonzane...@gmail.com wrote:

Hey Alan,

Take a look at the comments on the REST API documentation page. I've answered a similar question there.

https://wiki.duraspace.org/display/DSDOC5x/REST+API?focusedCommentId=68068154#comment-68068154

But basically, the request that you have to send is something like this:

curl -k -4 -H "accept: application/json" -H "Content-Type: application/json" -X POST "https://demo.dspace.org/rest/items/find-by-metadata-field" -d '{"key": "dc.title","value": "Test Webpage","language": "en_US"}'

On Wed, Jun 10, 2015 at 14:08, Alan Orth alan.o...@gmail.com wrote:

Hey,

I'm using DSpace 4.3 and trying to search for items matching certain metadata. The REST API docs on GitHub[0] mention that you can POST to `rest/items/find-by-metadata-field`, but they don't give any more information than that (like how to specify query parameters). I've tried several variations of the following:

$ http --print Hhb POST 'http://localhost:8080/rest/items/find-by-metadata-field' dc.subject=LACTATION

It always returns an HTTP 405. What is the proper way to use this? Does the POST require that I log in first or something?

Thanks,

[0] https://github.com/DSpace/DSpace/tree/master/dspace-rest

Alan Orth
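For readers on a DSpace version that does have the endpoint, the request from Bruno's reply can be sketched as a pair of shell functions. The function names and REST_BASE are invented for this example; the JSON field names (key, value, language) are the ones shown in the thread. The body builder does no JSON escaping, so it is only safe for values without double quotes or backslashes.

```shell
#!/bin/sh
# Assumed placeholder; point this at your own DSpace REST webapp.
REST_BASE="${REST_BASE:-http://localhost:8080/rest}"

# Build the JSON body for a find-by-metadata-field query.
# $1 = metadata key, $2 = value, $3 = language.
# No escaping is done: values must not contain '"' or '\'.
metadata_query_body() {
    printf '{"key": "%s", "value": "%s", "language": "%s"}' "$1" "$2" "$3"
}

# POST the query and ask for a JSON response.
find_by_metadata_field() {
    curl -s \
        -H "Accept: application/json" \
        -H "Content-Type: application/json" \
        -X POST "${REST_BASE}/items/find-by-metadata-field" \
        -d "$(metadata_query_body "$1" "$2" "$3")"
}
```

A call such as `find_by_metadata_field dc.subject LACTATION en_US` would correspond to the search attempted in the original question. As noted above, a DSpace 4.x server does not have this endpoint, so the request would not succeed there.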
[Dspace-tech] What's the plan for Browse by functionality?
Hi,

Discovery's great, but Browse by does some things that we like, such as being able to paginate the results (compare browse[0] with discovery[1])! The problem is that Browse by is terrible because it still uses Lucene, which means we have to maintain parallel lists of indexes for both Lucene and Discovery. Lucene is also a systems admin nightmare, because you have to shut down Tomcat for 15 or 20 minutes while you init the indexes after adding new metadata! I guess I'm curious what the long-term plan is for the Browse by functionality in DSpace?

Thanks,

[0] Browse by: https://cgspace.cgiar.org/browse?type=ilrisubject
[1] Filter by: https://cgspace.cgiar.org/search-filter?field=ilrisubject

Alan Orth
[Dspace-tech] How to search items by metadata using REST API?
Hey,

I'm using DSpace 4.3 and trying to search for items matching certain metadata. The REST API docs on GitHub[0] mention that you can POST to `rest/items/find-by-metadata-field`, but they don't give any more information than that (like how to specify query parameters). I've tried several variations of the following:

$ http --print Hhb POST 'http://localhost:8080/rest/items/find-by-metadata-field' dc.subject=LACTATION

It always returns an HTTP 405. What is the proper way to use this? Does the POST require that I log in first or something?

Thanks,

[0] https://github.com/DSpace/DSpace/tree/master/dspace-rest

Alan Orth
Re: [Dspace-tech] DSpace XMLUI theme customisation help wiki
Fantastic guide, Hilton. Maybe this will inspire me to update our themes to Mirage 2. Our DSpace 1.6-era, Mirage 1-based themes are really starting to show their age! We have 20+ custom themes in our institutional repository[0]... it's quite an undertaking!

Alan

[0] https://github.com/ilri/DSpace/tree/4_x-prod/dspace/modules/xmlui/src/main/webapp/themes

On Thu, May 28, 2015 at 6:25 PM Hilton Gibson hilton.gib...@gmail.com wrote:

On 28 May 2015 at 16:48, Germán Biozzoli germanbiozz...@gmail.com wrote:

Perhaps it could be linked from the official DSpace wiki? For instance, PHP has a lot of user-contributed comments that are as interesting as the proper documentation.

Hi Germán,

Thank you. Whether or not to include it is up to the official DSpace documenters, not me.

Cheers
Hilton Gibson
Ubuntu Linux Systems Administrator
Stellenbosch University Library
http://staff.lib.sun.ac.za/~hgibson/docs/cv/cv.html
[Dspace-tech] How to back up Solr data?
I currently back up PostgreSQL and assetstore data via nightly snapshots and by syncing offsite (S3, whatever), but as far as I know there's no way to atomically snapshot DSpace's Solr; i.e., during the time it takes to `cp` my 8GB of Solr data, the data has changed, so the backup is not atomic and is likely inconsistent. I can shut down Tomcat of course, but... that seems a bit excessive. Upstream Solr has some discussion of backing up from read-only slaves[0], but I wanted to know if anyone had a solution for this. We're using DSpace 4.3.

Cheers,

[0] https://cwiki.apache.org/confluence/display/solr/Backing+Up

Alan Orth
[Dspace-tech] Handle server limit (99999 items?)
Hi,

I'm wondering what happens when the handle server reaches item number 99999? Our handle server is currently at 65000 or so, so this could be a problem for us in the next year or less.

Thanks,

Alan Orth
Re: [Dspace-tech] Handle server limit (99999 items?)
Thanks for pointing out the Cambridge example, Hilton! So they passed the event horizon (and lived to tell the tale) before we had to. *phew*

Kenya's great. The GNU/Linux community[0] is getting stronger by the day! :P

Regards,

[0] https://nairobilug.or.ke/

On Thu, May 7, 2015 at 10:27 AM, Hilton Gibson hilton.gib...@gmail.com wrote:

Hi Alan,

Check: https://www.repository.cam.ac.uk/browse?type=title

They are up to 195598 so far, so 999999 seems to be the limit. One million is not bad?

How is Kenya?

Cheers
hg

Hilton Gibson
Ubuntu Linux Systems Administrator
Stellenbosch University Library
http://staff.lib.sun.ac.za/~hgibson/docs/cv/cv.html

On 7 May 2015 at 09:15, Alan Orth alan.o...@gmail.com wrote:

Hi,

I'm wondering what happens when the handle server reaches item number 99999? Our handle server is currently at 65000 or so, so this could be a problem for us in the next year or less.

Thanks,

Alan Orth
[Dspace-tech] Moving collections in DSpace >= 4.0?
Hi,

I'm wondering how to move a collection to a new community in DSpace versions >= 4.0. In the past, with DSpace 3.x, I would simply[0] find the resource_id for the collection, its parent, and the new parent, then update the relationships in SQL:

UPDATE community2collection SET community_id=$NEW_PARENT WHERE community_id=$OLD_PARENT and collection_id=$COLLECTION;

Assuming I update Solr afterwards using `dspace index-discovery`, is that sufficient? Or is it perhaps required to do a complete re-index with the -b flag?

Thanks,
Alan

[0] For finding the resource_id of each handle:

SELECT resource_id FROM handle WHERE handle = '123456789/11287';
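The DSpace 3.x approach described above can be sketched as a shell helper that fills the three IDs into the UPDATE statement and prints it for review before anything touches the database. The function names are invented for this example, and whether the statement plus a Discovery re-index is enough on DSpace 4.x and later is exactly the open question of this message, so treat it as an illustration of the old procedure only.

```shell
#!/bin/sh
# Succeeds only for a non-empty string of digits.
is_id() {
    case "$1" in
        ''|*[!0-9]*) return 1 ;;
        *) return 0 ;;
    esac
}

# Print the UPDATE that re-parents a collection.
# $1 = collection id, $2 = old parent community id, $3 = new parent community id.
move_collection_sql() {
    for id in "$1" "$2" "$3"; do
        is_id "$id" || { echo "not a numeric id: $id" >&2; return 1; }
    done
    printf 'UPDATE community2collection SET community_id=%s WHERE community_id=%s AND collection_id=%s;\n' "$3" "$2" "$1"
}
```

After checking the printed statement, it could be piped to the database client, for example `move_collection_sql 12 3 7 | psql dspace` (the database name here is hypothetical), followed by `dspace index-discovery`.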
Re: [Dspace-tech] Modules?
Hey, Susan.

Yes, we definitely use modules for overrides. Our institutional DSpace repository makes heavy use of modules for our XMLUI themes (~20 of them right now). The source code is publicly available here:

http://github.com/ilri/DSpace

I haven't tried overriding anything other than that, though...

Alan

On Tue, Mar 17, 2015 at 5:54 PM, Borda, Susan susan.bo...@montana.edu wrote:

Hi,

Does anyone use the Modules option for advanced configuration? I was wondering if that's where I should be putting a copy of dspace.cfg, discovery.xml and messages.xml after I modify the facets and filters?

Thanks,
susan
Re: [Dspace-tech] Split up input-forms.xml?
Hey, Hilton and Claudia.

That's great, so we can just use pure XML semantics to do this... good idea. Also, Bram Luyten pointed out to me that I could use hierarchical Discovery facets:

https://wiki.duraspace.org/display/DSDOC4x/Discovery#Discovery-Hierarchical%28taxonomiesbased%29sidebarfacets

Thanks for the advice, I'll play around and see what I can come up with.

Cheers,
Alan

On Thu, Mar 12, 2015 at 5:18 PM, Claudia Jürgen claudia.juer...@ub.tu-dortmund.de wrote:

Hey Alan,

we split the input forms up into separate files: the individual forms (we use quite a variety, in both English and German), the form map, etc. Here is how you can do it:

input-forms.xml

    <?xml version="1.0" encoding='iso-8859-1' ?>
    <!DOCTYPE input-forms SYSTEM "input-forms.dtd" [
      <!ENTITY map SYSTEM "path to the form map">
      <!ENTITY basic SYSTEM "path to the form basic">
      ...
    ]>
    <input-forms>
      &map;
      <form-definitions>
        &basic;
      </form-definitions>
      ...
    </input-forms>

We are using 4.2.

Hope this helps
Claudia Jürgen

On 12.03.2015 at 14:07, Alan Orth wrote:

Hey, all.

Our institution has many groups collaborating on one DSpace repository and we have many sets of metadata terms. In the beginning we thought it was clever to add them to input-forms.xml, but now this file is nearly 6,000 lines, which is a pain to manage and causes problems when we rebase on top of new upstream DSpace releases in git. :)

I had a look at the controlled vocabulary functionality, which allows you to break out terms to separate XML files, but this has the unfortunate side effect of storing the values as a hierarchy, for example "Organization A::Subjects::Subject A" as opposed to just "Subject A". This breaks basically everything that relies on metadata aggregation or searching (not to mention it is ugly).

Is there some way to split up input-forms.xml? What are other DSpace users at large institutions doing? For the record, we're using DSpace 4.3.

Thanks,
Alan
[Dspace-tech] Split up input-forms.xml?
Hey, all.

Our institution has many groups collaborating on one DSpace repository and we have many sets of metadata terms. In the beginning we thought it was clever to add them to input-forms.xml, but now this file is nearly 6,000 lines, which is a pain to manage and causes problems when we rebase on top of new upstream DSpace releases in git. :)

I had a look at the controlled vocabulary functionality, which allows you to break out terms to separate XML files, but this has the unfortunate side effect of storing the values as a hierarchy, for example "Organization A::Subjects::Subject A" as opposed to just "Subject A". This breaks basically everything that relies on metadata aggregation or searching (not to mention it is ugly).

Is there some way to split up input-forms.xml? What are other DSpace users at large institutions doing? For the record, we're using DSpace 4.3.

Thanks,
Alan
Re: [Dspace-tech] Character encoding issues in Discovery search results
Hey, Aleksandar.

Actually, I just fixed this... somehow by accident. Today we did a batch metadata cleanup via SQL and modified the Discovery sidebar facet configs, so I had to rebuild the indexes with `index-discovery -b`. Also, we wanted to re-generate all of our PDF thumbnails for DSpace 4's higher-quality versions (we upgraded a few months ago but hadn't yet re-generated thumbnails for existing items). I'm not sure why this didn't work before when I was doing my research in December[0], but I'm glad that it's fixed!

Thanks for following up with me! I hope this helps someone else...

Alan

[0] https://github.com/ilri/DSpace/issues/43

On Fri Feb 13 2015 at 3:56:20 PM Aleksandar Stojanov a.stoja...@losisin.com wrote:

Hi,

I've visited the repository link (https://cgspace.cgiar.org/handle/10568/51393) from the GitHub discussion page and did some searching there. I've noticed that it happens with a lot of PDFs there, and always at the same place, which is after the page number. A form feed character is inserted there, which is the Unicode \u000c character for a new page or new line. Although this is valid HTML, it's invalid XHTML, and the recommended practice would be to treat it as a zero-width character because it has no semantic meaning.

http://www.w3.org/TR/unicode-xml/#White

We had a similar problem with search results and weird characters, and this helped:

http://sourceforge.net/p/dspace/mailman/message/31212700/

Can you try that solution and post back the results? Also, don't forget to make a backup first.

Cheers,
Aleksandar Stojanov

On Thu, Feb 12, 2015 at 10:53 AM, Alan Orth alan.o...@gmail.com wrote:

Hey, bender.

No, we didn't figure this out. In fact, it's still an open issue on our institution's GitHub issue tracker!

https://github.com/ilri/DSpace/issues/43

I've posted a few notes there but haven't come to any conclusion. :(

Alan

On Fri Jan 02 2015 at 8:54:00 PM bender bender.bending.1...@gmail.com wrote:

Hi Alan:

Did you solve this issue? And if so, how?

Bender

2014-12-09 13:09 GMT-03:00 Alan Orth alan.o...@gmail.com:

Antoine,

In this case the dspace script respects the environment's JAVA_OPTS if it is set; the one in the script is only used if JAVA_OPTS is empty.

Alan

On Tue, Dec 9, 2014 at 6:54 PM, Antoine Snyers anto...@atmire.com wrote:

Hi Alan Orth,

-Dfile.encoding=UTF-8 should be added to the bin/dspace command. Here is the line:

https://github.com/DSpace/DSpace/blob/dspace-4.2/dspace/bin/dspace#L75

Then rerun 'index-discovery -b'. I believe this will resolve your problem.

Antoine Snyers

On 09/12/14 14:49, Alan Orth wrote:

Hi,

Our DSpace 4.2's Discovery search results display snippets from the item's full-text PDF extract, but we get mojibake (strange characters) in the summaries (see attached photo). Browsing to the item's PDF-extracted text bitstream indeed shows the strange characters, and Firefox's developer tools show the encoding is ISO-8859-1. What's strange is that if I download the file, the resulting encoding is UTF-8 and these characters display properly.

I have tried the following:

- Confirmed our Tomcat connectors are using URIEncoding=UTF-8
- Forced -Dfile.encoding=UTF-8 in JAVA_OPTS and manually re-run `filter-media` as well as `index-discovery -b`

What could I be missing?

Thanks!

Alan Orth
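The JAVA_OPTS behaviour mentioned in this thread (the launcher's built-in options apply only when the environment's JAVA_OPTS is empty) follows a common shell pattern, sketched below. The function name and the default string are illustrative and not copied from bin/dspace.

```shell
#!/bin/sh
# Print the JVM options a launcher script would end up using.
# The default below is a made-up example, not the value from bin/dspace.
effective_java_opts() {
    default_opts="-Xmx256m -Dfile.encoding=UTF-8"
    # ${VAR:-fallback} expands to the fallback when VAR is unset or empty.
    printf '%s\n' "${JAVA_OPTS:-$default_opts}"
}
```

The consequence is that an environment with, say, `JAVA_OPTS="-Xmx1g"` replaces the defaults entirely, so an encoding flag that exists only in the script's default would not be applied unless it is also added to the environment's value.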
Re: [Dspace-tech] Character encoding issues in Discovery search results
Hey, bender.

No, we didn't figure this out. In fact, it's still an open issue on our institution's GitHub issue tracker!

https://github.com/ilri/DSpace/issues/43

I've posted a few notes there but haven't come to any conclusion. :(

Alan

On Fri Jan 02 2015 at 8:54:00 PM bender bender.bending.1...@gmail.com wrote:

Hi Alan:

Did you solve this issue? And if so, how?

Bender

2014-12-09 13:09 GMT-03:00 Alan Orth alan.o...@gmail.com:

Antoine,

In this case the dspace script respects the environment's JAVA_OPTS if it is set; the one in the script is only used if JAVA_OPTS is empty.

Alan

On Tue, Dec 9, 2014 at 6:54 PM, Antoine Snyers anto...@atmire.com wrote:

Hi Alan Orth,

-Dfile.encoding=UTF-8 should be added to the bin/dspace command. Here is the line:

https://github.com/DSpace/DSpace/blob/dspace-4.2/dspace/bin/dspace#L75

Then rerun 'index-discovery -b'. I believe this will resolve your problem.

Antoine Snyers

On 09/12/14 14:49, Alan Orth wrote:

Hi,

Our DSpace 4.2's Discovery search results display snippets from the item's full-text PDF extract, but we get mojibake (strange characters) in the summaries (see attached photo). Browsing to the item's PDF-extracted text bitstream indeed shows the strange characters, and Firefox's developer tools show the encoding is ISO-8859-1. What's strange is that if I download the file, the resulting encoding is UTF-8 and these characters display properly.

I have tried the following:

- Confirmed our Tomcat connectors are using URIEncoding=UTF-8
- Forced -Dfile.encoding=UTF-8 in JAVA_OPTS and manually re-run `filter-media` as well as `index-discovery -b`

What could I be missing?

Thanks!

Alan Orth
Re: [Dspace-tech] Enabling SSL
Also, I realized early on there are many reasons to terminate your SSL with something other than Tomcat. We used Apache httpd for a while, and now I've switched to nginx. For reference, here's our nginx vhost config: https://gist.github.com/alanorth/ddde5e9d6c55b3637513 You'll want the xmlui.force.ssl option that helix84 pointed to, and there are a few variables in our config that are interpolated by Ansible during deployment of the server... but you get the picture. Alan
On Thu Dec 11 2014 at 4:02:24 PM helix84 heli...@centrum.sk wrote:
> Forgot to include the link for SSL configuration in Tomcat (both options): http://tomcat.apache.org/tomcat-7.0-doc/ssl-howto.html#Configuration
> Regards, ~~helix84
[Dspace-tech] Character encoding issues in Discovery search results
Hi, Our DSpace 4.2's Discovery search results display snippets from the item's full-text PDF extract, but we get mojibake (strange characters) in the summaries (see attached photo). Browsing to the item's PDF-extracted text bitstream indeed shows the strange characters, and Firefox's developer tools show the encoding is ISO-8859-1. What's strange is that if I download the file, the resulting encoding is UTF-8 and these characters display properly. I have tried the following:
- Confirmed our Tomcat connectors are using URIEncoding=UTF-8
- Forced -Dfile.encoding=UTF-8 in JAVA_OPTS and manually re-ran `filter-media' as well as `index-discovery -b'
What could I be missing? Thanks! -- Alan Orth
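The symptom described here (fine when downloaded as UTF-8, garbled when the browser assumes ISO-8859-1) is what you would expect when UTF-8 bytes are decoded with the wrong charset. A minimal Python sketch of that effect follows; the sample string is made up for illustration and is not taken from the affected PDF extract, and this does not claim to identify where in DSpace the wrong charset is being declared.

```python
# Illustrative only: the sample text is an assumption, not data from the thread.
original = "café"

# The extracted text bitstream stored as UTF-8 bytes.
utf8_bytes = original.encode("utf-8")

# What a client displays if it is told (or guesses) ISO-8859-1 for those bytes.
mojibake = utf8_bytes.decode("iso-8859-1")

# Decoding the very same bytes as UTF-8 recovers the text, which matches
# the observation that the downloaded file looks correct.
recovered = utf8_bytes.decode("utf-8")

print(repr(mojibake))
print(recovered)
```

If the bytes on disk are already valid UTF-8, as the download test suggests, the thing to chase is the charset declared when the bitstream is served or indexed rather than the extraction itself.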
Re: [Dspace-tech] Character encoding issues in Discovery search results
Antoine, In this case the dspace script respects the environment's JAVA_OPTS if it is set; the one in the script is only used if JAVA_OPTS is empty. Alan
On Tue, Dec 9, 2014 at 6:54 PM, Antoine Snyers anto...@atmire.com wrote:
> Hi Alan Orth, -Dfile.encoding=UTF-8 should be added to the bin/dspace command. Here is the line: https://github.com/DSpace/DSpace/blob/dspace-4.2/dspace/bin/dspace#L75 Then rerun 'index-discovery -b'. I believe this will resolve your problem. Antoine Snyers
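The precedence Alan describes (environment first, script default only as a fallback) can be sketched in a few lines of Python. This is only an illustration of the rule as stated in the message; the default value below is a hypothetical placeholder, and the real one lives in the bin/dspace script.

```python
import os

# Hypothetical placeholder; check bin/dspace for the value your version ships with.
SCRIPT_DEFAULT_OPTS = "-Xmx256m"


def effective_java_opts(environ, script_default=SCRIPT_DEFAULT_OPTS):
    """Return JAVA_OPTS from the environment if it is non-empty, else the script default."""
    env_opts = environ.get("JAVA_OPTS", "").strip()
    return env_opts if env_opts else script_default


# Usage: show which options a launcher following this rule would pick right now.
print(effective_java_opts(os.environ))
```

Under this rule, editing the default inside the script changes nothing on a machine where JAVA_OPTS is already exported, which is why forcing -Dfile.encoding=UTF-8 in the environment should already have covered Antoine's suggestion.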
Re: [Dspace-tech] Issues with OpenSearch sortFields and sorting order
Christian, Wow, thank you! I've just managed to create a proper RSS feed for a certain metadata field and sort it by dc.date.accessioned in descending order. Now, the DSpace documentation is 100% wrong in several places with regard to the setup of OpenSearch... for example, I had to define custom search filters in `discovery.xml' [0], and the DSpace 4.2 documentation PDF (page 733) seems to suggest that OpenSearch only has the following parameters: format, scope, rpp, start, sort_by. Grateful but frustrated, Alan
[0] https://github.com/ilri/DSpace/commit/b970a02554b9a6a3e809be3085864299a8233bad
On Wed Dec 03 2014 at 9:38:21 AM Christian Scheible christian.schei...@uni-konstanz.de wrote:
> Hi Alan, OpenSearch uses dspace.cfg, not discovery.xml, and the sorting options are defined like this:
> webui.itemlist.sort-option.1 = dateissued:dc.date.issued:date
> webui.itemlist.sort-option.2 = dateaccessioned:dc.date.accessioned:date
> The sort option and the order can be set with GET parameters like this: sort_by=1&order=DESC
> Best -- Christian Scheible, Kommunikations-, Informations- und Medienzentrum (KIM), Universität Konstanz
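To answer the "which integer do I use?" question mechanically, the sort-option lines Christian quotes can be parsed into a lookup table and turned into a query string. The Python sketch below assumes the two config lines from his message; the host name, the `open-search/discover` path, the `query` value and `format=rss` are placeholders for illustration, so substitute whatever your own instance exposes.

```python
from urllib.parse import urlencode

# The two lines quoted in the thread; a real dspace.cfg may define more.
DSPACE_CFG = """
webui.itemlist.sort-option.1 = dateissued:dc.date.issued:date
webui.itemlist.sort-option.2 = dateaccessioned:dc.date.accessioned:date
"""

PREFIX = "webui.itemlist.sort-option."


def parse_sort_options(cfg_text):
    """Map each sort-option number to its (name, metadata field, datatype) triple."""
    options = {}
    for line in cfg_text.splitlines():
        line = line.strip()
        if not line.startswith(PREFIX):
            continue  # skips blank lines, comments and unrelated keys
        key, _, value = line.partition("=")
        number = int(key.strip()[len(PREFIX):])
        name, field, datatype = value.strip().split(":")
        options[number] = (name, field, datatype)
    return options


def sort_by_for(options, metadata_field):
    """Return the integer to pass as sort_by for a given metadata field."""
    for number, (_name, field, _datatype) in options.items():
        if field == metadata_field:
            return number
    raise KeyError(metadata_field)


options = parse_sort_options(DSPACE_CFG)
params = {
    "query": "maize",   # hypothetical search term
    "format": "rss",    # assumed feed format value
    "sort_by": sort_by_for(options, "dc.date.accessioned"),
    "order": "DESC",
}
# Hypothetical base URL; adjust host and path to your installation.
url = "https://repository.example.org/open-search/discover?" + urlencode(params)
print(url)
```

The integer is simply the N in `webui.itemlist.sort-option.N`, so a feed sorted newest-first by accession date uses `sort_by=2` together with `order=DESC` under the configuration shown.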
[Dspace-tech] Issues with OpenSearch sortFields and sorting order
Hi, I'm playing with OpenSearch in DSpace 4.2 and I'm having a couple of issues. I see OpenSearch uses the Discovery configuration, so I'm investigating `discovery.xml'. Here are two related issues I'm having:
1- I've added a custom sortField for dc.date.accessioned and added it to my bean's sortFields, but I can't figure out how to use it. OpenSearch has a `sort_by' parameter which only takes an integer; how do I know which integer to use?
2- If I use `sort_by=2` it appears to be sorting by dc.date.issued, but in ascending order, even though `discovery.xml' lists desc as the default sort order everywhere in the config.
We're trying to piggyback on OpenSearch to build custom RSS feeds for certain terms, so we can use a feed aggregator to send email alerts when there are new items. Unfortunately it's quite pointless if we can't customize the field to sort by or see the newest items first in the feed! Our application stack is: DSpace 4.2, Tomcat 7, Oracle Java 7, Ubuntu 14.04. Thank you, Alan
Re: [Dspace-tech] ant update command!
Hi, Nada. Those two commands are the same; ant update uses config/dspace.cfg by default, which is what most people should use. This is what I do, for example:
    mvn -U clean package
    cd dspace/target/dspace-4.2-build
    ant update
Alan
On Sun Nov 23 2014 at 4:22:58 PM Nada Abo Eita naboeit...@gmail.com wrote:
> Hi all, Regarding the ant update command used to update a DSpace installation: could you please explain the difference between these two commands?
>     ant update
>     ant -Dconfig=[dspace]/config/dspace.cfg update
> Regards, Nada Abo-Eita
[Dspace-tech] HiDPI favicon?
I never noticed until I had a HiDPI screen (MacBook Pro with Retina screen), but the default favicon for XMLUI and probably JSPUI is pretty low definition. Right now we're using 16x16 pixels, but we should be using something more like 48x48 or even 128x128, see some examples: https://medium.com/favicon.ico https://wiki.duraspace.org/favicon.ico https://www.google.com/favicon.ico https://www.apple.com/favicon.ico I'd make one myself but I'm not sure where the original came from... Regards, Alan
Re: [Dspace-tech] HiDPI favicon?
Ah, that's great, helix. They're high-res enough that we could easily get a 64x64 pixel favicon out of them. Tim? Alan
On Wed Nov 05 2014 at 4:37:42 PM helix84 heli...@centrum.sk wrote:
> Hi Alan, we should probably use this one instead: https://wiki.duraspace.org/display/DSP/Logos#Logos-DSpaceLogos Tim, what do you think?
> Regards, ~~helix84
Re: [Dspace-tech] DS-2220: Always load Google Analytics over SSL
Fair point. As a technologist working in the developing world, this should matter more to me, but alas, my heart always cries "deploy hard crypto!". There is a compromise between the two somewhere... In related news, today Google announced that Chrome 39 will disable SSLv3 fallback, and Chrome 40 will disable it entirely: https://groups.google.com/a/chromium.org/forum/#!topic/security-dev/Vnhy9aKM_l4 And TLS 1.0, which came after SSLv3 (despite the decrement), is 15 years old now! Even Windows XP supports TLS 1.0. Regards, Alan Orth
On Tue, Oct 28, 2014 at 9:54 PM, Stuart Yeates stuart.yea...@vuw.ac.nz wrote:
>> I was shooting for always loading over HTTPS, as surely loading ANYTHING we can over HTTPS should increase our users' security, ie jQuery, images, CSS, etc...
> Yes, but only if you're assuming that only humans connect and all of them use modern browsers with good https support. Many users in the developing world access on an array of kinds of hardware and software that we would consider obsolete. Requiring the latest and greatest web technologies to access our research isn't going to decrease that development gap. Many tools, from plain server monitoring systems to reference checking systems to fancy website thumbnail services, just work better and more reliably over http than https.
> cheers stuart
Re: [Dspace-tech] DS-2220: Always load Google Analytics over SSL
Stuart, I was shooting for always loading over HTTPS, as surely loading ANYTHING we can over HTTPS should increase our users' security, i.e. jQuery, images, CSS, etc... but it seems Google's example code for the traditional ga.js recommends doing exactly what we're already doing: https://developers.google.com/analytics/devguides/collection/gajs/gaTrackingOverview So maybe there's something they're doing with the document.location in their script... I dunno. Alan
On 10/27/2014 11:22 PM, Stuart Yeates wrote:
> Isn't the fix for this to use protocol-independent URIs, i.e. the ones that start with // rather than https:// or http://? Or is there an important secondary issue I'm missing?
> cheers stuart
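For readers unfamiliar with the protocol-independent (scheme-relative) URIs Stuart mentions, the resolution rule can be shown with Python's standard `urljoin`, which follows the same reference-resolution behaviour browsers use. The host names below are hypothetical placeholders; this only illustrates how a `//` reference inherits the scheme of the page that includes it, not how ga.js itself should be loaded.

```python
from urllib.parse import urljoin

# Hypothetical scheme-relative reference to a script on another host.
SCRIPT_REF = "//cdn.example.org/lib/script.js"

# The same reference resolved from an HTTPS page and from an HTTP page.
from_https_page = urljoin("https://repository.example.org/handle/123", SCRIPT_REF)
from_http_page = urljoin("http://repository.example.org/handle/123", SCRIPT_REF)

print(from_https_page)  # scheme taken from the HTTPS page
print(from_http_page)   # scheme taken from the HTTP page
```

So a scheme-relative reference avoids mixed-content warnings on HTTPS pages, but it does not give the "always HTTPS" behaviour Alan was aiming for: a page served over plain HTTP still fetches the script over plain HTTP.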
[Dspace-tech] DS-2220: Always load Google Analytics over SSL
I was just poking around and noticed we conditionally load Google Analytics over SSL. We should *always* load ga.js over SSL. Bug here: https://jira.duraspace.org/browse/DS-2220 Patch and pull request is linked in bug report. Cheers, -- Alan Orth
Re: [Dspace-tech] Using DSpace with https
In that same vein, I just submitted a patch to always load Google Analytics (ga.js) over SSL: https://jira.duraspace.org/browse/DS-2220 The more resources we load over HTTPS the better. Cheers, Alan Orth
On Mon, Oct 20, 2014 at 4:22 PM, Carlos Walter Blandon Alvarez cblan...@eafit.edu.co wrote:
> Hi helix84. Thanks for the reply. Another possible solution could be to replace the $protocol variable in the script element in page-structure.xsl:
>     <script type="text/javascript" src="{concat($protocol, 'ajax.googleapis.com/ajax/libs/jquery/', $jqueryVersion ,'/jquery.min.js')}">&#160;</script>
> with
>     <script type="text/javascript" src="{concat('https://', 'ajax.googleapis.com/ajax/libs/jquery/', $jqueryVersion ,'/jquery.min.js')}">&#160;</script>
> Regards, Walter Blandón
> -----Original Message----- From: ivan.ma...@gmail.com on behalf of helix84 Sent: Friday, 17 October 2014 04:55 p.m. To: Carlos Walter Blandon Alvarez CC: Hilton Gibson; dspace-tech@lists.sourceforge.net Subject: Re: [Dspace-tech] Using DSpace with https
>> On Fri, Oct 17, 2014 at 11:15 PM, Carlos Walter Blandon Alvarez cblan...@eafit.edu.co wrote:
>>> [blocked] The page at 'https://repository.eafit.edu.co/' was loaded over HTTPS, but ran insecure content from 'http://ajax.googleapis.com/ajax/libs/jquery/1.6.2/jquery.min.js': this content should also be loaded over HTTPS.
>> Hi Carlos, you were right, you do need to update dspace.baseUrl. See https://github.com/DSpace/DSpace/blob/dspace-4.1/dspace-xmlui/src/main/webapp/themes/Mirage/lib/xsl/core/page-structure.xsl#L671-681
>> Regards, ~~helix84
Re: [Dspace-tech] Recommended TLS cipher suite for sites using HTTPS
Stuart, Interesting that you consider Mozilla's guidelines too strict. Bettercrypto.org's are even more so. :) For reference, I use a stricter config than Mozilla's in that I disallow SSLv3 (as even XP supports TLS 1.0), and I get an A+ on the Qualys SSL test: https://www.ssllabs.com/ssltest/analyze.html?d=cgspace.cgiar.org TLS is fun, isn't it?! Alan
On 09/15/2014 01:20 AM, Stuart Yeates wrote:
> I use a verifier to check my config: https://www.ssllabs.com/ssltest/analyze.html?d=exams.victoria.ac.nz
> Note that my settings are less secure than I might like, because increasing them causes some platforms (especially mobile platforms) to fail to access the content, while leaving nothing useful in the logs. Personally I find the Mozilla advice a little strong on the "force users with outdated browsers to update" approach.
> It's also possible to force users who log in to use more secure credentials than those who just access content, if you can assume that only admin staff log in from their desktops with recent browsers. There's an example on https://httpd.apache.org/docs/2.0/ssl/ssl_howto.html
> Cheers stuart
Re: [Dspace-tech] Recommended TLS cipher suite for sites using HTTPS
Hi, Hilton. Thanks for your reply. First, I'd like to point out that I reverse proxy DSpace via nginx (and Apache httpd a few years ago). The decision to put nginx / httpd in front of Tomcat was made partially on the fact that it's easier to configure HTTPS in those servers than Tomcat, and nginx supports more modern crypto than Apache httpd or Apache Tomcat. Also mod_rewrite and vhosts etc. were easier. Your HTTPS configuration could use several improvements. Attached is a screenshot of the negotiated cipher suite as seen in Chrome in GNU/Linux. Of note:
- The connection is encrypted using AES CBC. AES is government-grade security, but implemented in CBC mode it is vulnerable to padding oracle attacks (see BEAST and Lucky13) [0]. It is recommended to use GCM (Galois/Counter Mode).
- Message authentication (MAC, basically a hash or fingerprint) is using SHA1, which is very old: it started showing weaknesses in academic circles and was first shown to be broken in 2005 [1].
- Your connection is using Diffie-Hellman Ephemeral, which is good! Ephemeral means that there is a temporary secret used in the HTTPS negotiation that is thrown away after the session. In the scenario that an adversary (NSA?) gets your HTTPS key and records secure traffic, they won't be able to decode those sessions. This is called 'forward secrecy' (sometimes 'perfect forward secrecy').
Other than that, your HTTPS certs are signed using SHA1, which has been deprecated by all major browsers in favor of SHA2 [2]. It's kinda overwhelming, but using the Mozilla cipher list will get you started. They are a list of safe defaults which take into account most of the latest information we have on cryptography. Hope that helps, Alan Orth
[0] https://wiki.mozilla.org/Security/Server_Side_TLS#Attacks_on_TLS
[1] https://www.schneier.com/blog/archives/2005/02/sha1_broken.html
[2] https://sha.com/
On Sat, Sep 13, 2014 at 10:35 PM, helix84 heli...@centrum.sk wrote:
>> On Sat, Sep 13, 2014 at 9:05 PM, Hilton Gibson hilton.gib...@gmail.com wrote:
>> Who is the arbiter of safe ciphers? I am not a cipher expert.
> There's no arbiter. The set changes over time as new vulnerabilities are found in existing ciphers and new ciphers are developed to mitigate those attack vectors. A cipher might look good on paper, but only widespread use reveals its weaknesses. Then there is the natural deprecation of shorter key sizes, which is required as new computers get faster. Furthermore, errors exist in PRNGs, which encryption vitally depends on. The only way is to keep up to date on this information. That's why the Mozilla list Alan mentioned helps - they watch it for you and give you their recommendations.
> Regards, ~~helix84
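The three observations in the list above (cipher mode, MAC, key exchange) can usually be read straight off the negotiated cipher suite name. The Python sketch below is a rough heuristic over OpenSSL-style names, written for illustration only: the sample names are assumptions rather than the suite from the screenshot, the name parsing is not exhaustive, and a proper audit should rely on a scanner such as the Qualys test mentioned in this thread. `negotiated_cipher` uses the standard `ssl` module to ask a live server what it negotiates with this client.

```python
import socket
import ssl


def negotiated_cipher(host, port=443):
    """Connect to host and return the name of the cipher suite that was negotiated."""
    context = ssl.create_default_context()
    with socket.create_connection((host, port), timeout=10) as sock:
        with context.wrap_socket(sock, server_hostname=host) as tls:
            return tls.cipher()[0]


def review_cipher(name):
    """Return rough notes about an OpenSSL-style cipher suite name (heuristic only)."""
    notes = []
    parts = name.upper().replace("_", "-").split("-")
    if parts[0] in ("DHE", "ECDHE"):
        notes.append("ephemeral key exchange (forward secrecy)")
    else:
        notes.append("no ephemeral key exchange in the name")
    if "GCM" in parts:
        notes.append("AES-GCM (authenticated encryption)")
    elif any(part.startswith("AES") for part in parts):
        notes.append("AES without GCM, most likely CBC mode")
    if parts[-1] == "SHA":
        notes.append("SHA1 for message authentication")
    return notes


# Sample names are assumptions chosen to mirror the cases discussed above.
for sample in ("DHE-RSA-AES256-SHA", "ECDHE-RSA-AES128-GCM-SHA256"):
    print(sample, "->", "; ".join(review_cipher(sample)))
```

Note that `negotiated_cipher` reports what this particular client and the server agree on; a browser with a different preference list may negotiate something else, which is one reason the Mozilla recommendations are expressed as a server-side ordering.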
[Dspace-tech] Recommended TLS cipher suite for sites using HTTPS
Hey, all. I was just having a look at a few institutional DSpace instances, and noticing that they are using sub-par cryptography. Unless you have a specific need to use SHA1, AES-CBC, RC4, MD5, or non-DHE RSA, you should REALLY be using the TLS cipher suite from this Mozilla guide: https://wiki.mozilla.org/Security/Server_Side_TLS They have copy/paste-able strings for Apache httpd and Nginx web servers. Obviously test in a development server first... but really, this is a trivial change. You went to the trouble of buying TLS certs and setting up HTTPS, so you might as well do it right! -- Alan Orth
Re: [Dspace-tech] IPv4/6 double stack setup and session hijacking prevention
Well, if this *was* a session hijacking attempt... wouldn't it look exactly like this? I.e., DSpace would actually be protecting you. :)

Alan

On 06/08/2014 03:34 PM, Becker, Pascal-Nicolas wrote:

Hi,

today I used my test installation of DSpace for the first time from home, where I have IPv4 and IPv6 in a dual stack setup. My server has an IPv4 and an IPv6 connection as well, but in my office I currently have IPv4 only. So today I was using the DSpace JSPUI (master branch from early May 2014) in an IPv4/IPv6 dual stack setup for the first time. While using DSpace I was asked to log in every two minutes. As this was quite annoying, I looked into dspace.log and found the following line:

2014-06-08 14:01:13,201 WARN org.dspace.app.webui.util.UIUtil @ POSSIBLE HIJACKED SESSION: request from 2001:6f8:::::: does not match original session address: 85.XXX.XXX.XXX. Authentication rejected.

I think the problem is obvious: my Mac alternates between IPv4 and IPv6 when connecting to my DSpace installation. DSpace detects this as a possible session hijacking attack and invalidates my session. Has anyone had the same problem already? Does anyone have an idea how to solve it? And please don't suggest that I use either IPv4 or IPv6. ;-)

Regards,
Pascal

P.S. A solution could be to save both an IPv4 and an IPv6 address per session, to prevent session hijacking while still supporting IPv4/6 dual stack setups. But even then we could run into problems with IPv6 privacy extensions...

--
Alan Orth alan.o...@gmail.com
[Dspace-tech] Anyone played with G1 garbage collector in JDK7?
Hey, all.

I was just looking over the DSpace tuning guide[0] and I got to reading about garbage collectors. Ubuntu {12,14}.04's Tomcat 7 both use the ConcMarkSweep garbage collector, but I wonder if we should be using Java 7's new G1 garbage collector, as JDK7 has been out for a few years now and there are some impressive numbers with G1GC[1][2].

I'm currently using the following JAVA_OPTS on my dev/production servers, where the repository has ~20,000 items:

-Djava.awt.headless=true -Xms1024m -Xmx2048m -XX:MaxPermSize=320m -XX:+UseConcMarkSweepGC -XX:+CMSIncrementalMode

We're moving to newer servers soon (Linode, 6 CPUs, 8GB RAM, from a pretty slow EC2 m1.medium), so we'll see much more performance anyways, but I'll be experimenting with different parameters there as our repository is becoming increasingly important and heavy (~500,000 hits per month).

Cheers,

[0] https://wiki.duraspace.org/display/DSDOC4x/Performance+Tuning+DSpace
[1] http://blog.sematext.com/2013/06/24/g1-cms-java-garbage-collector/
[2] http://www.aioug.org/sangam12/Presentations/20155.pdf

--
Alan Orth alan.o...@gmail.com
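For anyone wanting to try the comparison, the G1 variant of the options above might look like the sketch below. `-XX:+UseG1GC` is a standard HotSpot flag on JDK 7; everything else is copied from the message rather than tuned, and where the variable gets set (for example `/etc/default/tomcat7` on Ubuntu's packaging) is an assumption to verify locally.

```shell
#!/bin/sh
# Sketch only: the JAVA_OPTS from the message with the two CMS flags
# (-XX:+UseConcMarkSweepGC -XX:+CMSIncrementalMode) swapped for G1.
# Heap and PermGen sizes are unchanged and are not recommendations.
JAVA_OPTS="-Djava.awt.headless=true -Xms1024m -Xmx2048m -XX:MaxPermSize=320m -XX:+UseG1GC"
export JAVA_OPTS

echo "$JAVA_OPTS"
```

Only one collector should be selected at a time, which is why the CMS flags are removed rather than left alongside the G1 flag.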
Re: [Dspace-tech] Batch update of meta data using SQL?
James,

Awesome, this works. For the record (and for future space travelers, including myself!), it was sufficient to run an `index-update` afterwards to update the search / browse indexes. No stopping of Tomcat and `index-init` required.

Cheers,
Alan

On 06/13/2014 06:57 PM, James Creel wrote:

UPDATE metadatavalue SET text_value = 'REPORT' WHERE text_value = 'TECHNICAL REPORT';

Directly manipulating the database is often the most expedient way to solve a problem or get a report, but at the database prompt you are always exactly one query away from destroying your DSpace. This query is fairly safe, but it's very wise to back stuff up first and maybe kill current sessions on the system-wide alerts page so nobody loses their work if you have to restore.

Good luck!

James Creel
Senior Lead Software Applications Developer, Texas A&M University Libraries Digital Initiatives
jcr...@library.tamu.edu

--
Alan Orth alan.o...@gmail.com
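In the spirit of James's warning, one cautious way to run such an update is inside a transaction, checking the row count before committing. The sketch below only prints the SQL; the helper name `rename_value_sql` is made up for illustration, and the table and column names are the ones from the query in this thread.

```shell
#!/bin/sh
# rename_value_sql: hypothetical helper that only PRINTS SQL; it runs nothing.
# Table and column names (metadatavalue, text_value) come from the query
# quoted in this thread. Single quotes in the arguments are doubled for SQL.
rename_value_sql() {
    old=$(printf '%s' "$1" | sed "s/'/''/g")
    new=$(printf '%s' "$2" | sed "s/'/''/g")
    cat <<EOF
BEGIN;
SELECT count(*) FROM metadatavalue WHERE text_value = '$old';
UPDATE metadatavalue SET text_value = '$new' WHERE text_value = '$old';
-- Compare the UPDATE row count with the SELECT, then type COMMIT; or ROLLBACK;
EOF
}

rename_value_sql 'TECHNICAL REPORT' 'REPORT'
```

The output is meant to be pasted into an interactive psql session after a backup, so that the final COMMIT or ROLLBACK stays a deliberate manual step.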
[Dspace-tech] Batch update of meta data using SQL?
Hi,

We've updated some of our controlled vocabularies for new conventions in our organization, like TECHNICAL REPORT -> REPORT, but now I need to do a batch update of the stored metadata values. If it were simply a collection I'd export the metadata CSV and change it there, but we have records across the whole repository which need to be updated. I assume SQL is the only way? I'm not very familiar with DSpace's PostgreSQL schema, so I really need advice on how to approach this in a safe way. We're on DSpace 3.1.

Thanks,

--
Alan Orth alan.o...@gmail.com
Re: [Dspace-tech] JAVA_OPTS for cron jobs?
Peter,

Ahh, that's very interesting. I just looked up the -server flag and it seems that on recent Sun/Oracle JVMs -server is implied on 64-bit Linux platforms[0]. It seems my problem was that the heuristics used by the OOM killer were killing Tomcat's java instead of whatever filter-media, etc. cron job happened to be the final straw in exhausting the server's memory.

I've since re-evaluated my Tomcat's -Xmx and -Xms values, and determined there wasn't enough physical RAM to run both Tomcat's java and the background tasks, yet DSpace's control panel shows Tomcat's java is actually underutilizing the RAM we've allocated. Reducing the allocation there made a little more room for the background tasks, and things have been stable since then.

Also, I suspect it was the checksum checker job (runs at 3am for us) which was actually the final straw in exhausting the memory, so I've modified it to run for 1 hour each time, instead of attempting to crawl the whole repository (the default):

0 3 * * * nice -n19 /blah/dspace/bin/dspace checker -d 1h -p

Cheers,
Alan

[0] http://docs.oracle.com/javase/7/docs/technotes/guides/vm/server-class.html

On 05/28/2014 05:33 PM, Peter Dietz wrote:

Hi Alan,

At Longsight, we customize the JAVA_OPTS in dspace/bin/dspace:
https://github.com/LongsightGroup/DSpace/blob/longsight-4_x/dspace/bin/dspace#L66

    #Allow user to specify java options through JAVA_OPTS variable
    if [ "$JAVA_OPTS" = "" ]; then
      #Default Java to use 256MB of memory
      JAVA_OPTS="-server -Xmx256m"
    fi

Previously, when I was at Ohio State, I had more in my JAVA_OPTS, to help with permgen issues:
https://github.com/osulibraries/DSpace/blob/osukb/dspace/bin/dspace#L66

    #Allow user to specify java options through JAVA_OPTS variable
    if [ "$JAVA_OPTS" = "" ]; then
      #Default Java to use 256MB of memory
      JAVA_OPTS="-server -Xmx512m -XX:MaxPermSize=128m -XX:+CMSClassUnloadingEnabled"
    fi

By adding -server you're ensuring that Java runs in server mode, as opposed to client mode. Server mode has a slower initial startup, but a better memory footprint and better performance for longer-running tasks, as per:
http://stackoverflow.com/questions/198577/real-differences-between-java-server-and-java-client

Then, if one of our clients has some jumbo-sized content that just isn't completing the cron jobs, we'll temporarily bump the Xmx memory limit high, such as 4G.

Peter Dietz
Longsight
www.longsight.com
pe...@longsight.com
p: 740-599-5005 x809

On Tue, May 27, 2014 at 7:03 PM, Terry Brady tw...@georgetown.edu wrote:

Alan,

We override JAVA_OPTS for the nightly filter-media task in our cron:

export JAVA_OPTS=-Xmx1200m;dspace filter-media ...

We have a set of automated ingest tools. We set JAVA_OPTS in some of the workflows that are run by those tools.
https://github.com/Georgetown-University-Libraries/batch-tools/blob/master/bin-src/dspaceBatch.sh

Terry

--
Terry Brady
Applications Programmer Analyst
Georgetown University Library Information Technology
https://www.library.georgetown.edu/lit/code
425-298-5498
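The pattern in the snippets quoted in this thread (use the caller's JAVA_OPTS if present, otherwise fall back to a default) can be sketched in isolation as below. The function name `default_java_opts` is hypothetical; the default value is the one from Peter's first snippet and the override value is the one from Terry's cron line.

```shell
#!/bin/sh
# default_java_opts: hypothetical helper mirroring the check quoted from
# dspace/bin/dspace. Prints the caller's JAVA_OPTS if it is set and
# non-empty, otherwise a default.
default_java_opts() {
    if [ -z "${JAVA_OPTS:-}" ]; then
        echo "-server -Xmx256m"
    else
        echo "$JAVA_OPTS"
    fi
}

unset JAVA_OPTS
default_java_opts                                 # nothing set: default applies
(JAVA_OPTS="-Xmx1200m"; default_java_opts)        # per-job override wins
```

Because the default only applies when the variable is empty, a single cron entry can raise or lower the heap for one task without touching the script or Tomcat's own settings.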
Re: [Dspace-tech] JAVA_OPTS for cron jobs?
Peter,

A queue would be awesome. You're absolutely right regarding the cron jobs; it's almost like you need to set a weekly reminder to go check the execution times of your DSpace maintenance cron jobs, to make sure they're all completing and not running at the same time. :) I find that I tweak everything and then we add a bunch more content, get a bunch more hits, etc., and all the timings are off again. :P

Cheers,
Alan

On 05/30/2014 05:16 PM, Peter Dietz wrote:

My hammer JAVA_OPTS on our production server, for when some site has crazy big content, is to temporarily run it with:

JAVA_OPTS="-server -Xms256m -Xmx4g -XX:MaxPermSize=256m"

We have 64GB RAM on our boxes, so we'll survive.

Not to derail onto a tangent, but one thing I'd like to see DSpace support is some type of background processing queue. I.e., newly submitted content should be queued to get:

* an initial checksum
* a virus check
* media-filters to generate thumbnails and extract full text
* indexing by Discovery

And then there are maintenance jobs: recompute the checksum, OAI harvest, index-maintenance, ...

New submissions add to the queue, and some scheduler can add maintenance tasks to the queue. This way you don't run into the issue of 3+ concurrent cron jobs because they didn't complete in time. Maybe you can even tie this in to the curation task queue system too. In the past we had a GitHub Enterprise/Firewall, and being an admin of that shows you fancy admin bells and whistles, where you can even inspect the queue. As for what happens if queue growth exceeds its throughput, we'll cross that bridge when we get there.

Peter Dietz
Longsight
www.longsight.com
pe...@longsight.com
p: 740-599-5005 x809
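Short of a real queue, the overlapping-cron-jobs problem raised in this thread could be softened with a simple lock around each maintenance task. The sketch below is an assumption-laden illustration, not a DSpace feature: `run_exclusive` is a made-up name, the lock path is arbitrary, and it relies only on `mkdir` being atomic.

```shell
#!/bin/sh
# run_exclusive: hypothetical wrapper, not a DSpace feature.
# Uses an atomic mkdir as a lock, so a job is skipped (status 75) when
# another job holding the same lock directory is still running.
run_exclusive() {
    lockdir=$1
    shift
    if ! mkdir "$lockdir" 2>/dev/null; then
        echo "skipped: $lockdir is held by another job" >&2
        return 75
    fi
    "$@"                 # run the wrapped command with its arguments
    rc=$?
    rmdir "$lockdir"     # release the lock
    return "$rc"
}

# Example with a harmless command standing in for a DSpace task.
run_exclusive "${TMPDIR:-/tmp}/dspace-maintenance.lock" echo "job ran"
```

One caveat of this design: if a job is killed hard (for example by the OOM killer), the lock directory is left behind and has to be removed by hand before the next run.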
[Dspace-tech] Anomaly in moving collection to new community
Hi,

I've just moved a collection from CommunityA to CommunityB using SQL like this:

UPDATE community2collection SET community_id=8 WHERE community_id=140 and collection_id=132;

I followed that by stopping Tomcat and running an `index-init`. Now I see the collection listed in the new community, but when I browse to the collection itself, the breadcrumb trail in the header still lists the old community as its parent. I'm on DSpace 3.1 with PostgreSQL, and I've tested this same procedure in a test environment with the same DB / environment, where it worked as expected. Any pointers? :)

--
Alan Orth alan.o...@gmail.com
Re: [Dspace-tech] Anomaly in moving collection to new community
On 05/28/2014 10:26 AM, Hilton Gibson wrote:

Hi Alan

Check the cache options. Try clearing the cache when logged in as admin using the XMLUI.

Cheers
hg

*Hilton Gibson*
Ubuntu Linux Systems Administrator
JS Gericke Library, Room 1025D, Stellenbosch University, Private Bag X5036, Stellenbosch 7599, South Africa
Tel: +27 21 808 4100 | Cell: +27 84 646 4758
http://scholar.sun.ac.za http://bit.ly/goodir http://library.sun.ac.za http://za.linkedin.com/in/hiltongibson

Hilton,

Ah, it was indeed the Cocoon cache. Thank you, I was going crazy. :)

--
Alan Orth alan.o...@gmail.com
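For future readers repeating this kind of move, the steps from this thread (the SQL update, then a reindex, then clearing the Cocoon cache from the XMLUI as admin) could start from a reviewed statement like the one printed below. The helper name `move_collection_sql` is hypothetical and the script runs nothing; the table and column names are the ones from the original message.

```shell
#!/bin/sh
# move_collection_sql: hypothetical helper that only PRINTS SQL for review.
# Arguments: collection_id, old community_id, new community_id.
# Anything that is not a plain non-negative integer is refused.
move_collection_sql() {
    for id in "$1" "$2" "$3"; do
        case $id in
            ''|*[!0-9]*) echo "not a numeric id: '$id'" >&2; return 1 ;;
        esac
    done
    cat <<EOF
BEGIN;
UPDATE community2collection SET community_id = $3
 WHERE community_id = $2 AND collection_id = $1;
SELECT community_id, collection_id FROM community2collection
 WHERE collection_id = $1;
-- Check the rows shown above, then type COMMIT; or ROLLBACK;
EOF
}

move_collection_sql 132 140 8
```

The SELECT inside the transaction shows the new parent before anything is committed, which helps separate a genuinely wrong row from a stale cache like the one that caused the confusion here.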
[Dspace-tech] JAVA_OPTS for cron jobs?
Hi,

I'm curious if anyone sets memory limits for DSpace's various cron jobs? Lately we've been having Tomcat's java process get killed every morning around the same time, and all dmesg shows is that java was killed by the kernel's OOM killer. Catalina logs don't show any SEVERE errors, so I have to assume it's the cron jobs which are using up loads of memory and then confusing the kernel, which then identifies Tomcat's java as the memory hog and kills it.

So I'm just curious if anyone has had these kinds of problems, and if/what they set their JAVA_OPTS to in crontab. The long term plan of course is to move to a machine with more memory (currently 4GB).

DSpace version is 3.1, OS is Ubuntu 12.04.

Thanks,

--
Alan Orth alan.o...@gmail.com
Re: [Dspace-tech] Pxpdf-mediafilter configuration
Morning from Nairobi, Robert. :)

Here are the notes I took when I prepared our servers for Xpdf thumbnails:

https://gist.github.com/alanorth/b71a458e9b83c3a8015a

They are basically a copy/paste-able version of the ones in the DSpace docs helix84 linked to.

Alan

On 02/06/2014 05:14 PM, helix84 wrote:

I see. Did you skip this step?
https://wiki.duraspace.org/display/DSDOC4x/Configuration+Reference#ConfigurationReference-Fetchandinstalljai_imageioJAR

If you didn't, did you do it under the same user as you're building DSpace? Above you're building DSpace as root, so in that scenario you'd also need to install jai_imageio to root's ~/.m2/.

Regards,
~~helix84

Compulsory reading: DSpace Mailing List Etiquette https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
I have always wished for my computer to be as easy to use as my telephone; my wish has come true because I can no longer figure out how to use my telephone. -Bjarne Stroustrup, inventor of C++
GPG Public Key: 0xf92c4bd91084bb5de14e20be9470dd588dd1026c
Re: [Dspace-tech] Pxpdf-mediafilter configuration
Also, you will need to enable Xpdf in dspace.cfg... just in case the docs don't make sense, here's my commit which does this for our DSpace:

https://github.com/ilri/DSpace/commit/30c1460bb17d5e357a4002503da8cc066c823449

Alan
[Dspace-tech] Heisenbug with item mapping in AIP import
Hi,

I'm trying to migrate a community hierarchy between two different DSpace instances using AIP Import (in -s mode), and I'm seeing unpredictable behavior with mapped items. I've been trying to identify a pattern, but so far have only identified the following cases:

* Some item views show only some of the collections they are mapped to, but if you navigate to another collection you can see it there
* Some items are mapped to incorrect collections entirely

Has anyone else noticed this? Both DSpaces are 3.1 with PostgreSQL 9.1, on Linux of course.

Thanks,

--
Alan Orth alan.o...@gmail.com
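For reference, the two ingest modes of the AIP packager differ mainly in one flag: `-s` submits the package as new content under a parent given with `-p`, while `-r` restores it. The sketch below only prints example command lines; the helper name `aip_ingest_cmd`, the e-mail address, the handle, and the file name are placeholders, and the exact options should be checked against the AIP Backup and Restore documentation for the version in use.

```shell
#!/bin/sh
# aip_ingest_cmd: hypothetical helper that only PRINTS a packager command.
# mode "submit"  -> -s (new submission under the parent given as $4)
# mode "restore" -> -r (restore mode)
# -a asks the packager to process child objects recursively.
aip_ingest_cmd() {
    mode=$1 eperson=$2 file=$3 parent=${4:-}
    case $mode in
        submit)  echo "dspace packager -s -a -t AIP -e $eperson -p $parent $file" ;;
        restore) echo "dspace packager -r -a -t AIP -e $eperson $file" ;;
        *)       echo "unknown mode: $mode" >&2; return 1 ;;
    esac
}

# Placeholder values, for illustration only.
aip_ingest_cmd submit  admin@example.org community-aip.zip 1234/0
aip_ingest_cmd restore admin@example.org community-aip.zip
```

Printing rather than executing keeps the example safe to run anywhere, and makes it easy to compare the two invocations side by side before trying either on a test instance.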
Re: [Dspace-tech] Heisenbug with item mapping in AIP import
Hey, Tim.

That's an excellent explanation, thanks! I now understand the trade-offs between AIP in -s and -r mode. Re-creating item mappings is indeed a tricky issue. In the notes you added to the AIP docs you mention a possible strategy for dealing with this (#3). I will have to think about the problem a bit more and decide whether it's worth my time as a sysadmin, or if I should just create my ~10 collections manually, export/import all items using CSV, and tell my editors to suck it up and re-map them. We originally wanted to use AIP because a few dozen items have bitstreams, but now it's become a problem I've spent a few too many hours solving. ;)

Cheers,
Alan

On 01/22/2014 07:51 PM, Tim Donohue wrote:

I realized these item mapping issues were not well documented. So, I've added a warning to the "Submitting an AIP Hierarchy" section of the AIP Backup and Restore docs:

https://wiki.duraspace.org/display/DSDOC4x/AIP+Backup+and+Restore#AIPBackupandRestore-SubmittinganAIPHierarchy

See the warning that says "Item Mappings may not be maintained when submitting an AIP hierarchy". There are a few possible workarounds noted there. Unfortunately, none are perfect at this time.

On 1/22/2014 10:32 AM, Tim Donohue wrote:

Hi Alan,

Because you are running the AIP import in -s mode, this acts as a *new submission* and will assign *new handles* to Items, Collections and Communities. The reason this is important to note is that in AIPs, *handles are the unique identifier used to maintain relationships between objects*.

Let's repeat that: *In AIPs, handles are the unique identifier used to maintain relationships between objects* :)

What this means is the following:

* Suppose you have a DSpaceInstance#1 containing an Item with Handle 1234/10 which is in a Collection with Handle 1234/2 and mapped to another Collection with Handle 1234/5.
* When you export this Item to an AIP, it will generate an AIP named ITEM@1234/10.zip. Inside this AIP (in a METS file) it will be recorded that this Item is owned by a Collection with Handle 1234/2 and mapped to a Collection with Handle 1234/5.
* When you *import* this Item's AIP into another DSpace (DSpaceInstance#2) using the -s option, here's what happens. By default, -s will import the Item to whatever Collection you specify (i.e. it ignores the parent object handle specified in the AIP). So, the Item will end up under the Collection you expect.
* HOWEVER, Item Mappings are an entirely different issue. When it comes to Item Mappings, DSpace will just map the Item to the Collection(s) specified in the AIP, as unfortunately DSpace has no way to determine whether the Handle of the mapped Collections has changed or not. DSpace also has no way to 100% verify that the Collection with Handle 1234/5 in DSpaceInstance#2 is the SAME AS the Collection with Handle 1234/5 in DSpaceInstance#1.

So, the problem here may be that you are using the -s option to import Communities/Collections. When using the -s option, DSpace is going to assign a *brand new handle* to each Community/Collection during the import process (unless you specify -o ignoreHandle=false to keep the existing handle). Although DSpace will retain the hierarchy of newly submitted Communities/Collections/Items (because -o ignoreParent=true is the default), it may have difficulty maintaining the *Item Mappings* between collections (as mappings are always recorded by Collection Handle, and Collection Handles may have changed when you moved this content between DSpace instances).

This is one of the big differences between -r (restore) and -s (submit) modes. The former (-r) ensures that Handles are maintained/restored (therefore item mappings and everything else will be restored properly). The latter (-s) specifically assigns *new Handles* to all objects. This has the potential to cause issues with Item Mapping, though a Community-Collection-Item hierarchy will work fine.
Not sure if that helps, but I think this is what you are seeing. It's essentially a known issue, because unfortunately the only unique external identifier DSpace has is Handles. Therefore, when an object's Handle *changes*, attempting to maintain all mappings becomes extremely complex. - Tim On 1/22/2014 10:04 AM, Alan Orth wrote: Hi, I'm trying to migrate a community hierarchy between two different DSpace instances using AIP Import (in -s mode), and I'm seeing unpredictable behavior with mapped items. I've been trying to identify a pattern, but so far have only identified the following cases: * Some item views show only some of the collections they are mapped to, but if you navigate to another collection you can see it there * Some items are mapped to incorrect collections entirely Has anyone else noticed this? Both DSpaces are 3.1 with PostgreSQL 9.1, on Linux of course. Thanks, -- Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke I have
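To make Tim's point concrete, here is a minimal sketch of why mappings keyed on handles go wrong once handles change. It is a toy model only: plain dicts stand in for DSpace objects, the function name and the collection names ("Reports", "Theses") are made up for illustration, and none of it is DSpace API. The handles are the ones from Tim's example.

```python
# Toy model of handle-based item mappings. Plain dicts stand in for DSpace
# objects; none of these names are real DSpace API.

def resolve_mappings(aip, collections_by_handle):
    """Look up the mapped-collection handles recorded in an Item AIP.

    Returns (names of collections found under those handles,
             handles that do not exist in the target instance).
    """
    found = [collections_by_handle[h] for h in aip["mapped"] if h in collections_by_handle]
    missing = [h for h in aip["mapped"] if h not in collections_by_handle]
    return found, missing


# Item 1234/10 as exported from instance 1: owned by 1234/2, mapped to 1234/5.
item_aip = {"handle": "1234/10", "owner": "1234/2", "mapped": ["1234/5"]}

# Restore (-r): collections keep their handles, so the mapping resolves correctly.
restored = {"1234/2": "Reports", "1234/5": "Theses"}

# Submit (-s): the same collections received brand-new handles.
submitted = {"1234/20": "Reports", "1234/21": "Theses"}

# Submit (-s) into an instance where 1234/5 already names something else.
reused = {"1234/20": "Reports", "1234/21": "Theses", "1234/5": "Unrelated collection"}

for label, target in [("restore", restored), ("submit", submitted), ("submit, handle reused", reused)]:
    found, missing = resolve_mappings(item_aip, target)
    print(f"{label}: mapped to {found}, unresolved {missing}")
```

In this model the restore case finds "Theses", the plain submit case finds nothing (the mapping is silently lost), and the reused-handle case maps the item to an unrelated collection, which looks a lot like the two symptoms described at the top of the thread.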
Re: [Dspace-tech] Migrate community with AIP?
Ah, thank you, helix. That's exactly what I needed. I hadn't noticed the command-line metadata export (I've only been using the web one). After importing via AIP (to capture community/collection hierarchy and bitstreams), I exported via CSV and reset the dc.date.accessioned and dc.date.available to a sensible string manually, like 2014-01-21T18:58:46Z. Minor detail, but will probably help some future travelers. Cheers, Alan
On 01/21/2014 12:13 AM, helix84 wrote: See the -a flag and the configuration option. https://wiki.duraspace.org/display/DSDOC4x/Batch+Metadata+Editing#BatchMetadataEditing-ExportFunction
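The date cleanup Alan describes (resetting dc.date.accessioned and dc.date.available in an exported CSV to a string like 2014-01-21T18:58:46Z) could be scripted roughly as follows. This is only a sketch under assumptions: the column headers, the `||` separator between multiple values, and the sample row are guesses at what an export might contain, so check them against a real export before relying on it.

```python
import csv
import io
from datetime import datetime, timezone

# Assumed column headers; a real export may differ (e.g. language suffixes).
DATE_COLUMNS = ("dc.date.accessioned", "dc.date.available")


def dspace_timestamp(moment):
    """Format an aware datetime as UTC, e.g. 2014-01-21T18:58:46Z."""
    return moment.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


def reset_dates(csv_text, stamp, columns=DATE_COLUMNS):
    """Return the CSV with every listed date column overwritten by one stamp."""
    reader = csv.DictReader(io.StringIO(csv_text))
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=reader.fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        for column in columns:
            if column in row:
                row[column] = stamp  # replaces doubled values with a single one
        writer.writerow(row)
    return out.getvalue()


# Hypothetical export row with the doubled dates left behind by an -s import.
sample = (
    "id,dc.date.accessioned,dc.date.available,dc.title\n"
    "42,2013-11-02T08:00:00Z||2014-01-20T10:15:00Z,"
    "2013-11-02T08:00:00Z||2014-01-20T10:15:00Z,Example item\n"
)

stamp = dspace_timestamp(datetime(2014, 1, 21, 18, 58, 46, tzinfo=timezone.utc))
cleaned = reset_dates(sample, stamp)
print(cleaned)
```

The cleaned CSV would then go back in through the batch metadata import, after a dry run on a test instance.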
Re: [Dspace-tech] Migrate community with AIP?
Hi, I've just decided I will export the metadata (CSV) and clean it up manually, then re-import before I export via AIP. This works great for the dc.identifier.uri (handle link), but I just realized that dc.date.accessioned and dc.date.available aren't in the exported metadata. I assume these fields are in the database, so I'll have to use SQL to clean them up after importing via AIP? I'm not sure where to look in the DB... Thanks, Alan
[Dspace-tech] Migrate community with AIP?
Hi, I've got a development instance where we uploaded a few hundred items (in one community and several collections). Our editors spent some time manually uploading bitstreams to many of these items. Now I want to migrate the community and its hierarchy to the production instance. We can't use the CSV via Export Metadata because of the bitstreams, so I've been looking at using AIP, i.e.:
dspace packager -s -a -t AIP -e m...@us.org -p 10568/0 33474.zip
This works great, but the resulting items now have two of each of the following fields:
- dc.date.accessioned
- dc.date.available
- dc.identifier.uri
I can't figure out a workflow that doesn't produce this effect... Thanks, Alan Orth
Re: [Dspace-tech] Migrate community with AIP?
Thanks, both Tim and Helix. Yes, I initially looked into the -r mode, but then realized that, as Tim mentioned, our development instance doesn't necessarily create proper handles. Our development instance is more of a code-testing ground, and we don't sync the content very frequently. Also, the date-related metadata isn't necessarily correct either, as the accession into the development instance (for quality assurance) isn't necessarily the accession date we'd want. I think I'll have to rely on a two-step approach: first ingesting via AIP to get community/collection hierarchy and bitstreams, then metadata cleanup of the resulting community to clean the old URIs and accession dates etc. Thanks for bouncing some ideas around! Alan

On 01/16/2014 06:39 PM, Tim Donohue wrote: Hi Alan,

On 1/16/2014 9:10 AM, Alan Orth wrote: Hi, I've got a development instance where we uploaded a few hundred items (in one community and several collections). Our editors spent some time manually uploading bitstreams to many of these items. Now I want to migrate the community and its hierarchy to the production instance. We can't use the CSV via Export Metadata because of the bitstreams, so I've been looking at using AIP, i.e.:
dspace packager -s -a -t AIP -e m...@us.org -p 10568/0 33474.zip
This works great, but the resulting items now have two of each of the following fields:
- dc.date.accessioned
- dc.date.available
- dc.identifier.uri
I can't figure out a workflow that doesn't produce this effect...

These three fields are unfortunately auto-generated by DSpace whenever you treat an AIP as a submission information package (SIP), which is what the -s option does. Essentially, the '-s' option assumes this is new content, so DSpace defines these fields as:
* dc.date.accessioned - the date this new content was added to DSpace
* dc.date.available - the date this new content became available in DSpace (i.e. finished approval workflow)
* dc.identifier.uri - the assigned Handle for this object

For your situation, you may need to consider some metadata-related questions.
* Does your development instance assign proper Handles? If not, then you *need* Production to assign a new dc.identifier.uri. This may mean that you'll have to unfortunately do some post-metadata cleanup (perhaps via the Bulk Metadata Editor) of the invalid development handles in the dc.identifier.uri fields. DSpace never overwrites or removes existing metadata.
* Do you want the date.accessioned and date.available fields to be set to the dates the Item was added to *development* or to *production*? If the latter, again, you may unfortunately need to do some post-metadata cleanup, as DSpace specifically *never* removes/overwrites existing metadata fields.

Depending on your setup/answers to your questions, there are three possible AIP import options I can see:

1a) Use the Restore/Replace option instead (-r) when migrating to Production. If you treat this as an AIP restoration then DSpace will skip creating date.accessioned, date.available and identifier.uri fields and assume that the provided values in the AIPs are correct (as it assumes you are restoring a set of deleted objects). WARNING: If the 'dc.identifier.uri' in the AIP does NOT correspond to a valid Handle, then you will end up with invalid Handles in Production! (See next option.) More on Restore/Replace: https://wiki.duraspace.org/display/DSDOC4x/AIP+Backup+and+Restore#AIPBackupandRestore-Restoring/ReplacingusingAIP(s)

1b) When using Restore/Replace, you may want/need to override some of the default options. For example, restoration will always assume the 'dc.identifier.uri' is a valid Handle (so a new Handle will not be assigned). Restoration will also always attempt to restore an object under the *specified* parent object in the AIP -- so, this means if a Collection was under a Community with ID 123456789/1 in your development instance, then it will be restored under a Community of the *same ID* in Production. Luckily, these defaults can be overridden. See the 'ignoreHandle' and 'ignoreParent' Advanced options documented here: https://wiki.duraspace.org/display/DSDOC4x/AIP+Backup+and+Restore#AIPBackupandRestore-AdditionalPackagerOptions

2) The other option is to still use the Submission (-s) option, but use one or more of the Advanced options (in 1b) to tweak the defaults.

I know this is a lot of info, but hopefully it gives you some ideas to go on. - Tim
[Dspace-tech] Ansible playbook to deploy DSpace
Hey, all. I spent yesterday afternoon writing an Ansible playbook[0] for deploying DSpace on Ubuntu 12.04. Ansible is an infrastructure orchestration framework, similar to Chef, Puppet, etc. I like it because it doesn't need daemons or databases, and just piggybacks over SSH and uses vanilla Python 2.6+. The playbook is still kinda rough around the edges, but does work! The focus is a bit different than the vagrant-dspace[1] stuff, which is more geared toward quickly spinning up VMs for hacking on (and testing) DSpace master, whereas I wanted to short-circuit the setup process for new DSpace servers for our environment (since I seem to be setting up a lot of testing/production environments lately, both virtual and physical). Check it out, try it, and send me pull requests. :)
[0] https://github.com/alanorth/ansible-dspace
[1] https://github.com/DSpace/vagrant-dspace
Alan Orth
Re: [Dspace-tech] PDF Thumbnail Resolution
Terry, That's very interesting. I just tried your code and it works great. I compared it to the XPDF method as well as the manual thumbnailing we've been using (embarrassingly) for a few years[0]. The colors in the PDF I used as an example are a bit exaggerated, but it's not too bad and the thumbnail is much more crisp. BTW, I made a pull request[1] on your repo to add some installation / configuration instructions to your README.md. Cheers! Alan
[0] http://i.imgur.com/xzH5tcL.png
[1] https://github.com/Georgetown-University-Libraries/DSpaceImageMagickThumnails/pull/1

On 10/15/2013 06:34 PM, Terry Brady wrote: We had a similar need. I created a module to build thumbnails with ImageMagick and Ghostscript. If this sounds useful, you can see the code here: https://github.com/Georgetown-University-Libraries/DSpaceImageMagickThumnails Terry

On Tue, Oct 15, 2013 at 11:08 AM, RENTON Scott scott.ren...@ed.ac.uk wrote: Thanks folks. I've emailed Jason as you can probably see! Scott

On 15/10/2013 16:02, RENTON Scott scott.ren...@ed.ac.uk wrote: Thanks Ivan, I will do. In fact I will mail him now before I forget! Cheers, Scott

On 15/10/2013 16:01, helix84 heli...@centrum.sk wrote:

On Tue, Oct 15, 2013 at 4:31 PM, RENTON Scott scott.ren...@ed.ac.uk wrote: Yeah, that's the code I'm using. I'm not too surprised if the JPEG work is only for images; I would say the PDF thumbnails work is quite an obscure requirement. Anyway, if this is not changing, I would be happy to tell the users that this is the best they can get for the moment; they may decide not to implement, but that's up to them!

Sorry about the inaccurate first answer. I see that both the fixed JPEGFilter/BrandedPreviewJPEGFilter and your XPDF2Thumbnail use the awt libraries to manipulate bitmaps. Therefore it might be possible to apply the same fixes to XPDF2Thumbnail as were applied to the JPEG filters. Try asking Jason Sherman jsher...@usao.edu about it, he's the author of the fixes and might be willing to do it for PDF, too. Regards, ~~helix84
Re: [Dspace-tech] PDF Thumbnail Resolution
Jason, To clarify, you sent Scott a solution for PDF thumbnails? I'd be very interested in testing it as well. :) We're still testing XPDF and Terry's Ghostscript solution for PDF thumbnails so I'd love to compare. Cheers, Alan

On 11/27/2013 02:00 AM, Jason Sherman wrote: I sent Scott a solution based on my JPEG thumbnail fixes off-list for testing. I suppose I should have kept it on list so that everyone was on the same page. Live and learn! I can share my version if anyone is interested, but it sounds like Terry's solution does the job.
[Dspace-tech] [RFC] Incorrect Java Vendor in DSpace control panel
Hi, I've just realized that DSpace tries to be helpful and print the Java JDK vendor in the control panel, but gets it utterly wrong and is ultimately misleading. For example:
Java Runtime Environment Vendor: Oracle Corporation
But in reality, we're using OpenJDK:
[aorth@dspace: ~]$ java -version
java version "1.7.0_21"
OpenJDK Runtime Environment (IcedTea 2.3.9) (7u21-2.3.9-0ubuntu0.12.04.1)
OpenJDK 64-Bit Server VM (build 23.7-b01, mixed mode)
I guess DSpace is using the wrong string to determine the vendor, i.e.:
java.vendor=Oracle Corporation
When it should probably be using:
java.vm.name=OpenJDK 64-Bit Server VM
We were recently working with a vendor (@mire!) and we told them to test their addons with Oracle JDK, as it appeared that's what we were using. I think it's more relevant to report the implementation vendor, not the specification vendor here. If I make an issue on Jira and a GitHub commit, would people appreciate that? Cheers, Alan Orth
Re: [Dspace-tech] [RFC] Incorrect Java Vendor in DSpace control panel
Ok, I've created a Jira Bug[0] and submitted a patch[1]. Cheers, Alan
[0] https://jira.duraspace.org/browse/DS-1629
[1] https://github.com/alanorth/DSpace/commit/fba6c5b9771f04374daf10e7b20c288e85b6f715
[Dspace-tech] Musings on metadata
Hi,

We're experimenting with adding new metadata schemas on DSpace. I come from a computer science background, so I am thinking hierarchically, like DNS and LDAP, where the hierarchies are essentially name spaces with the most common part at the top, moving to the most specific at the bottom:

    .
    org.
    example.
    www.

OR

    dc=org
    dc=example
    dc=www

Naturally, I want to create a schema in DSpace which looks like this:

    org.example.subject
    org.example2.subject

My library guy prefers:

    org.subject.example
    org.subject.example2

Are there any technical reasons why we should use one over the other?

Cheers,
--
Alan Orth
Re: [Dspace-tech] Musings on metadata
Helix,

I was only using rubbish examples because I was trying not to bore you with the details of our complex organization :) ... but basically, there are 15 large, entirely separate companies/organizations in this loosely-related group of agricultural research centers (we call it the CG). Several of us share one DSpace installation. In my mind it makes sense to use a schema/namespace/whatever like:

    cg.org1.{subject,author,type}
    cg.org2.{subject,author,type}

Would that work? I assume we just need to use the same convention in our input-forms and XMLUI, etc... Or am I misunderstanding the way it works in DSpace?

Thanks,
Alan

On 07/18/2013 11:53 AM, helix84 wrote:
> I think I understand now. I don't have a definitive answer, but I can offer you my point of view. You are trying to define different schemata for different organizational units. A more natural fit in DSpace would be to do this at the namespace level (the first part, which we wrongly call schema). So IMHO, a more natural fit for DSpace would be:
>
>     org-group1.subject
>     org-group2.subject
>
> You may want to consult this on dspace-general or with DCAT, who are currently dealing with this kind of thing and may offer better advice.
>
> Regards,
> ~~helix84
> Compulsory reading: DSpace Mailing List Etiquette https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>
> On Thu, Jul 18, 2013 at 10:37 AM, Alan Orth alan.o...@gmail.com wrote:
>> Helix,
>>
>> Perhaps it was a poor example. As a sysadmin I of course abhor www as well, but it was an easy example to illustrate DNS hierarchy; mail.example.org. would have worked as well for demonstration purposes. :)
>>
>> To clarify, I'm naturally more comfortable with a format like this:
>>
>>     org.group1.subject
>>     org.group2.subject
>>
>> Where org is a large, common parent organization, and group1 and group2 are autonomous groups in this organization. Each group will have their own special, non-overlapping subjects, special terminology, authors, etc.
>>
>> The alternative, as my librarian suggests, is:
>>
>>     org.subject.group1
>>     org.subject.group2
>>
>> Are there any technical merits to using one convention over the other? We had previously been polluting DC with things like dc.xzysubject.subject, which is what we want to move away from.
>>
>> Thanks,
>> Alan
>>
>> On 07/18/2013 11:13 AM, helix84 wrote:
>>> On Thu, Jul 18, 2013 at 8:54 AM, Alan Orth alan.o...@gmail.com wrote:
>>>> org.subject.example
>>>> org.subject.example2
>>>
>>> Hi Alan,
>>>
>>> this is the principle behind Dublin Core, which the DSpace metadata schema is generally based on. The second part from the left (element) is less specific, while the third one (qualifier) is more specific. Since DNS and LDAP use the same principle, I don't really see how you came up with the first one. If that's based on www being the same value in the third part from the left, there's no real reason for that. A web server FQDN doesn't really have to start with www (and arguably shouldn't, see e.g. no-www.org for reasons), so I see this particular convention more as a coincidence than a rule.
>>>
>>> Just to make sure, can you give a specific example of such metadata in your repository?
>>>
>>> Regards,
>>> ~~helix84
Re: [Dspace-tech] Musings on metadata
Ok,

Yeah, now that I realize there are no *technical* reasons (and since it's really about best practice), I will go ask on dspace-general.

Cheers,
Alan

On 07/18/2013 01:21 PM, helix84 wrote:
> Yes, it would work either way. You were asking about best practice, though.
>
> Regards,
> ~~helix84
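To make the difference between the conventions in this thread concrete, here is a minimal sketch that splits a field name into the schema, element and optional qualifier parts helix84 describes. MetadataField is a hypothetical class written for illustration; it is not part of the DSpace API and does no validation against a metadata registry.

```java
// Hypothetical value class, not part of the DSpace API: splits a field name
// of the form schema.element[.qualifier] into its parts.
final class MetadataField {
    final String schema;
    final String element;
    final String qualifier; // null when the field is unqualified

    MetadataField(String schema, String element, String qualifier) {
        this.schema = schema;
        this.element = element;
        this.qualifier = qualifier;
    }

    static MetadataField parse(String name) {
        // limit -1 keeps trailing empty parts so "dc.type." is rejected below
        String[] parts = name.split("\\.", -1);
        if (parts.length < 2 || parts.length > 3) {
            throw new IllegalArgumentException("expected schema.element[.qualifier]: " + name);
        }
        for (String part : parts) {
            if (part.isEmpty()) {
                throw new IllegalArgumentException("empty part in: " + name);
            }
        }
        return new MetadataField(parts[0], parts[1], parts.length == 3 ? parts[2] : null);
    }

    @Override
    public String toString() {
        return "schema=" + schema + ", element=" + element + ", qualifier=" + qualifier;
    }

    public static void main(String[] args) {
        // The three conventions discussed in the thread.
        System.out.println(parse("cg.org1.subject"));    // organization as element
        System.out.println(parse("cg.subject.org1"));    // organization as qualifier
        System.out.println(parse("org-group1.subject")); // organization as schema
    }
}
```

All three names parse, which matches the conclusion that either layout works technically; what changes is whether the organization ends up in the schema, the element or the qualifier position.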
Re: [Dspace-tech] LDAP permssion
Thanks, Helix.

In this case the problem was that this particular user had registered with an email address (long before we moved to LDAP) which didn't match the email address in LDAP; i.e. ao...@example.org instead of a.o...@example.org. We added the user to the appropriate groups and all is well.

Cheers,
Alan

On 06/20/2013 11:14 AM, helix84 wrote:
> On Thu, Jun 20, 2013 at 9:47 AM, Webshet, Sisay (ILRI) s.webs...@cgiar.org wrote:
>> We implemented LDAP/SSL. Our e-people don't have the same permissions as with the normal DSpace password login when they log in through LDAP.
>
> Use login.specialgroup and/or login.groupmap.* to assign LDAP users to DSpace groups as described in https://wiki.duraspace.org/display/DSDOC3x/Authentication+Plugins#AuthenticationPlugins-ConfiguringLDAPAuthentication
>
> Keep in mind that LDAP groups are assigned dynamically, only for the duration of the login session, so a user's membership in such a group won't be visible to the administrator using the usual tools. You may, however, check the effective group membership in the user's profile while they are logged in.
>
> Regards,
> ~~helix84
Re: [Dspace-tech] Questions on upgrading from 1.5 to 3.1 and maintaining a live and test instance
Hi, Spencer.

Regarding development/production instances, we maintain our code on GitHub[0], with 3_x-dev and 3_x-prod branches. Code modifications like XMLUI tweaks, input form modifications, etc. are tested and pushed to the development branch for scrutiny by our editors and users, and then pushed to production every Monday (this creates a predictable workflow and timeline for both devs and users). From time to time we manually re-install the development instance with a new snapshot of the database and assetstore from production.

Both of our instances live on the same box, but we use separate domain names and Apache httpd / Tomcat virtual hosting to keep the instances separate.

Hope that helps,
Alan

[0] https://github.com/ilri/DSpace/

On 06/19/2013 09:23 PM, Spencer Lamm wrote:
> Thanks everyone! Helix84, we'll give your plan a try next week. I'll let you know about anything we learn that would be good for your potential wiki entry.
>
> On Tue, Jun 18, 2013 at 4:39 AM, helix84 heli...@centrum.sk wrote:
>> On Mon, Jun 17, 2013 at 6:52 PM, Spencer Lamm sla...@swarthmore.edu wrote:
>>> 1) This wiki post by Mark Diggory details how to upgrade on the same box without overwriting - essentially install the new code, diff the config file differences, copy the db files to the new instance, and run the db upgrades. We need to upgrade on a new server due to OS issues. Are there any additional considerations replicating Mark's process on a new server?
>>> https://wiki.duraspace.org/display/DSPACE/To+upgrade+without+overwriting
>>
>> Hi Spencer,
>>
>> using a new server when upgrading is actually a procedure I would recommend over an in-place upgrade, especially in case of a large version jump. I outlined the procedure here (I really need to make a wiki page for it):
>> http://dspace.2283337.n4.nabble.com/Migrate-1-8-to-3-0-database-td4661490.html
>> http://dspace.2283337.n4.nabble.com/migrate-repository-into-a-new-instances-machine-td4660930.html
>>
>> Basically, make sure you read through each of the upgrading instructions (1.5-1.6, 1.6-1.7, ...). You don't need to do all the steps, but at the very least you have to run the database migration scripts in the correct order. Then, in the last one (1.8-3.1) follow the repeated steps you previously skipped.
>>
>>> 2) Our assetstore is on a Compellent SAN accessible via a server mount. What I would love to do is use this upgrade as an opportunity to set up live and development DSpace servers that are as close as possible to being exact replicas. Is it possible to have two DSpace instances pointing to the same assetstore? Do database changes in DSpace upgrades affect the assetstore?
>>
>> Yes, it's possible, but I strongly recommend against it! If you did, you could delete a file in the staging instance and it would also be missing in the production instance assetstore, but the production instance wouldn't know about it, because it would still have a record that it exists in its database. What you can do is make separate assetstores and use some deduplication method (because most of the files will be the same) - from primitive ones like hardlinks to advanced ones like deduplication on ZFS - check with your SAN admin.
>>
>> Regards,
>> ~~helix84
>
> --
> Spencer Lamm
> Digital Initiatives Librarian
> Swarthmore College Library
> sla...@swarthmore.edu
> 610-328-8541
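helix84's point about running the database migrations in the correct order can be sketched as a simple ordered walk over the release sequence. UpgradePath is a hypothetical helper for planning only; the release list is taken from the versions named in this thread (with "3.x" standing in for the 3.0/3.1 target), and the actual migration script names are deliberately not shown because they should be taken from each release's upgrade instructions.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical planning helper, not a DSpace tool: lists the version hops
// whose database migrations must be run, in order.
final class UpgradePath {
    // Release sequence as named in the thread; "3.x" is a label chosen here.
    static final List<String> RELEASES = Arrays.asList("1.5", "1.6", "1.7", "1.8", "3.x");

    static List<String> steps(String from, String to) {
        int start = RELEASES.indexOf(from);
        int end = RELEASES.indexOf(to);
        if (start < 0 || end < 0 || start > end) {
            throw new IllegalArgumentException("unsupported upgrade: " + from + " -> " + to);
        }
        List<String> steps = new ArrayList<>();
        for (int i = start; i < end; i++) {
            // One hop per adjacent pair of releases; no hop may be skipped.
            steps.add(RELEASES.get(i) + " -> " + RELEASES.get(i + 1));
        }
        return steps;
    }

    public static void main(String[] args) {
        for (String step : steps("1.5", "3.x")) {
            System.out.println("run database migration for " + step);
        }
    }
}
```

The only design point is that every intermediate hop appears exactly once and in sequence, which is the part helix84 says cannot be skipped even when other upgrade steps are deferred to the last hop.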
Re: [Dspace-tech] SMTP Authentication failed
Hey, helix.

I just had a go at solving this... :)

https://gist.github.com/alanorth/5652740

This works, both for `dspace test-email` as well as the Forgot Password function. I will comment on the Jira bug. I'm not sure how DSpace releases go, but is this type of fix a candidate for 3.1.x or 3.2?

Cheers,
Alan

On 05/25/2013 01:58 PM, helix84 wrote:
> On Sat, May 25, 2013 at 12:54 PM, Alan Orth alan.o...@gmail.com wrote:
>> Thanks for doing that for me. I'll make sure to keep an eye on DS-1561.
>
> I marked you as the reporter and added the issue to your watchlist, so you will receive any updates to that issue. I'll try to prepare a fix when I can get to it.
>
> Regards,
> ~~helix84
Re: [Dspace-tech] SMTP Authentication failed
Helix,

I hear you about changing past behavior, but something to consider: the move to build.properties and Maven filtering broke the setup we've been using since DSpace 1.5! This may merit a special case. Of course we'll have to see how the wider DSpace community feels about this, and whether we can unearth any corner cases and/or side effects.

Cheers,
Alan

On 05/26/2013 04:26 PM, helix84 wrote:
> On Sun, May 26, 2013 at 3:17 PM, Alan Orth alan.o...@gmail.com wrote:
>> I will comment on the Jira bug. I'm not sure how DSpace releases go, but is this type of fix a candidate for 3.1.x or 3.2?
>
> Glad to hear that. I'm actually on the fence with this one - in a certain aspect it is a fix, but it changes the past behaviour, which also brings potential to break a corner case. There is still the possibility that sending an empty username and password serves a purpose, I just don't know about it.
>
> Since DSpace 3 the release numbering actually changed and 3.2 would be the next bugfix release and 4.0 will be the next major release.
>
> Regards,
> ~~helix84
Re: [Dspace-tech] SMTP Authentication failed
Bram,

Sorry, I meant to say that Maven filtering broke our setup with regards to SMTP only. I migrated our configs to *.properties based on the default build.properties and we've had no other problems. :)

Alan

On 05/26/2013 05:35 PM, Bram Luyten wrote:
> Hi Alan,
>
> did you trace back the origin of why it broke? I had an older setup with maven profiles that also didn't immediately build dspace 3.x, because a new property, dspace.install.dir, was not present in my maven profiles. Info about this in particular: https://github.com/DSpace/DSpace/pull/126
>
> Did you experience any other incompatibilities?
>
> rgds
> Bram
> --
> Bram Luyten, @mire
> 2888 Loker Avenue East, Suite 315, Carlsbad, CA. 92010
> Esperantolaan 4, Heverlee 3001, Belgium
> www.atmire.com
>
> On Sun, May 26, 2013 at 3:40 PM, Alan Orth alan.o...@gmail.com wrote:
>> Helix, I hear you about changing past behavior, but something to consider: the move to build.properties and Maven filtering broke the setup we've been using since DSpace 1.5! This may merit a special case. Of course we'll have to see how the wider DSpace community feels about this, and whether we can unearth any corner cases and/or side effects.
>>
>> Cheers,
>> Alan
Re: [Dspace-tech] SMTP Authentication failed
Yes, I've verified again that commenting out mail.server.username and mail.server.password in dspace.cfg and then restarting Tomcat fixes it. For some reason reloading the context via Tomcat admin didn't work...

For now I'll have to remember to comment those lines out after deployment, or look into using authenticated SMTP.

Alan

On 05/23/2013 12:54 PM, Alan Orth wrote:
> Helix,
>
> Correct; if I comment out mail.server.username and mail.server.password in the generated / final dspace.cfg, sending email works.
>
> Also, I was mistaken before regarding `dspace test-email`: it behaves the same way as the emails from Forgot Password etc. I had previously stated that these were working regardless, but I was wrong.
>
> This is probably a corner case for people using un-authenticated SMTP I guess... Ironically, I've been yelling at our ISP to enable authentication for years ;)
>
> Alan
>
> On 05/23/2013 12:34 PM, helix84 wrote:
>> OK, here's how I understood it: if you comment out mail.server.username and mail.server.password, it works, but if you leave them empty, it doesn't work (presumably because it sends empty values).
>>
>> On Thu, May 23, 2013 at 11:20 AM, Alan Orth alan.o...@gmail.com wrote:
>>> If I comment out the mail.* settings in my production.properties file I get an error about circular definitions:
>>
>> Yes, unfortunately, this is a known limitation of the Maven filtering process as documented here:
>> https://wiki.duraspace.org/display/DSDOC3x/Configuration#Configuration-Thebuild.propertiesConfigurationPropertiesFile
>>
>> I guess the only solution we can provide in DSpace is not to send username and password if they're both empty. The relevant code is here:
>> https://github.com/DSpace/DSpace/blob/dspace-3.1/dspace-api/src/main/java/org/dspace/core/Email.java#L265
>>
>> It seems to me the code should behave the same (username == null) regardless of whether mail.server.username is commented out or has an empty value. But I would need to verify this.
>>
>> Regards,
>> ~~helix84
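The difference between a commented-out key and a key with an empty value, which this thread turns on, can be reproduced with plain java.util.Properties. This is only a sketch: DSpace reads dspace.cfg through its own configuration layer, which may treat empty values differently, and MailAuthCheck and shouldAuthenticate are hypothetical names illustrating the rule proposed above (skip authentication when the username is blank), not the code in Email.java.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.Properties;

// Hypothetical sketch, not DSpace code: shows that "key =" yields an empty
// string while a commented-out key yields null, and applies the proposed rule.
final class MailAuthCheck {
    static Properties load(String text) throws IOException {
        Properties props = new Properties();
        props.load(new StringReader(text));
        return props;
    }

    // Proposed rule from the thread: authenticate only when a non-blank
    // username is configured, so empty and missing values behave the same.
    static boolean shouldAuthenticate(Properties config) {
        String username = config.getProperty("mail.server.username");
        return username != null && !username.trim().isEmpty();
    }

    public static void main(String[] args) throws IOException {
        Properties commented = load("mail.server = my.smtp.com\n# mail.server.username =\n");
        Properties empty = load("mail.server = my.smtp.com\nmail.server.username =\n");

        // A naive "username != null" check would differ between these two.
        System.out.println("commented out: " + commented.getProperty("mail.server.username"));
        System.out.println("empty value:   '" + empty.getProperty("mail.server.username") + "'");
        System.out.println("authenticate (commented)? " + shouldAuthenticate(commented));
        System.out.println("authenticate (empty)?     " + shouldAuthenticate(empty));
    }
}
```

With a check like this, both generated configurations shown in the thread would result in an unauthenticated SMTP session, which is the behaviour the un-authenticated setup needs.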
Re: [Dspace-tech] SMTP Authentication failed
Helix,

Thanks for doing that for me. I'll make sure to keep an eye on DS-1561.

Cheers,
Alan

On 05/25/2013 01:37 PM, helix84 wrote:
> https://jira.duraspace.org/browse/DS-1561
>
> Regards,
> ~~helix84
Re: [Dspace-tech] SMTP Authentication failed
Ok, I've done some more tests...

When I specify my mail settings in a custom config -- [dspace-src]/dspace/config/production.cfg -- and then install with `ant update -Dconfig=../../config/production.cfg`, sending mail via DSpace works fine. The [dspace]/config/dspace.cfg generated by the build/ant process has these SMTP settings:

    # SMTP mail server
    mail.server = my.smtp.com
    # SMTP mail server authentication username and password (if required)
    # mail.server.username =
    # mail.server.password =
    # SMTP mail server alternate port (defaults to 25)
    # mail.server.port =

When I use a stock DSpace config -- [dspace-src]/dspace/config/dspace.cfg -- with a custom build.properties -- [dspace-src]/production.properties -- and then build with -Denv=production and install with `ant update`, sending mail via DSpace errors with javax.mail.AuthenticationFailedException. The [dspace]/config/dspace.cfg generated by the build/ant process has these SMTP settings:

    # SMTP mail server
    mail.server = smtp.serv.cgnet.com
    # SMTP mail server authentication username and password (if required)
    mail.server.username =
    mail.server.password =
    # SMTP mail server alternate port (defaults to 25)
    mail.server.port = 25

If I comment out the mail.* settings in my production.properties file I get an error about circular definitions:

    BUILD FAILED
    /home/aorth/src/git/DSpace/dspace/target/dspace-3.1-build/build.xml:88: Property mail.server.password was circularly defined.

I think the build.properties way of doing things is a step forward, as it allows for management of multiple DSpace instances from one code base, but it's a bit confusing! Also, we're using unauthenticated SMTP, so these username/password settings are really not useful to us.

Thanks,
Alan

On 05/21/2013 04:45 PM, Alan Orth wrote:
> Ok, I've just reverted to our all-in-one dspace.cfg and emails work fine. Sounds like there are some issues with my build.properties (dev.properties and production.properties)...
>
> Alan
>
> On 05/21/2013 03:32 PM, helix84 wrote:
>> On Tue, May 21, 2013 at 2:08 PM, Alan Orth alan.o...@gmail.com wrote:
>>> Here's the full stack trace, including the Caused by:
>>> https://gist.github.com/alanorth/4e942aa252764a3d87aa/raw/2c4a9ad30f537386cc3bb1c9a885204e949038e2/gistfile1.txt
>>
>> OK, in this case that didn't tell me anything new.
>>
>>> My mail.* properties from dspace.cfg:
>>
>> Looks good.
>>
>>> mail.allowed.referrers = cgspace.cgiar.org
>>
>> I have localhost here, so try this:
>>
>>     mail.allowed.referrers = localhost, cgspace.cgiar.org
>>
>> But I don't think this is the issue (the error would be different).
>>
>>> We did switch to using build.properties (one for dev and production) with a common dspace.cfg, though... I will try to use our old config file maybe...
>>
>> This sounds like a likely culprit to me. Make sure you didn't comment out any properties present in the default build.properties. You have to leave the value empty instead of commenting it out.
>>
>> Regards,
>> ~~helix84
Re: [Dspace-tech] SMTP Authentication failed
Helix,

Correct; if I comment out mail.server.username and mail.server.password in the generated / final dspace.cfg, sending email works.

Also, I was mistaken before regarding `dspace test-email`: it behaves the same way as the emails from Forgot Password etc. I had previously stated that these were working regardless, but I was wrong.

This is probably a corner case for people using un-authenticated SMTP I guess... Ironically, I've been yelling at our ISP to enable authentication for years ;)

Alan

On 05/23/2013 12:34 PM, helix84 wrote:
> OK, here's how I understood it: if you comment out mail.server.username and mail.server.password, it works, but if you leave them empty, it doesn't work (presumably because it sends empty values).
>
> On Thu, May 23, 2013 at 11:20 AM, Alan Orth alan.o...@gmail.com wrote:
>> If I comment out the mail.* settings in my production.properties file I get an error about circular definitions:
>
> Yes, unfortunately, this is a known limitation of the Maven filtering process as documented here:
> https://wiki.duraspace.org/display/DSDOC3x/Configuration#Configuration-Thebuild.propertiesConfigurationPropertiesFile
>
> I guess the only solution we can provide in DSpace is not to send username and password if they're both empty. The relevant code is here:
> https://github.com/DSpace/DSpace/blob/dspace-3.1/dspace-api/src/main/java/org/dspace/core/Email.java#L265
>
> It seems to me the code should behave the same (username == null) regardless of whether mail.server.username is commented out or has an empty value. But I would need to verify this.
>
> Regards,
> ~~helix84
[Dspace-tech] SMTP Authentication failed
All,

I'm getting a javax.mail.AuthenticationFailedException[0] when trying to send emails from DSpace, such as the Forgot Password function. Other things like workflow alerts simply fail silently (users don't receive alerts). I've tested the following:

1. SMTP server is working (send mail manually via port 25, HELO, etc)
2. `dspace test-email` is working.

We are using unauthenticated SMTP mail and our dspace.cfg configuration has not changed. We're running DSpace 3.1 on Linux with Tomcat 6.

Thanks!

[0] https://gist.github.com/alanorth/5618260/raw/89e508170e4878ec4f88f43a01b5ebf86a7b82de/gistfile1.txt

--
Alan Orth
Re: [Dspace-tech] SMTP Authentication failed
Hey, Helix.

Here's the full stack trace, including the Caused by:

https://gist.github.com/alanorth/4e942aa252764a3d87aa/raw/2c4a9ad30f537386cc3bb1c9a885204e949038e2/gistfile1.txt

My mail.* properties from dspace.cfg:

# SMTP mail server
mail.server = smtp.serv.cgnet.com
# SMTP mail server authentication username and password (if required)
mail.server.username =
mail.server.password =
# SMTP mail server alternate port (defaults to 25)
mail.server.port = 25
mail.from.address = cgspace-supp...@cgiar.org
mail.admin = a.o...@cgiar.org
# Recipient for new user registration emails
# Set the default mail character set. This may be overridden by providing a line
# inside the email template charset: encoding, otherwise this default is used.
mail.charset = UTF-8
# A comma-separated list of hostnames that are allowed to refer browsers to email forms.
mail.allowed.referrers = cgspace.cgiar.org
# Pass extra settings to the Java mail library. Comma-separated, equals sign between
#mail.extraproperties = mail.smtp.socketFactory.port=465, \
# mail.smtp.socketFactory.class=javax.net.ssl.SSLSocketFactory, \
# mail.smtp.socketFactory.fallback=false
# An option is added to disable the mailserver. By default, this property is set to false
# By setting mail.server.disabled = true, DSpace will not send out emails.
# It will instead log the subject of the email which should have been sent
#mail.server.disabled = false
# By default RSS feeds, OAI-PMH and subscription emails will include ALL items
# For backwards compatibility, the subscription emails by default include any modified items
# uncomment the following entry for only new items to be emailed
# since that usually contains email addresses which ought to be kept

If I recall correctly we upgraded from DSpace 1.8 to 3.1 in March, but one user reported that emails were working last month (April). On the server side, I can't think of anything that changed (it's Debian 6.0, so there are not many new packages these days). We did switch to using build.properties (one for dev and one for production) with a common dspace.cfg, though... I will try going back to our old config file.

Cheers,
Alan

On 05/21/2013 02:45 PM, helix84 wrote:
> Isn't there a Caused by line just after the snippet you sent?
>
> I'm baffled why sending email from DSpace wouldn't work when using test-email works - both use the same send() method of the org.dspace.core.Email class. The only difference is that one is launched from command line, the other one from Tomcat.
>
> Can you think of any changes you made between when the email last worked and now? Also attach your mail.* properties from dspace.cfg. We're looking for an inconspicuous problem here.
>
> Regards,
> ~~helix84
>
> Compulsory reading: DSpace Mailing List Etiquette
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
Re: [Dspace-tech] SMTP Authentication failed
Ok, I've just reverted to our all-in-one dspace.cfg and emails work fine. It sounds like there are some issues with my build.properties (dev.properties and production.properties)...

Alan

On 05/21/2013 03:32 PM, helix84 wrote:
> On Tue, May 21, 2013 at 2:08 PM, Alan Orth alan.o...@gmail.com wrote:
>> Here's the full stack trace, including the Caused by:
>> https://gist.github.com/alanorth/4e942aa252764a3d87aa/raw/2c4a9ad30f537386cc3bb1c9a885204e949038e2/gistfile1.txt
>
> OK, in this case that didn't tell me anything new.
>
>> My mail.* properties from dspace.cfg:
>
> Looks good.
>
>> mail.allowed.referrers = cgspace.cgiar.org
>
> I have localhost here, so try this:
>
> mail.allowed.referrers = localhost, cgspace.cgiar.org
>
> But I don't think this is the issue (the error would be different).
>
>> We did switch to using build.properties (one for dev and production) with a common dspace.cfg, though... I will try to use our old config file maybe...
>
> This sounds like a likely culprit to me. Make sure you didn't comment out any properties present in the default build.properties. You have to leave the value empty instead of commenting it out.
>
> Regards,
> ~~helix84
>
> Compulsory reading: DSpace Mailing List Etiquette
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
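helix84's warning about commented-out build.properties entries can be checked mechanically. The sketch below assumes that a property missing from build.properties leaves a literal `${...}` token in the generated dspace.cfg; that is plausible for Ant-style token filtering but is not confirmed anywhere in this thread. It lists the mail.* keys whose values still contain such a token. The function name and the sample text are hypothetical.

```python
import re

# Matches an unreplaced Ant-style token such as ${mail.server.username}
PLACEHOLDER = re.compile(r"\$\{[^}]+\}")


def unresolved_keys(cfg_text, prefix="mail."):
    """Return keys starting with prefix whose value still contains ${...}."""
    hits = []
    for line in cfg_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue  # skip blank lines, comments, and non-assignments
        key, value = (part.strip() for part in line.split("=", 1))
        if key.startswith(prefix) and PLACEHOLDER.search(value):
            hits.append(key)
    return hits


# Illustrative config text, not taken from a real installation
sample = """
mail.server = smtp.example.org
mail.server.username = ${mail.server.username}
mail.server.password =
"""
print(unresolved_keys(sample))  # ['mail.server.username']
```

A non-empty username of this kind might be enough to make a mail library attempt authentication, which would fit the symptom reported here, but that remains a guess. The parser ignores backslash line continuations, so it is only suitable for a quick look.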
Re: [Dspace-tech] Aggressive XMLUI cache
Peter,

Unfortunately I haven't found a pattern. It seems quite random, basically happening while clicking around our site. My test was to visit several collections in communities which use custom XMLUI themes, and a few collections into this exercise I'd see mismatched elements (CSS from one theme, banner from another!).

Regarding load/memory: I haven't found that our server (development or production) is using more memory than before. Also, it seems to be just as snappy as it ever was (if it ever was snappy!).

It should be noted that since I've ruled out client-side caching as the cause of this problem, I've encouraged somewhat more aggressive client-side caching via mod_headers [1], which will help reduce requests for static content. I'll have to keep an eye on it to see if load from dynamic pages increases.

Regards,
Alan

[1] https://gist.github.com/alanorth/5530986/raw/911457283ce9a4acacdbdf86edd7b65fe2db4753/gistfile1.txt

On 04/29/2013 07:03 PM, Peter Dietz wrote:
> Hi Alan,
>
> Disabling the cache altogether was too high a performance cost for us. Depending on your usage, you might be fine though. If you can narrow your problems with incorrectly cached pages down to a number of specific activities or use cases, then it is worth debugging those to find problems in the cache logic. For example, for recent submissions the code had a bug where it didn't nullify the validity of the object, so Cocoon assumed that it could continue to present the cache. Adding some code to nullify the validity upon changes made the caching behave as we intended.
>
> Peter Dietz
>
> On Mon, Apr 29, 2013 at 8:17 AM, Bram Luyten b...@atmire.com wrote:
>> Hi Alan,
>>
>> great to hear that disabling the pipeline cache did not substantially increase your memory footprint in development. Did you see the same on production?
>>
>> best regards,
>> Bram
>>
>> --
>> Bram Luyten, @mire
>> 2888 Loker Avenue East, Suite 315, Carlsbad, CA 92010
>> Esperantolaan 4, Heverlee 3001, Belgium
>> www.atmire.com
>>
>> On Sat, Apr 20, 2013 at 4:51 PM, Alan Orth alan.o...@gmail.com wrote:
>>> Hi,
>>>
>>> So for now I *seem* to have found a workaround: setting the pipeline to noncaching. I found the hint referenced in Jira DS-298, and have modified my code [1] to use the noncaching pipeline type. It's been working on my development instance for a few hours now, and I've been refreshing quite a bit, browsing a number of other themes, etc, and still no cache corruption persisting across this particular theme which was so often affected before. Fingers crossed!
>>>
>>> [1] https://gist.github.com/alanorth/5426188
>>>
>>> On Fri, Apr 19, 2013 at 8:36 PM, Alan Orth alan.o...@gmail.com wrote:
>>>> All,
>>>>
>>>> I've been battling with overly-aggressive XMLUI caching ever since version 1.7. We use DSpace 3.1 with 14 different XMLUI themes, and after a fresh restart it only takes around 5 minutes before themes start getting jumbled between communities. 1.8 introduced the ability to clear the Cocoon Cache from the Control Panel, but that only solves the problem temporarily.
>>>>
>>>> I see a few XMLUI caching issues on JIRA, but they look to be only slightly related:
>>>>
>>>> * https://jira.duraspace.org/browse/DS-298
>>>> * https://jira.duraspace.org/browse/DS-871
>>>>
>>>> Client-side caching is expressly prohibited via mod_headers in our httpd reverse proxy, so I'm positive it's not a browser issue.
>>>>
>>>> Can anyone shed insight on this?
>>>>
>>>> Adios,
>>>>
>>>> --
>>>> Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
Re: [Dspace-tech] Aggressive XMLUI cache
Hi,

So for now I *seem* to have found a workaround: setting the pipeline to noncaching. I found the hint referenced in Jira DS-298, and have modified my code [1] to use the noncaching pipeline type. It's been working on my development instance for a few hours now, and I've been refreshing quite a bit, browsing a number of other themes, etc, and still no cache corruption persisting across this particular theme which was so often affected before. Fingers crossed!

[1] https://gist.github.com/alanorth/5426188

On Fri, Apr 19, 2013 at 8:36 PM, Alan Orth alan.o...@gmail.com wrote:
> All,
>
> I've been battling with overly-aggressive XMLUI caching ever since version 1.7. We use DSpace 3.1 with 14 different XMLUI themes, and after a fresh restart it only takes around 5 minutes before themes start getting jumbled between communities. 1.8 introduced the ability to clear the Cocoon Cache from the Control Panel, but that only solves the problem temporarily.
>
> I see a few XMLUI caching issues on JIRA, but they look to be only slightly related:
>
> * https://jira.duraspace.org/browse/DS-298
> * https://jira.duraspace.org/browse/DS-871
>
> Client-side caching is expressly prohibited via mod_headers in our httpd reverse proxy, so I'm positive it's not a browser issue.
>
> Can anyone shed insight on this?
>
> Adios,
>
> --
> Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
Re: [Dspace-tech] Export archive restricted?
Helix,

I just did a more thorough test on demo.dspace.org. I added Demo Community Administrator as an administrator of the haris community [1] and Demo Collection Administrator as an administrator of the haris collection [2]. When either is logged in, a new menu item appears in their respective community/collection: Context -> Export Metadata. Despite the option appearing, neither of them is able to actually export the metadata; the error message is as before, "This export archive is restricted."

Note that I'm not exporting communities or collections, just the metadata; it seems you must be a site-wide administrator to be allowed to export metadata as a CSV. I'm not sure why the option appears if it's not allowed...

Hope that clarifies and possibly helps someone else.

[1] http://demo.dspace.org/xmlui/handle/10673/2432
[2] http://demo.dspace.org/xmlui/handle/10673/2433

On 04/17/2013 03:45 PM, helix84 wrote:
> Hi Alan,
>
> I admit I did only a single test on demo where I exported a community using the community admin's account, but the test was successful, as you can still see when you look at that account's My Exports.
>
> I used the dspacedemo+commad...@gmail.com account to export the following community (which he's an admin of):
>
> http://demo.dspace.org/xmlui/handle/10673/1
>
> What did you do differently while testing on demo? Are you sure the community or collection admin you tried was really an admin for that particular community/collection?
>
> Regards,
> ~~helix84
>
> Compulsory reading: DSpace Mailing List Etiquette
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
[Dspace-tech] Aggressive XMLUI cache
All,

I've been battling with overly-aggressive XMLUI caching ever since version 1.7. We use DSpace 3.1 with 14 different XMLUI themes, and after a fresh restart it only takes around 5 minutes before themes start getting jumbled between communities. 1.8 introduced the ability to clear the Cocoon Cache from the Control Panel, but that only solves the problem temporarily.

I see a few XMLUI caching issues on JIRA, but they look to be only slightly related:

* https://jira.duraspace.org/browse/DS-298
* https://jira.duraspace.org/browse/DS-871

Client-side caching is expressly prohibited via mod_headers in our httpd reverse proxy, so I'm positive it's not a browser issue.

Can anyone shed insight on this?

Adios,

--
Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
Re: [Dspace-tech] Export archive restricted?
Thanks, helix. My memory of the exact error wording wasn't as good as I thought! :)

In my experience the Export Metadata link directly downloads a CSV file; I don't see My Exports...? I'll try to replicate this experiment on demo.dspace.org.

Thanks,
Alan

On 04/16/2013 11:37 PM, helix84 wrote:
> It saves time to list the exact wording ("This export archive is restricted."). Here's what the comment at that place in code says:
>
> // The user does not have read access to this bitstream. Interrupt this current request
> // and then forward them to the login page so that they can be authenticated. Once that is
> // successful they will request will be resumed.
>
> I don't use that functionality, so I don't know what's wrong. One guess I would take is - if you made the export and he's trying to download it, he probably can't (although I'm not sure your export would show up in his My Exports). Is the message the same when he tried to do the export as opposed to downloading the export?
>
> BTW you may test the functionality on demo.dspace.org. From my test, it works (both exporting and downloading the export).
>
> Regards,
> ~~helix84
>
> Compulsory reading: DSpace Mailing List Etiquette
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
Re: [Dspace-tech] Export archive restricted?
Responding to myself here:

It seems that you must be a super admin in order to export metadata from a collection, i.e., community- and collection-level admins cannot export metadata from their communities / collections unless they are also top-level admins.

Alan

On 04/17/2013 10:00 AM, Alan Orth wrote:
> Thanks, helix. My memory of the exact error wording wasn't as good as I thought! :)
>
> In my experience the Export Metadata link directly downloads a CSV file; I don't see My Exports...? I'll try to replicate this experiment on demo.dspace.org.
>
> Thanks,
> Alan
>
> On 04/16/2013 11:37 PM, helix84 wrote:
>> It saves time to list the exact wording ("This export archive is restricted."). Here's what the comment at that place in code says:
>>
>> // The user does not have read access to this bitstream. Interrupt this current request
>> // and then forward them to the login page so that they can be authenticated. Once that is
>> // successful they will request will be resumed.
>>
>> I don't use that functionality, so I don't know what's wrong. One guess I would take is - if you made the export and he's trying to download it, he probably can't (although I'm not sure your export would show up in his My Exports). Is the message the same when he tried to do the export as opposed to downloading the export?
>>
>> BTW you may test the functionality on demo.dspace.org. From my test, it works (both exporting and downloading the export).
>>
>> Regards,
>> ~~helix84
>>
>> Compulsory reading: DSpace Mailing List Etiquette
>> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
[Dspace-tech] Export archive restricted?
All,

I've got a community admin who is trying to export metadata from his community as a CSV. He's an admin of the community and he's logged in, but DSpace tells him that "Export of this archive is restricted." If I log in as myself it works (I'm the super admin!).

Anyone have an idea what I've presumably misconfigured? helix84? ;)

Thanks!

--
Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
[Dspace-tech] Strange issue with DSpace 3.0
All,

I'm just testing a DSpace 1.8.2 to 3.0 upgrade and I'm having some strange issues. All my custom themes and config work, but I get this weird error when trying to look at the DSpace Configuration tab in the control panel:

org.dspace.app.xmlui.wing.WingInvalidArgument: The 'characters' parameter is required for list items.

Full Java stack trace is here: http://pastebin.com/raw.php?i=9Mu5EQUU

Anyone have an idea what's wrong? My platform is Debian 6, Sun Java 6 1.6.0_18, Tomcat 6.

Cheers,

--
Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
Re: [Dspace-tech] Strange issue with DSpace 3.0
Helix,

Ah, so you're right. For the record, I had been comparing our configs to those in the dspace-3.0 release in preparation for upgrading from 1.8. I pulled over some new syntax changes, including what I thought were defaults in the new config file, such as db.maxconnections. Defining them as listed in 3.0's dspace.cfg [1] resulted in a build error:

BUILD FAILED
/home/aorth/src/git/DSpace/dspace/target/dspace-3.0-build/build.xml:88: Property db.maxconnections was circularly defined.

... so I left it blank (or commented it out, I can't remember now). That's how I ended up with the weird Cocoon crashes.

Anyways, I'm now up and running. If you think it's worth it, I'll submit a bug report / pull request so we can hopefully stop others from running into this same error?

Cheers,

[1] https://github.com/DSpace/DSpace/blob/dspace-3_x/dspace/config/dspace.cfg#L74

On Mon, Mar 11, 2013 at 6:10 PM, helix84 heli...@centrum.sk wrote:
> Hi Alan,
>
> you can determine the cause by looking at this part of the stacktrace:
>
> Caused by: org.dspace.app.xmlui.wing.WingInvalidArgument: The 'characters' parameter is required for list items.
>     at org.dspace.app.xmlui.wing.element.AbstractWingElement.require(AbstractWingElement.java:117)
>     at org.dspace.app.xmlui.wing.element.List.addItem(List.java:292)
>     at org.dspace.app.xmlui.aspect.administrative.ControlPanel.addDSpaceConfiguration(ControlPanel.java:511)
>
> This tells you that the 'characters' parameter is an empty string and addItem uses the require() method to check for that. If you go up a call, addItem() is called from the ControlPanel class, specifically line 511:
>
> https://github.com/DSpace/DSpace/blob/dspace-3_x/dspace-xmlui/src/main/java/org/dspace/app/xmlui/aspect/administrative/ControlPanel.java#L511
>
> The parameter passed to addItem() is the db.maxconnections property from dspace.cfg. So it seems you either didn't define it or left it empty.
>
> If you didn't touch this property in dspace.cfg, the problem is in build.properties. All properties defined in build.properties are transferred to dspace.cfg when you execute ant (update or fresh_install). You have to make sure you do not comment out any line in build.properties. Read the warning here:
>
> https://wiki.duraspace.org/display/DSDOC3x/Configuration#Configuration-Thebuild.propertiesConfigurationPropertiesFile
>
> Regards,
> ~~helix84
>
> Compulsory reading: DSpace Mailing List Etiquette
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
In heaven all the interesting people are missing. -Friedrich Nietzsche
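The rule helix84 describes (leave a value empty, never comment the line out) lends itself to a quick comparison of a local build.properties against the default one. What follows is a minimal sketch under stated assumptions: the function names are hypothetical, the property values are made up for illustration, and the parser only understands simple `key = value` lines, not the full Java properties syntax.

```python
def defined_keys(properties_text):
    """Return the keys that are defined (even with an empty value) and not commented out."""
    keys = set()
    for line in properties_text.splitlines():
        line = line.strip()
        if not line or line.startswith(("#", "!")) or "=" not in line:
            continue  # blank line, comment, or something this sketch does not parse
        keys.add(line.split("=", 1)[0].strip())
    return keys


def missing_keys(default_text, local_text):
    """Keys present in the default file but absent or commented out locally."""
    return sorted(defined_keys(default_text) - defined_keys(local_text))


# Illustrative contents only; the values are not DSpace's real defaults.
default = "db.maxconnections = 30\ndb.maxwait = 5000\nmail.server.username =\n"
local = "#db.maxconnections = 30\ndb.maxwait = 5000\nmail.server.username =\n"
print(missing_keys(default, local))  # ['db.maxconnections']
```

Note that `mail.server.username =` counts as defined even though it is empty, which matches the advice in the thread: an empty value is acceptable, a commented-out line is not.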
Re: [Dspace-tech] Strange issue with DSpace 3.0
Helix,

I wasn't using any special build.properties; just the default one in the dspace-3.0 tree. I suppose the settings should have been inherited from there, where they *are* defined.

Regarding "defining them as listed", I meant that I just copied the lines over literally from dspace/config/dspace.cfg into our local dspace config. That's when I got the circular definition error, but I can't remember now if I fixed the error by commenting them out or simply setting them to "= nothing". :\

Alan

On Mon, Mar 11, 2013 at 11:12 PM, helix84 heli...@centrum.sk wrote:
> On Mar 11, 2013 9:00 PM, Alan Orth alan.o...@gmail.com wrote:
>> Helix,
>>
>> Ah, so you're right. For the record, I had been comparing our configs to those in the dspace-3.0 release in preparation for upgrading from 1.8. I pulled over some new syntax changes, including what I thought were defaults in the new config file, such as db.maxconnections. Defining them as listed in 3.0's dspace.cfg[1] resulted in a build error:
>
> Glad that solved it for you. What do you mean by "defining them as listed"? What's in your dspace.cfg and what in your build.properties?
>
>> BUILD FAILED
>> /home/aorth/src/git/DSpace/dspace/target/dspace-3.0-build/build.xml:88: Property db.maxconnections was circularly defined.
>>
>> ... so I left it blank (or commented it out, I can't remember now). That's how I ended up with the weird Cocoon crashes.
>
> There's an important difference between those two, as the warning says.
>
>> Anyways, I'm now up and running. If you think it's worth it, I'll submit a bug report / pull request so we can hopefully stop others from running into this same error?
>
> I agree this is unfortunate, but I don't think this can be easily solved properly. The original issue introducing this feature is here:
>
> https://jira.duraspace.org/browse/DS-1208

--
Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
Re: [Dspace-tech] Private config includes?
Thanks, helix. I'll check out the build.properties for 3.0; hopefully I can specify my database configs there. I'll also make sure to comment on the Jira ticket to make my use cases known for that new Dynamic Configuration functionality.

Alan

On 02/25/2013 03:03 PM, helix84 wrote:
> On Mon, Feb 25, 2013 at 12:53 PM, Alan Orth alan.o...@gmail.com wrote:
>> We currently use git to manage our development / production DSpace instances. Because our github repository is public we don't keep our configs under version control (to protect database passwords, etc). This is a bit of a kludge, and requires us to `ant update -Dconfig=configname` as well as keep several config/modules/ directories around (one for production, one for development).
>>
>> I was just thinking it would be nice if there was a way to have a private includes functionality (similar to httpd configuration), so I could include private.conf or something. Does this exist already?
>
> No, there is only the build.properties functionality since 3.0 [1], but that intentionally doesn't allow you to specify all the configuration options.
>
> The proposed Dynamic Configurations functionality [2] is a proposed reimplementation of the configuration service. It's still in the early stages, so make sure to mention your needs in its Jira issue [3]. If it's accepted, it could be available as soon as in 4.0.
>
> [1] https://wiki.duraspace.org/display/DSDOC3x/Configuration#Configuration-Thebuild.propertiesConfigurationPropertiesFile
> [2] https://wiki.duraspace.org/display/~lyncode/Dynamic+Configurations
> [3] https://jira.duraspace.org/browse/DS-1242
>
> Regards,
> ~~helix84
>
> Compulsory reading: DSpace Mailing List Etiquette
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
[Dspace-tech] XMLUI caching, recommended mod_expires settings?
All,

We've been battling on and off with excessive XMLUI caching in our DSpace environment. It seems the server-side issue is less rampant since 1.8.x introduced the ability to clear the Cocoon cache from the Control Panel. Now we're attempting to solve the client-side caching issue, and I'd like to know what settings other people are using.

Our environment uses Apache httpd in front of Tomcat, so we put this in our DSpace virtual host:

### See: http://httpd.apache.org/docs/2.2/mod/mod_expires.html
# Turn on Expires and set default to 1
<IfModule mod_expires.c>
    ExpiresActive On
    ExpiresDefault A1
</IfModule>

I'm wondering if "Access +1 second" is a bit overzealous. Perhaps we should be using the modification time of the file (CSS or GIF, for example). What are you guys using?

Thanks!

--
Alan Orth alan.o...@gmail.com http://alaninkenya.org http://mjanja.co.ke
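To make the difference between the access-based and modification-based forms concrete, here is a small sketch of how the short mod_expires syntax maps to an Expires timestamp: `A<seconds>` counts from the time of the request, `M<seconds>` from the file's modification time. The function name and the dates are illustrative, and this only mimics the arithmetic; it is not how httpd itself is implemented.

```python
from datetime import datetime, timedelta, timezone
from email.utils import format_datetime


def expires_header(spec, access_time, modified_time):
    """Compute an Expires value for the short mod_expires syntax, e.g. 'A1' or 'M86400'.

    'A' counts seconds from the client's access time, 'M' from the file's
    modification time.
    """
    base = {"A": access_time, "M": modified_time}[spec[0]]
    return format_datetime(base + timedelta(seconds=int(spec[1:])), usegmt=True)


# Illustrative timestamps: a file last changed on 1 January, requested on 1 February
access = datetime(2013, 2, 1, 12, 0, 0, tzinfo=timezone.utc)
mtime = datetime(2013, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
print(expires_header("A1", access, mtime))      # Fri, 01 Feb 2013 12:00:01 GMT
print(expires_header("M86400", access, mtime))  # Wed, 02 Jan 2013 00:00:00 GMT
```

The second line shows the catch with the modification-based form: a file that has not changed recently gets an Expires date in the past, so the browser revalidates on every request anyway. As far as I know, httpd also only applies the `M` form to content served from files on disk, which would exclude pages proxied from Tomcat.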
Re: [Dspace-tech] next button for collections view?
Helix,

Yeah, it looks like the bug report you filed [1] describes the issue I'm talking about: the default behavior makes sense to developers, but not to users. I guess DSpace is still evolving, and with each release it's getting more user friendly. Hopefully some more polish comes to the out-of-box experience, possibly by way of some modifications to the default XMLUI theme, or maybe a default collection view (configurable via dspace.cfg). I've seen at least one XMLUI example where recent submissions were in the sidebar, and the main content stressed that you actually have to choose a browse type in order to see the collection contents. We don't want to lose users because they're confused and can't navigate our repositories.

Furthermore, Discovery's faceted search was suggested as a solution, but I'm not entirely sure it addresses this particular usability problem (though it is a cool way to navigate repository data).

Cheers,
Alan

[1] https://jira.duraspace.org/browse/DS-1188

On 06/12/2012 11:38 AM, helix84 wrote:
> I'm not sure if it's clear enough from the issue, but what DSpace expects you to do next when you open a collection or community is to choose a view from the menu: By Issue Date, Titles, Authors... These views are paginated and display all the results.
>
> Regards,
> ~~helix84
[Dspace-tech] next button for collections view?
All,

I just noticed on our DSpace repository that large collections are a bit tricky to browse. Collections display the 5 or so most recent items by default, but there's no logical way for users to continue browsing the collection they're currently in, i.e. a "next page" button. For example:

Community view, showing a large collection: http://i.imgur.com/tArtv.png
Inside the collection: http://i.imgur.com/OZclO.png

We're using XMLUI. Am I missing some simple config option, or do we need to add something to our theme template?

Thanks,

-- 
Alan Orth
[Dspace-tech] git workflow with DSpace 1.8
Hey, all.

In the past my git-based DSpace workflow was this: our DSpace git repository had 'development' and 'production' branches, and I maintained two config files, dev_dspace.cfg and prod_dspace.cfg. When redeploying my web apps I simply specified which config I wanted to use, for example:

    sudo ant -Dconfig=../../prod_dspace.cfg update

This made it easy to maintain one code base for different environments, and it kept our configuration changes out of git. Keeping our changes outside of dspace.cfg makes it easier to merge patches from DSpace master and causes fewer headaches between dev/prod environments.

With DSpace 1.8 the main DSpace config file has been split into several subsections, and my workflow is now broken. Not only do my changes diverge from upstream, I can't maintain dev/prod configs in one code base. I could commit my config changes to git, but then I'd have to worry about passwords and other internal information getting into our public GitHub repository. I could maintain two separate git checkouts, but that seems like a waste.

I'm trying to find the path of least resistance, some sort of balance between ease of use and git-admin fu. :) How are other people handling this?

-- 
Alan Orth
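The pre-1.8 workflow described above, one code base with the config file chosen at deploy time, can be sketched as a small wrapper. This is a minimal Python sketch under stated assumptions: the helper name `ant_update_command` and the `CONFIGS` mapping are hypothetical, the dev path is assumed by analogy with the prod path in the message, and only the `-Dconfig` option shown in the message is used.

```python
import shlex

# Hypothetical mapping from environment name to config file. The prod path
# is the one quoted in the message; the dev path is assumed by analogy.
CONFIGS = {
    "dev": "../../dev_dspace.cfg",
    "prod": "../../prod_dspace.cfg",
}


def ant_update_command(environment):
    """Return the argument list for redeploying with the chosen config file."""
    try:
        config = CONFIGS[environment]
    except KeyError:
        raise ValueError(f"unknown environment: {environment!r}") from None
    return ["ant", f"-Dconfig={config}", "update"]


if __name__ == "__main__":
    # Print the command rather than running it, so this is safe to try
    # outside a DSpace source tree.
    print(shlex.join(ant_update_command("prod")))
```

Because the config files live outside the tracked tree, passwords stay out of git. The sketch does not address the 1.8 problem itself, where settings are spread over several module files that a single `-Dconfig` file no longer covers.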
Re: [Dspace-tech] git workflow with DSpace 1.8
Helix,

Oops, thanks for CC'ing the list. I'm a rubbish programmer, but I might take a look at this kind of functionality. Or maybe Tim/Peter/etc. will decide this feature is really useful and take it up on their own ;).

Alan

On 06/08/2012 03:49 PM, helix84 wrote:
> On Fri, Jun 8, 2012 at 2:44 PM, Alan Orth wrote:
>> Helix,
>>
>> Yeah, I could do that. Maybe something like dev_modules and prod_modules. I wonder how hard it would be to add an option to ant's build script that allowed a custom modules location to be specified. Maybe something like:
>>
>>     -Dmodules=../../config/prod_modules -Dconfig=../../config/prod_dspace.cfg
>>
>> It seems like this might be a useful feature, much like the -Dconfig option.
>>
>> Cheers,
>> Alan
>
> I'm copying your answer back to dspace-tech.
>
> Yes, I think it's a logical complement to -Dconfig. Although I personally don't use it, it might be interesting for others, so if you manage to get it working, be sure to post a patch to the tracker: https://jira.duraspace.org/browse/
>
> Regards,
> ~~helix84
[Dspace-tech] Log file size and management
All,

I'm curious about how people are managing their log files. In particular, I've noticed that dspace.log.* and cocoon.log.* are around 200MB apiece, per day. We zip them at night, but even so, they're taking up a ridiculous amount of storage space on our VPS.

What are people doing with these logs? Offline archival? Also, while the dspace.log.* files are potentially useful if you have to go back and rerun legacy statistics, I'm not sure what the cocoon logs are useful for...?

Adios,

-- 
Alan Orth
Re: [Dspace-tech] Log file size and management
Hey, helix.

I zip my logs nightly with something like this:

    find /home/dspace/log/ \( -iname 'dspace.log.*' -o -iname 'cocoon.log.*' \) ! -iname '*.gz' -daystart -mtime +1 -exec ionice -c2 -n7 gzip {} \;

The parentheses group the two name tests so that the age check and the gzip action apply to both the dspace and the cocoon logs, and the quotes keep the shell from expanding the patterns before find sees them.

I'm more wondering what people use these logs for, and how/if they archive them (especially the cocoon logs).

Alan

On 04/19/2012 10:59 AM, helix84 wrote:
> Hi Alan,
>
> take a look at logrotate. It's a standard unix tool to manage your logs, including splitting logs by day (DSpace does this itself), keeping the last N days around and compressing and/or moving the rest elsewhere. It's very configurable.
>
> Regards,
> ~~helix84
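The selection logic of the nightly job above (match either log family, skip files that are already compressed, skip recent files) can also be written out explicitly. This is a minimal Python sketch, not a replacement for find or logrotate: the function name `compress_old_logs` and its parameters are hypothetical, and the age test is a plain "older than N days" check, which only approximates find's `-daystart -mtime +1`.

```python
import fnmatch
import gzip
import shutil
import time
from pathlib import Path

# Log families named in the thread; matching is case-insensitive like -iname.
PATTERNS = ("dspace.log.*", "cocoon.log.*")


def compress_old_logs(log_dir, min_age_days=1, now=None):
    """Gzip matching logs older than min_age_days; return the new .gz paths."""
    now = time.time() if now is None else now
    cutoff = now - min_age_days * 86400
    compressed = []
    for path in sorted(Path(log_dir).iterdir()):
        name = path.name.lower()
        # Skip directories and anything that is already compressed.
        if not path.is_file() or name.endswith(".gz"):
            continue
        # Keep only files from one of the two log families.
        if not any(fnmatch.fnmatchcase(name, pattern) for pattern in PATTERNS):
            continue
        # Leave recent logs alone; they may still be written to.
        if path.stat().st_mtime > cutoff:
            continue
        target = path.with_name(path.name + ".gz")
        with open(path, "rb") as src, gzip.open(target, "wb") as dst:
            shutil.copyfileobj(src, dst)
        path.unlink()  # remove the original only after the copy succeeded
        compressed.append(target)
    return compressed


if __name__ == "__main__":
    # The directory is the one from the message; adjust for your install.
    for gz_path in compress_old_logs("/home/dspace/log"):
        print("compressed", gz_path)
```

Applying the name filter first and the age filter second, both to every candidate file, mirrors the grouped find expression: neither log family can slip past the age check.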