Hi Monika,

On 30/11/12 03:10, Monika Mevenkamp wrote:
> We are harvesting content from DSpace instances into a LOCKSS network for preservation purposes. The system is set up to initially fetch content using HTTP GET requests and to periodically check for updates using HTTP GET with If-Modified-Since. We were hoping that the DSpace software would support the standard behaviour. This does not appear to be the case. There are two ways to deal with this:

What user agent does LOCKSS send in its requests? Try adding that to the list of known crawlers in the XMLUI sitemap (assuming you are in fact using XMLUI), e.g. in DSpace 1.8.x:

https://github.com/DSpace/DSpace/blob/dspace-1_8_x/dspace-xmlui/dspace-xmlui-webapp/src/main/webapp/sitemap.xmap#L127

As I outlined in my other post to this thread, DSpace should then consult the item's last-modified timestamp when responding to If-Modified-Since requests.

For testing purposes, you should be able to edit [dspace]/webapps/xmlui/sitemap.xmap directly, but make sure to add your changes back to your source tree if they produce the desired results.

cheers,
Andrea

-- 
Dr Andrea Schweer
IRR Technical Specialist, ITS Information Systems
The University of Waikato, Hamilton, New Zealand
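
One way to check whether a DSpace instance honours If-Modified-Since for a given user agent is to send a conditional GET by hand and look at the status code. The following is only a minimal sketch in Python, not anything LOCKSS or DSpace ships: the function names, the item URL and the user agent string are all hypothetical placeholders, so substitute the value your crawler really sends.

```python
from datetime import datetime, timezone
from email.utils import format_datetime
import urllib.error
import urllib.request


def build_conditional_get(url, last_fetched, user_agent):
    """Build a GET request that asks for the body only if it changed.

    last_fetched must be a timezone-aware datetime; it is converted to
    UTC and rendered in the HTTP date format (e.g. "... GMT").
    """
    http_date = format_datetime(last_fetched.astimezone(timezone.utc), usegmt=True)
    headers = {
        "If-Modified-Since": http_date,
        "User-Agent": user_agent,
    }
    return urllib.request.Request(url, headers=headers, method="GET")


def check_for_update(request):
    """Send the request and return (status, Last-Modified header or None).

    A 304 means the server honoured If-Modified-Since; a 200 means the
    server sent the full body again.
    """
    try:
        with urllib.request.urlopen(request) as response:
            return response.status, response.headers.get("Last-Modified")
    except urllib.error.HTTPError as err:
        # urllib reports 304 Not Modified as an HTTPError.
        if err.code == 304:
            return 304, None
        raise


# Hypothetical values: replace with a real item URL and the real user agent.
example_request = build_conditional_get(
    "http://localhost:8080/xmlui/handle/123456789/1",
    datetime(2012, 11, 30, 3, 10, tzinfo=timezone.utc),
    "ExampleCrawler/1.0",
)
```

Calling check_for_update(example_request) once with the crawler's user agent and once with a browser-like one should show whether the sitemap change makes a difference: a 304 for an unchanged item suggests the timestamp is being consulted, while a 200 every time suggests it is not.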
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

