---------- Forwarded message ----------
From: Ondrej Kosarko <ko...@centrum.cz>
Date: 2015-05-27 12:05 GMT+02:00
Subject: Re: [Dspace-tech] OAI-PMH data provider 2.0 not persistent?
To: Christian Scheible <christian.schei...@uni-konstanz.de>
Cc: "dspace-tech@lists.sourceforge.net" <dspace-tech@lists.sourceforge.net>


Hi Christian,

I'm not sure I remember all the setup, but here goes...

in dspace.cfg there is

  event.dispatcher.default.consumers = versioning, discovery, eperson,
harvester, eudatreplication, *xoai*

and

# consumer to maintain the oai index
event.consumer.xoai.class = cz.cuni.mff.ufal.event.OAIIndexEventConsumer
event.consumer.xoai.filters =
Community|Collection|Item|Bundle|Bitstream+Add|Create|Modify|Modify_Metadata|Delete|Remove

and you need to make sure the class is found by the event dispatcher, we
solve it by adding dspace-oai(where the consumer lives) as a dependency to
dspace-xmlui

diff --git a/dspace-xmlui/pom.xml b/dspace-xmlui/pom.xml
index 17d75c0..f19257f 100644
--- a/dspace-xmlui/pom.xml
+++ b/dspace-xmlui/pom.xml
@@ -93,6 +93,22 @@
     </profiles>

     <dependencies>
+     <!-- because of event listener that updates oai -->
+        <dependency>
+               <groupId>org.dspace</groupId>
+               <artifactId>dspace-oai</artifactId>
+        <classifier>classes</classifier>
+        </dependency>

I think that's about it...

Regards,
OK


2015-05-27 9:05 GMT+02:00 Christian Scheible <
christian.schei...@uni-konstanz.de>:

>  Hi Ondřej,
>
> I have seen that you are using your own OAIIndexEventConsumer. We did the
> same but I added It in the module additions (which leads to code
> duplication).
> How do you register your Consumer? Is it the same way as for other
> consumers? Because I couldn't solve that problem.
>
> Best Regards,
> Christian
>
>
> Am 27.05.2015 um 08:41 schrieb Ondřej Košarko:
>
>  In short no...
>
>  As I've suggested in the thread the indexAll should include also the
> withdrawn, that's what we do
> https://github.com/ufal/lindat-dspace/blob/lindat/dspace-oai/src/main/java/org/dspace/xoai/app/XOAI.java#L193
> We use our Event consumer to trigger the update (it currently doesn't
> handle caching and changes in discoverable flag)
> https://github.com/ufal/lindat-dspace/blob/lindat/dspace-oai/src/main/java/cz/cuni/mff/ufal/event/OAIIndexEventConsumer.java
> And by the way dspace changed the deletion mode to transient in
> https://jira.duraspace.org/browse/DS-2491 so maybe the right way for
> dspace is updating the documentation...
>
>
>  OK
>
>
>
> 2015-05-26 23:19 GMT+02:00 Jozef Misutka <misu...@ufal.mff.cuni.cz>:
>
>>  Ondrej, is this happening to us as well?
>>
>>  jm
>>
>>  ------------------------------
>> *From: *"Tim Donohue" <tdono...@duraspace.org>
>> *To: *dspace-tech@lists.sourceforge.net
>> *Sent: *Tuesday, 26 May, 2015 23:09:20
>>
>> *Subject: *Re: [Dspace-tech] OAI-PMH data provider 2.0 not persistent?
>>
>> Hi All,
>>
>> I'm only just now "re-discovering" this thread, as I've found this same
>> behavior in DSpace OAI-PMH servers. Here's how to see this behavior (and
>> I've double checked it on http://demo.dspace.org, running 5.2):
>>
>> 1. Create an Item
>> 2. Run the OAI-PMH import (./dspace oai import).
>> 3. The item now appears in OAI-PMH
>> 4. Withdraw that Item
>> 5. Re-Run the OAI-PMH import (./dspace oai import)
>> 6. The Item still appears in OAI-PMH and is NEVER flagged as withdrawn.
>>
>> So, you can still access all its metadata, etc. The ONLY way to trigger
>> an update to the OAI-PMH record (of the withdrawn item) is to re-import
>> EVERYTHING (./dspace oai import -c). However, as noted, since withdrawn
>> items are not included in the XOAI "indexAll()" command, the item will
>> now disappear from OAI-PMH entirely.
>>
>> Here's that indexAll() command:
>>
>> https://github.com/DSpace/DSpace/blob/master/dspace-oai/src/main/java/org/dspace/xoai/app/XOAI.java#L191
>>
>> This behavior seems to be counter to what is documented at:
>>
>> https://wiki.duraspace.org/pages/viewpage.action?pageId=45548245#OAI-PMHDataProvider2.0%28Internals%29-Deletions
>>
>> The documentation specifically states that "DSpace keeps a permanent
>> record of withdrawn items". It also states that a request for a
>> withdrawn item "will yield the 'record deleted' header".
>>
>> This sounds like a bug to me. The documentation definitely does not
>> match with the behavior.
>>
>> I've created a new bug ticket for this. It will need a volunteer to
>> resolve.
>>
>> https://jira.duraspace.org/browse/DS-2593
>>
>> - Tim
>>
>> On 3/6/2015 5:21 AM, helix84 wrote:
>> > Yes, that makes sense. Still, it needs a brief verification because
>> > sometimes our understanding of a concept and its implementation may
>> > differ.
>> >
>> >
>> > On Fri, Mar 6, 2015 at 10:12 AM, Kristian Roberto Salcedo
>> > <k.r.salc...@ub.uio.no> wrote:
>> >> Hi Ivan,
>> >>
>> >> I might have overlooked something, but from your comments on
>> https://jira.duraspace.org/browse/DS-2491 wouldnt it be meaningful
>> >> to do both your declaration change and Ondřej's suggestion in order to
>> match both what is declared and what the documentation says about OAI and
>> deleted items?
>> >>
>> >> regards,
>> >> Kristian
>> >>
>> >>> -----Original Message-----
>> >>> From: ivan.ma...@gmail.com [mailto:ivan.ma...@gmail.com] On Behalf Of
>> >>> helix84
>> >>> Sent: Thursday, March 05, 2015 3:46 PM
>> >>> To: Kristian Roberto Salcedo
>> >>> Cc: dspace-tech@lists.sourceforge.net; João Melo
>> >>> Subject: Re: [Dspace-tech] OAI-PMH data provider 2.0 not persistent?
>> >>>
>> >>> Hi Kristian,
>> >>>
>> >>> I think you're right that the declared
>> >>> <deletedRecord>persistent</deletedRecord> doesn't match how DSpace
>> >>> behaves. I filed a Jira issue and created a pull request to change
>> the declared
>> >>> status to <deletedRecord>transient</deletedRecord>.
>> >>>
>> >>> https://jira.duraspace.org/browse/DS-2491
>> >>>
>> >>>
>> >>> Regards,
>> >>> ~~helix84
>> >>>
>> >>> Compulsory reading: DSpace Mailing List Etiquette
>> >>> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>> >>>
>> >>>
>> >>> On Thu, Mar 5, 2015 at 2:52 PM, Kristian Roberto Salcedo
>> >>> <k.r.salc...@ub.uio.no> wrote:
>> >>>> Hi all,
>> >>>>
>> >>>> In Dspace 4.2 we are currently seeing that our OAI feeds are not
>> being
>> >>> persistent as per these definitions when it comes to deleted
>> (withdrawn)
>> >>> items:
>> >>>>
>> >>>> http://www.openarchives.org/OAI/openarchivesprotocol.html#deletion
>> >>>>
>> https://wiki.duraspace.org/pages/viewpage.action?pageId=34640887#OAI-
>> >>> P
>> >>>> MHDataProvider2.0(Internals)-Deletions
>> >>>>
>> >>>> Withdrawn items do not get the <header status=deleted> as they
>> should,
>> >>> like in this example:
>> >>>>
>> >>>>
>> >>>
>> http://webservices.itcs.umich.edu/mediawiki/oaibp/index.php/Deleted_Re
>> >>>> cord_Example_1
>> >>>>
>> >>>> Withdrawing an item is not reflected in the OAI entry in any way.
>> >>>>
>> >>>> ----------------------------------------------
>> >>>>
>> >>>> This is our Identify page at the moment:
>> >>>>
>> >>>> https://www.duo.uio.no/oai/request?verb=Identify
>> >>>>
>> >>>> ----------------------------------------------
>> >>>>
>> >>>> I believe we're doing everything right regarding config parameters
>> and
>> >>> maintenance of the solr oai index.
>> >>>>
>> >>>> The only place I can find a "persistent"-parameter is in this config
>> file:
>> >>>> /www/var/data/dspace/config/oaicat.properties
>> >>>> which I thought was deprecated, but we still set it just to be sure:
>> >>>> Identify.deletedRecord=persistent
>> >>>>
>> >>>> All other relevant config parameters are set in these two files as
>> far as I can
>> >>> tell:
>> >>>>
>> >>>> /www/var/data/dspace/config/modules/oai.cfg
>> >>>> /www/var/data/dspace/config/crosswalks/oai/description.xml
>> >>>>
>> >>>> We are running the oai import -o command nightly.
>> >>>>
>> >>>> ----------------------------------------------
>> >>>>
>> >>>> The only way we've found to remove withdrawn items from the OAI feed
>> is
>> >>> by running a complete re-indexing of the oai solr index:
>> >>>>
>> >>>> /www/var/data/dspace/bin/dspace oai import -c
>> >>>>
>> >>>> with a subsequent
>> >>>>
>> >>>> /www/var/data/dspace/bin/dspace oai clean-cache
>> >>>>
>> >>>> This removes a withdrawn item:
>> >>>>
>> >>>> https://www.duo.uio.no/handle/10852/42670
>> >>>>
>> >>>> completely from the feed:
>> >>>>
>> >>>>
>> >>> https://www.duo.uio.no/oai/request?verb=GetRecord&metadataPrefix=oai
>> >>> _d
>> >>>> c&identifier=oai:localhost:10852/42670
>> >>>>
>> >>>> which is not what we want...
>> >>>>
>> >>>> ----------------------------------------------
>> >>>>
>> >>>>
>> >>>> Is anyone else experiencing the same problem?
>> >>>>
>> >>>> Maybe I've missed something - If this actually works differently
>> than we
>> >>> expect or has been addressed in OAI 2.1, please let me know.
>> >>>>
>> >>>>
>> >>>> regards,
>> >>>> Kristian Salcedo
>> >>>> Universitetet of Oslo Library
>> >>>> Department of digital services
>> >>>>
>> >>>>
>> >>>>
>> ----------------------------------------------------------------------
>> >>>> -------- Dive into the World of Parallel Programming The Go Parallel
>> >>>> Website, sponsored by Intel and developed in partnership with
>> Slashdot
>> >>>> Media, is your hub for all things parallel software development, from
>> >>>> weekly thought leadership blogs to news, videos, case studies,
>> >>>> tutorials and more. Take a look and join the conversation now.
>> >>>> http://goparallel.sourceforge.net/
>> >>>> _______________________________________________
>> >>>> DSpace-tech mailing list
>> >>>> DSpace-tech@lists.sourceforge.net
>> >>>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> >>>> List Etiquette:
>> >>>> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>> >
>> >
>> ------------------------------------------------------------------------------
>> > Dive into the World of Parallel Programming The Go Parallel Website,
>> sponsored
>> > by Intel and developed in partnership with Slashdot Media, is your hub
>> for all
>> > things parallel software development, from weekly thought leadership
>> blogs to
>> > news, videos, case studies, tutorials and more. Take a look and join the
>> > conversation now. http://goparallel.sourceforge.net/
>> > _______________________________________________
>> > DSpace-tech mailing list
>> > DSpace-tech@lists.sourceforge.net
>> > https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> > List Etiquette:
>> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>> >
>>
>>
>> ------------------------------------------------------------------------------
>> _______________________________________________
>> DSpace-tech mailing list
>> DSpace-tech@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> List Etiquette:
>> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>>
>
>
>
> ------------------------------------------------------------------------------
>
>
>
> _______________________________________________
> DSpace-tech mailing 
> listDSpace-tech@lists.sourceforge.nethttps://lists.sourceforge.net/lists/listinfo/dspace-tech
> List Etiquette: 
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>
>
>
> --
> Christian Scheible
> Softwareentwickler / Abt. Content-basierte Dienste
> Kommunikations-, Informations- und Medienzentrum (KIM)
> Universität Konstanz
> 78457 Konstanz
> +49 (0)7531 / 88-2857
> Raum B 703
>
>
>
> ------------------------------------------------------------------------------
>
> _______________________________________________
> DSpace-tech mailing list
> DSpace-tech@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
> List Etiquette:
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>
------------------------------------------------------------------------------
_______________________________________________
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Reply via email to