In short no...

As I've suggested in the thread the indexAll should include also the
withdrawn, that's what we do
https://github.com/ufal/lindat-dspace/blob/lindat/dspace-oai/src/main/java/org/dspace/xoai/app/XOAI.java#L193
We use our Event consumer to trigger the update (it currently doesn't
handle caching and changes in discoverable flag)
https://github.com/ufal/lindat-dspace/blob/lindat/dspace-oai/src/main/java/cz/cuni/mff/ufal/event/OAIIndexEventConsumer.java
And by the way dspace changed the deletion mode to transient in
https://jira.duraspace.org/browse/DS-2491 so maybe the right way for dspace
is updating the documentation...


OK



2015-05-26 23:19 GMT+02:00 Jozef Misutka <misu...@ufal.mff.cuni.cz>:

> Ondrej, is this happening to us as well?
>
> jm
>
> ------------------------------
> *From: *"Tim Donohue" <tdono...@duraspace.org>
> *To: *dspace-tech@lists.sourceforge.net
> *Sent: *Tuesday, 26 May, 2015 23:09:20
>
> *Subject: *Re: [Dspace-tech] OAI-PMH data provider 2.0 not persistent?
>
> Hi All,
>
> I'm only just now "re-discovering" this thread, as I've found this same
> behavior in DSpace OAI-PMH servers. Here's how to see this behavior (and
> I've double checked it on http://demo.dspace.org, running 5.2):
>
> 1. Create an Item
> 2. Run the OAI-PMH import (./dspace oai import).
> 3. The item now appears in OAI-PMH
> 4. Withdraw that Item
> 5. Re-Run the OAI-PMH import (./dspace oai import)
> 6. The Item still appears in OAI-PMH and is NEVER flagged as withdrawn.
>
> So, you can still access all its metadata, etc. The ONLY way to trigger
> an update to the OAI-PMH record (of the withdrawn item) is to re-import
> EVERYTHING (./dspace oai import -c). However, as noted, since withdrawn
> items are not included in the XOAI "indexAll()" command, the item will
> now disappear from OAI-PMH entirely.
>
> Here's that indexAll() command:
>
> https://github.com/DSpace/DSpace/blob/master/dspace-oai/src/main/java/org/dspace/xoai/app/XOAI.java#L191
>
> This behavior seems to be counter to what is documented at:
>
> https://wiki.duraspace.org/pages/viewpage.action?pageId=45548245#OAI-PMHDataProvider2.0%28Internals%29-Deletions
>
> The documentation specifically states that "DSpace keeps a permanent
> record of withdrawn items". It also states that a request for a
> withdrawn item "will yield the 'record deleted' header".
>
> This sounds like a bug to me. The documentation definitely does not
> match with the behavior.
>
> I've created a new bug ticket for this. It will need a volunteer to
> resolve.
>
> https://jira.duraspace.org/browse/DS-2593
>
> - Tim
>
> On 3/6/2015 5:21 AM, helix84 wrote:
> > Yes, that makes sense. Still, it needs a brief verification because
> > sometimes our understanding of a concept and its implementation may
> > differ.
> >
> >
> > On Fri, Mar 6, 2015 at 10:12 AM, Kristian Roberto Salcedo
> > <k.r.salc...@ub.uio.no> wrote:
> >> Hi Ivan,
> >>
> >> I might have overlooked something, but from your comments on
> https://jira.duraspace.org/browse/DS-2491 wouldnt it be meaningful
> >> to do both your declaration change and Ondřej's suggestion in order to
> match both what is declared and what the documentation says about OAI and
> deleted items?
> >>
> >> regards,
> >> Kristian
> >>
> >>> -----Original Message-----
> >>> From: ivan.ma...@gmail.com [mailto:ivan.ma...@gmail.com] On Behalf Of
> >>> helix84
> >>> Sent: Thursday, March 05, 2015 3:46 PM
> >>> To: Kristian Roberto Salcedo
> >>> Cc: dspace-tech@lists.sourceforge.net; João Melo
> >>> Subject: Re: [Dspace-tech] OAI-PMH data provider 2.0 not persistent?
> >>>
> >>> Hi Kristian,
> >>>
> >>> I think you're right that the declared
> >>> <deletedRecord>persistent</deletedRecord> doesn't match how DSpace
> >>> behaves. I filed a Jira issue and created a pull request to change the
> declared
> >>> status to <deletedRecord>transient</deletedRecord>.
> >>>
> >>> https://jira.duraspace.org/browse/DS-2491
> >>>
> >>>
> >>> Regards,
> >>> ~~helix84
> >>>
> >>> Compulsory reading: DSpace Mailing List Etiquette
> >>> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
> >>>
> >>>
> >>> On Thu, Mar 5, 2015 at 2:52 PM, Kristian Roberto Salcedo
> >>> <k.r.salc...@ub.uio.no> wrote:
> >>>> Hi all,
> >>>>
> >>>> In Dspace 4.2 we are currently seeing that our OAI feeds are not being
> >>> persistent as per these definitions when it comes to deleted
> (withdrawn)
> >>> items:
> >>>>
> >>>> http://www.openarchives.org/OAI/openarchivesprotocol.html#deletion
> >>>> https://wiki.duraspace.org/pages/viewpage.action?pageId=34640887#OAI-
> >>> P
> >>>> MHDataProvider2.0(Internals)-Deletions
> >>>>
> >>>> Withdrawn items do not get the <header status=deleted> as they should,
> >>> like in this example:
> >>>>
> >>>>
> >>> http://webservices.itcs.umich.edu/mediawiki/oaibp/index.php/Deleted_Re
> >>>> cord_Example_1
> >>>>
> >>>> Withdrawing an item is not reflected in the OAI entry in any way.
> >>>>
> >>>> ----------------------------------------------
> >>>>
> >>>> This is our Identify page at the moment:
> >>>>
> >>>> https://www.duo.uio.no/oai/request?verb=Identify
> >>>>
> >>>> ----------------------------------------------
> >>>>
> >>>> I believe we're doing everything right regarding config parameters and
> >>> maintenance of the solr oai index.
> >>>>
> >>>> The only place I can find a "persistent"-parameter is in this config
> file:
> >>>> /www/var/data/dspace/config/oaicat.properties
> >>>> which I thought was deprecated, but we still set it just to be sure:
> >>>> Identify.deletedRecord=persistent
> >>>>
> >>>> All other relevant config parameters are set in these two files as
> far as I can
> >>> tell:
> >>>>
> >>>> /www/var/data/dspace/config/modules/oai.cfg
> >>>> /www/var/data/dspace/config/crosswalks/oai/description.xml
> >>>>
> >>>> We are running the oai import -o command nightly.
> >>>>
> >>>> ----------------------------------------------
> >>>>
> >>>> The only way we've found to remove withdrawn items from the OAI feed
> is
> >>> by running a complete re-indexing of the oai solr index:
> >>>>
> >>>> /www/var/data/dspace/bin/dspace oai import -c
> >>>>
> >>>> with a subsequent
> >>>>
> >>>> /www/var/data/dspace/bin/dspace oai clean-cache
> >>>>
> >>>> This removes a withdrawn item:
> >>>>
> >>>> https://www.duo.uio.no/handle/10852/42670
> >>>>
> >>>> completely from the feed:
> >>>>
> >>>>
> >>> https://www.duo.uio.no/oai/request?verb=GetRecord&metadataPrefix=oai
> >>> _d
> >>>> c&identifier=oai:localhost:10852/42670
> >>>>
> >>>> which is not what we want...
> >>>>
> >>>> ----------------------------------------------
> >>>>
> >>>>
> >>>> Is anyone else experiencing the same problem?
> >>>>
> >>>> Maybe I've missed something - If this actually works differently than
> we
> >>> expect or has been addressed in OAI 2.1, please let me know.
> >>>>
> >>>>
> >>>> regards,
> >>>> Kristian Salcedo
> >>>> Universitetet of Oslo Library
> >>>> Department of digital services
> >>>>
> >>>>
> >>>> ----------------------------------------------------------------------
> >>>> -------- Dive into the World of Parallel Programming The Go Parallel
> >>>> Website, sponsored by Intel and developed in partnership with Slashdot
> >>>> Media, is your hub for all things parallel software development, from
> >>>> weekly thought leadership blogs to news, videos, case studies,
> >>>> tutorials and more. Take a look and join the conversation now.
> >>>> http://goparallel.sourceforge.net/
> >>>> _______________________________________________
> >>>> DSpace-tech mailing list
> >>>> DSpace-tech@lists.sourceforge.net
> >>>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
> >>>> List Etiquette:
> >>>> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
> >
> >
> ------------------------------------------------------------------------------
> > Dive into the World of Parallel Programming The Go Parallel Website,
> sponsored
> > by Intel and developed in partnership with Slashdot Media, is your hub
> for all
> > things parallel software development, from weekly thought leadership
> blogs to
> > news, videos, case studies, tutorials and more. Take a look and join the
> > conversation now. http://goparallel.sourceforge.net/
> > _______________________________________________
> > DSpace-tech mailing list
> > DSpace-tech@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/dspace-tech
> > List Etiquette:
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
> >
>
>
> ------------------------------------------------------------------------------
> _______________________________________________
> DSpace-tech mailing list
> DSpace-tech@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
> List Etiquette:
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>
------------------------------------------------------------------------------
_______________________________________________
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Reply via email to