Hey,
but the indexAll() does not select the withdrawn items
https://github.com/DSpace/DSpace/blob/master/dspace-oai/src/main/java/org/dspace/xoai/app/XOAI.java#L191

2015-03-05 16:31 GMT+01:00 João Melo <jm...@lyncode.com>:

> Hi Kristian,
>
> I cannot see how your description matches the implementation.
> From the implementation withdrawn items are indexed.
>
>
> https://github.com/DSpace/DSpace/blob/master/dspace-oai/src/main/java/org/dspace/xoai/app/XOAI.java#L248
>
> The only way withdrawn items can disappear from the list is due to usage
> of virtual contexts (driver/openaire).
> Is that your scenario?
>
> On 5 March 2015 at 15:13, Kristian Roberto Salcedo <k.r.salc...@ub.uio.no>
> wrote:
>
>> Hi Ivan,
>>
>> Thank you for answering so swiftly.
>> But are you saying that this part of the documentation
>>
>> https://wiki.duraspace.org/pages/viewpage.action?pageId=45548245#OAI-PMHDataProvider2.0(Internals)-Deletions
>> is incorrect and that Dspace only supports deletions "transiently"?
>> If so, wouldn’t it mean that all who harvest us would have to perform
>> complete harvests every time in order to keep track of changes in our
>> repository unless we create custom solutions in order to reveal deleted
>> status?
>>
>> regards,
>> Kristian
>>
>> >-----Original Message-----
>> >From: ivan.ma...@gmail.com [mailto:ivan.ma...@gmail.com] On Behalf Of
>> >helix84
>> >Sent: Thursday, March 05, 2015 3:46 PM
>> >To: Kristian Roberto Salcedo
>> >Cc: dspace-tech@lists.sourceforge.net; João Melo
>> >Subject: Re: [Dspace-tech] OAI-PMH data provider 2.0 not persistent?
>> >
>> >Hi Kristian,
>> >
>> >I think you're right that the declared
>> ><deletedRecord>persistent</deletedRecord> doesn't match how DSpace
>> >behaves. I filed a Jira issue and created a pull request to change the
>> declared
>> >status to <deletedRecord>transient</deletedRecord>.
>> >
>> >https://jira.duraspace.org/browse/DS-2491
>> >
>> >
>> >Regards,
>> >~~helix84
>> >
>> >Compulsory reading: DSpace Mailing List Etiquette
>> >https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>> >
>> >
>> >On Thu, Mar 5, 2015 at 2:52 PM, Kristian Roberto Salcedo
>> ><k.r.salc...@ub.uio.no> wrote:
>> >> Hi all,
>> >>
>> >> In Dspace 4.2 we are currently seeing that our OAI feeds are not being
>> >persistent as per these definitions when it comes to deleted (withdrawn)
>> >items:
>> >>
>> >> http://www.openarchives.org/OAI/openarchivesprotocol.html#deletion
>> >> https://wiki.duraspace.org/pages/viewpage.action?pageId=34640887#OAI-
>> >P
>> >> MHDataProvider2.0(Internals)-Deletions
>> >>
>> >> Withdrawn items do not get the <header status=deleted> as they should,
>> >like in this example:
>> >>
>> >>
>> >http://webservices.itcs.umich.edu/mediawiki/oaibp/index.php/Deleted_Re
>> >> cord_Example_1
>> >>
>> >> Withdrawing an item is not reflected in the OAI entry in any way.
>> >>
>> >> ----------------------------------------------
>> >>
>> >> This is our Identify page at the moment:
>> >>
>> >> https://www.duo.uio.no/oai/request?verb=Identify
>> >>
>> >> ----------------------------------------------
>> >>
>> >> I believe we're doing everything right regarding config parameters and
>> >maintenance of the solr oai index.
>> >>
>> >> The only place I can find a "persistent"-parameter is in this config
>> file:
>> >> /www/var/data/dspace/config/oaicat.properties
>> >> which I thought was deprecated, but we still set it just to be sure:
>> >> Identify.deletedRecord=persistent
>> >>
>> >> All other relevant config parameters are set in these two files as far
>> as I can
>> >tell:
>> >>
>> >> /www/var/data/dspace/config/modules/oai.cfg
>> >> /www/var/data/dspace/config/crosswalks/oai/description.xml
>> >>
>> >> We are running the oai import -o command nightly.
>> >>
>> >> ----------------------------------------------
>> >>
>> >> The only way we've found to remove withdrawn items from the OAI feed is
>> >by running a complete re-indexing of the oai solr index:
>> >>
>> >> /www/var/data/dspace/bin/dspace oai import -c
>> >>
>> >> with a subsequent
>> >>
>> >> /www/var/data/dspace/bin/dspace oai clean-cache
>> >>
>> >> This removes a withdrawn item:
>> >>
>> >> https://www.duo.uio.no/handle/10852/42670
>> >>
>> >> completely from the feed:
>> >>
>> >>
>> >https://www.duo.uio.no/oai/request?verb=GetRecord&metadataPrefix=oai
>> >_d
>> >> c&identifier=oai:localhost:10852/42670
>> >>
>> >> which is not what we want...
>> >>
>> >> ----------------------------------------------
>> >>
>> >>
>> >> Is anyone else experiencing the same problem?
>> >>
>> >> Maybe I've missed something - If this actually works differently than
>> we
>> >expect or has been addressed in OAI 2.1, please let me know.
>> >>
>> >>
>> >> regards,
>> >> Kristian Salcedo
>> >> Universitetet of Oslo Library
>> >> Department of digital services
>> >>
>> >>
>> >> ----------------------------------------------------------------------
>> >> -------- Dive into the World of Parallel Programming The Go Parallel
>> >> Website, sponsored by Intel and developed in partnership with Slashdot
>> >> Media, is your hub for all things parallel software development, from
>> >> weekly thought leadership blogs to news, videos, case studies,
>> >> tutorials and more. Take a look and join the conversation now.
>> >> http://goparallel.sourceforge.net/
>> >> _______________________________________________
>> >> DSpace-tech mailing list
>> >> DSpace-tech@lists.sourceforge.net
>> >> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> >> List Etiquette:
>> >> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>>
>
>
>
> ------------------------------------------------------------------------------
> Dive into the World of Parallel Programming The Go Parallel Website,
> sponsored
> by Intel and developed in partnership with Slashdot Media, is your hub for
> all
> things parallel software development, from weekly thought leadership blogs
> to
> news, videos, case studies, tutorials and more. Take a look and join the
> conversation now. http://goparallel.sourceforge.net/
> _______________________________________________
> DSpace-tech mailing list
> DSpace-tech@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
> List Etiquette:
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>
------------------------------------------------------------------------------
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/
_______________________________________________
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Reply via email to