Not sure if you still need help... Which repository are you trying to harvest?
The REST and web interfaces show the contents of DSpace in real time, so their counts should match. OAI keeps a cache by default, and that cache has to be refreshed by the administrator ([dspace]/bin/dspace oai import), so the OAI counts may be out of date if the administrator doesn't update the cache.

Anyway, OAI uses the handle as part of the identifier. You can get handles from the REST API, so you can, in fact, mix and match the two.

Example:

http://demo.dspace.org/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:demo.dspace.org:10673/4

https://demo.dspace.org/rest/items
<items>
<item>
...
<handle>10673/4</handle>
...

Regards,
~~helix84

Compulsory reading: DSpace Mailing List Etiquette
https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

On Mon, Nov 14, 2016 at 6:06 AM, Luke <[email protected]> wrote:
> I'm trying to harvest data from a DSpace 4 server.
>
> Using the OAI protocol I couldn't get the raw resources so I moved to the
> REST API. But I can't seem to get all the items in the collection through
> REST. If I:
> * get all collections (<rest>/collections) I get around 4000 collections
> (there isn't some pagination thing I'm missing, is there?).
> * recursively get all communities/sub-communities (<rest>/communities)
> and append collections (<rest>/collections) I get just shy of 40000
> collections.
>
> But if I view dspace using a browser it shows 22436 at 10 items a page or
> 224360 in the public collection. OAI doesn't seem to have the id of
> resources, so I can't mix OAI with the rest api.
>
> The server is controlled by an external party, so I'm struggling to play
> around to see if things like upgrading the server fix the issue. So I
> was wondering if anyone would have any experience with this or might know
> why I can't get all the items out of the API.
>
> Thanks.
>
> --
> You received this message because you are subscribed to the Google Groups
> "DSpace Community" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/dspace-community.
> For more options, visit https://groups.google.com/d/optout.
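
For anyone who wants to try the mix-and-match approach from the reply above, here is a rough Python sketch. It pulls the handles out of a REST items response and builds the matching OAI GetRecord URL for each one, using the oai:<repository>:<handle> identifier form seen in the demo.dspace.org example. The function names, the sample XML, and the second handle (10673/5) are made up for illustration, and the sketch assumes the REST XML has no namespaces; a real response has more fields per item and may need adjusting.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Hypothetical sample shaped like the <items> snippet in the reply;
# a real /rest/items response carries more fields per item.
SAMPLE_REST_XML = """<items>
  <item><handle>10673/4</handle></item>
  <item><handle>10673/5</handle></item>
</items>"""


def handles_from_rest_xml(xml_text):
    """Return the text of every <handle> element in a REST items response."""
    root = ET.fromstring(xml_text)
    return [el.text for el in root.iter("handle") if el.text]


def oai_getrecord_url(oai_base, repo_id, handle, prefix="oai_dc"):
    """Build a GetRecord URL using the oai:<repo_id>:<handle> identifier form."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": prefix,
        "identifier": f"oai:{repo_id}:{handle}",
    }
    # Keep ':' and '/' literal so the URL reads like the one in the reply.
    return f"{oai_base}?{urlencode(params, safe=':/')}"


if __name__ == "__main__":
    for h in handles_from_rest_xml(SAMPLE_REST_XML):
        print(oai_getrecord_url("http://demo.dspace.org/oai/request",
                                "demo.dspace.org", h))
```

Against a live server you would fetch the XML from <rest>/items instead of using the sample string. Keep in mind that the OAI side only knows about items that were in the repository the last time its cache was refreshed.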
