Thanks Helix, Your reply was very helpful but unfortunately I still haven't managed to work this out.
In DSpace 4 /rest/items has a mandatory id at the end of the path according to the documentation. If I just put in /rest/items I get the error "method not allowed". I can see this is meant to work in higher versions of DSpace and this would be the ideal method. With the OAI example my issue was that the rest interface was not returning all records. Is there anyway to get the OAI interface to return the rest id so I can get the resources? On Monday, November 14, 2016 at 6:22:33 PM UTC+9:30, helix84 wrote: > > Not sure if you still need help... Which repository are you trying to > harvest? > > REST and web interfaces show the contents of DSpace in real time. > Their counts should match. > > OAI keeps a cache by default and that cache has to be refreshed by the > administrator ([dspace]/bin/dspace/oai import), so those counts may > run out of date if the administrator doesn't update the cache. > > Anyway, OAI uses handle as part of the identifier. You can get handles > from the REST API, so you can, in fact, mix and match the two. > > Example: > > > http://demo.dspace.org/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:demo.dspace.org:10673/4 > > > https://demo.dspace.org/rest/items > <items> > <item> > ... > <handle>10673/4</handle> > ... > > > > Regards, > ~~helix84 > > Compulsory reading: DSpace Mailing List Etiquette > https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette > > On Mon, Nov 14, 2016 at 6:06 AM, Luke <[email protected] <javascript:>> > wrote: > > I'm trying to harvest data from DSpace 4 server. > > > > Using the OAI protocol I couldn't get the raw resources so I moved to > the > > REST API. But I can't seem to get all the items in the collection > through > > REST. If I: > > * get all collections (<rest>/collections) I get around 4000 collections > > (there isn't some pagination thing I'm missing is there?). > > * recursively getting all communities/sub-communities > (<rest>/communities) > > and append collections (<rest>/collections) I get just shy of 40000 > > collections. > > > > But if I view dspace using a browser it shows 22436 at 10 items a page > or > > 224360 in the public collection. OAI doesn't seem to have the id of > > resources, so I can't mix OAI with the rest api. > > > > The server is controlled by an external party, so I'm struggling to play > > around to see if things like an upgrading the server fixes the issue. So > I > > was wondering if anyone would have any experience with this or might > know > > why I can't get all the items out of the API. > > > > Thanks. > > > > -- > > You received this message because you are subscribed to the Google > Groups > > "DSpace Community" group. > > To unsubscribe from this group and stop receiving emails from it, send > an > > email to [email protected] <javascript:>. > > To post to this group, send email to [email protected] > <javascript:>. > > Visit this group at https://groups.google.com/group/dspace-community. > > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "DSpace Community" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/dspace-community. For more options, visit https://groups.google.com/d/optout.
