Hi,

We're using DSpace v6.2, JSPUI.

Whilst troubleshooting an issue with a large number of broken full text links 
harvested from our repository via OAI-PMH by the CORE service, CORE reported to 
us that "the provided full text link in the OAI-PMH dc:identifier field is 
broken."

For example, for this item in our repository:

https://dspace.stir.ac.uk/handle/1893/30142

- the link to the associated bitstream from this repository record is:

https://dspace.stir.ac.uk/retrieve/17570e9c-aa29-4c15-99b2-af5892853652/Revisions_Final_Chronic_wounds.pdf

- however, if harvested via OAI-PMH:

https://dspace.stir.ac.uk/oai/request?verb=GetRecord&identifier=oai:dspace.stir.ac.uk:1893/30142&metadataPrefix=oai_dc

- then the bitstream link in dc.identifier is wrong:

<dc:identifier>http://dspace.stir.ac.uk/bitstream/1893/30142/-1/Revisions_Final_Chronic_wounds.pdf</dc:identifier>

- i.e. it contains "-1" where I'd expect to see the bitstream UUID.

And looking at the "raw" XOAI output, it appears to be wrong there too (so not 
an issue with the oai_dc crosswalk?):

https://dspace.stir.ac.uk/oai/request?verb=GetRecord&identifier=oai:dspace.stir.ac.uk:1893/30142&metadataPrefix=xoai

<field 
name="url">http://dspace.stir.ac.uk/bitstream/1893/30142/-1/Revisions_Final_Chronic_wounds.pdf</field>

However, a large number of the OAI-PMH bitstream links do work - e.g.:

https://dspace.stir.ac.uk/oai/request?verb=GetRecord&identifier=oai:dspace.stir.ac.uk:1893/58&metadataPrefix=oai_dc

- includes the correct bitstream URL:

<dc:identifier>http://dspace.stir.ac.uk/bitstream/1893/58/1/Thesis.pdf</dc:identifier>

I've tried clearing the cache, and rebuilding the OAI-PMH index, but this issue 
remains. I also searched the Mailing list archives and JIRA, but couldn't find 
anything that seemed to relate to this problem.

I'm not sure, but my current working theory is that links to "older" bitstreams 
do work because they relate to records added to the repository before the 
upgrade that moved DSpace from using numeric IDs to UUIDs - but records added 
since then, that make use of UUIDs, don't work . . . . (but I haven't managed 
to prove this theory yet!).

Has anyone else come across this? Does anyone know of a solution (I'm happy to 
hack code/apply patches if required)?

If you're on this version of DSpace, are all the bitstream URLs harvested via 
OAI-PMH from your repository correct?

If anyone has any fixes, thoughts, observations etc, they would be most welcome 
as I'm currently at a loss as to how to resolve this and, given the importance 
of CORE for supporting the upcoming REF here in the UK, my library colleagues 
are getting a bit jumpy ;-).

Cheers,

Mike

Michael White
Senior Developer
Business Applications and Integrations
Information Services

4B19, Cottrell
University of Stirling
Stirling
FK9 4LA

Tel:  +44 (0)1786 466877
Email:  [email protected]<mailto:[email protected]>
Web: stir.ac.uk/informationservices<http://www.stir.ac.uk/informationservices>
[cid:[email protected]]<https://www.facebook.com/stirlinglibrary/>[cid:[email protected]]<https://twitter.com/isstirling>[cid:[email protected]]<https://www.instagram.com/isstirling/>[cid:[email protected]]<https://www.youtube.com/user/infoservicesatstir>
[Banner]<https://www.stir.ac.uk/>


________________________________
The University achieved an overall 5 stars in the QS World University Rankings 
2018
The University of Stirling is a charity registered in Scotland, number SC 
011159.

-- 
All messages to this mailing list should adhere to the DuraSpace Code of 
Conduct: https://duraspace.org/about/policies/code-of-conduct/
--- 
You received this message because you are subscribed to the Google Groups 
"DSpace Technical Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/dspace-tech/AM6PR03MB5560135CD01E0191E3F1C1C0D4EB0%40AM6PR03MB5560.eurprd03.prod.outlook.com.

Reply via email to