Dear All, I have an issue with harvested items from one of the repositories that I am working on. The harvested items contain Thai characters in their filename(s). There is no problem downloading the actual bitstream in the source repository, but when harvested, the link to the bitstream with Thai characters was converted to question marks when clicked.
To make it clear, here is the item from the source or original repository: https://repository.seafdec.or.th/handle/20.500.12067/1677 And here is the link to the harvested item: https://repository.seafdec.org/handle/20.500.12066/6814 I tried cleaning the OAI cache of the source repository and running dspace oai import LANG=en_US.UTF-8, did this after editing this item to trigger an item update to the OAI but the issue still persists. UTF encoding is enabled in Tomcat both in the source and the harvesting repository (DSpace 6.3 XMLUI). Here is the link to the ORE record of the item from the OAI webapp: https://repository.seafdec.or.th/oai/request?verb=GetRecord&identifier=oai:repository.seafdec.or.th:20.500.12067/1677&metadataPrefix=ore I believe this is where the harvesting repository gets its link for the bitstreams. Notice that the Thai text ปะการังเทียมลอยน้ำ_ผลสำเร็จ.pdf was URL encoded into %e0%b8%9b%e0%b8%b0%e0%b8%81%e0%b8%b2%e0%b8%a3%e0%b8%b1%e0%b8%87%e0%b9%80%e0%b8%97%e0%b8%b5%e0%b8%a2%e0%b8%a1%e0%b8%a5%e0%b8%ad%e0%b8%a2%e0%b8%99%e0%b9%89%e0%b8%b3_%e0%b8%9c%e0%b8%a5%e0%b8%aa%e0%b8%b3%e0%b9%80%e0%b8%a3%e0%b9%87%e0%b8%88.pdf Navigating to that link found in the generated ore file will result in "Resource not found" error because the text was transformed into question marks: [image: thai-text.PNG] Is there a way to resolve this except by changing the ORIGINAL file name (which is not an option because I have no idea how many of these files contain Thai text)? Thanks in advance and best regards, euler -- All messages to this mailing list should adhere to the Code of Conduct: https://www.lyrasis.org/about/Pages/Code-of-Conduct.aspx --- You received this message because you are subscribed to the Google Groups "DSpace Technical Support" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/dspace-tech/7d99c537-1419-4278-8f42-4d84a3cacf09n%40googlegroups.com.
