Hi David, I have I found unknown characters in description and abstract field where our submitters copied and pasted metadata from word document to Dspace, I have corrected one by one item running bin/dspace oai clean-cache, bin/dspace oai import -c after cleaning individual item and harvesting to dev site when getting harvest error i clicked link inside and find identifier which direct me to item that has characters then i correct the item and save updates then run the commands again and harvest again aftyer deleting test community, at the end all 44 items found and cleared. I have received confirmationfrom external harvester that they managed to harvest all collection.
Thank you very much for sharing solution. Regards, Lewatle On Wednesday, 20 February 2019 16:13:09 UTC+2, Lewatle Johannes Phaladi wrote: > > Hi David, > > Thanks very much, I am now identifying many items with characters in the > abstract where users copied and pasted to Dspace. I am busy removing them > while harvesting on development and number of harvested items are > increasing from 100 to 4574, I am hoping all will go well, i will let you > know once all is completed or if there is new error. > > Regards, > Lewatle > > On Tuesday, 19 February 2019 12:01:22 UTC+2, david.delacroes wrote: >> >> Hi Lewatle, >> >> We recently experienced similar problems to yours, which prevented >> external harvesters from receiving our complete OAI feed. We discovered >> that a number of records caused the OAI XML to become malformed, because of >> “foreign” characters, etc. Once we have identified those records (only 9 >> records), we edited each records in DSpace. Thereafter, we executed the >> command “bin/dspace oai import”, which corrected the changed records in the >> OAI database and cache. >> >> >> >> If you delete records, you may have to run the following commands to >> rebuild your OAI indexes: >> >> bin/dspace oai clean-cache >> >> bin/dspace oai import -c >> >> >> >> Hope this helps! >> >> Regards, >> >> David >> >> *From:* [email protected] <[email protected]> *On >> Behalf Of *Lewatle Johannes Phaladi >> *Sent:* Tuesday, 19 February 2019 11:28 AM >> *To:* DSpace Community <[email protected]> >> *Subject:* [dspace-community] Re: DSpace: Harvesting Error >> >> >> >> Hi David and Colleagues, >> >> >> >> I have re-indexed discovery with -d, re-imported oai with -o and >> restarted tomcat then retried to harvest collection on our test server, I >> got error complaining about https://hdl.handle.net/10539/26028 >> <https://protect-za.mimecast.com/s/O17gCzm4GXCKXAvNuooziw> , the item in >> question I have deleted from the system after receiving error and >> re-inxexed again but it is still coming on root cause of the oai error, is >> there any way harvester can by pass this item. I have also attached another >> error messages. >> >> >> >> Regards, >> >> Lewatle >> >> On Wednesday, 13 February 2019 09:43:36 UTC+2, Lewatle Johannes Phaladi >> wrote: >> >> Hi DSpace Colleagues, >> >> >> >> When I tested my harvesting settings DSpace says settings are valid, but >> when other repositories harvesting our side tries to harvest they get error >> messages attached, on another attachment i just put screenshot of test I >> have done on our dev dspace trying to harvest collection from Prod Dspace >> site, TXT document contains error received when running import on another >> dspace system as test. Your advise on this error is much appreciated! >> >> >> >> Regards, >> >> Lewatle >> >> -- >> All messages to this mailing list should adhere to the DuraSpace Code of >> Conduct: https://duraspace.org/about/policies/code-of-conduct/ >> <https://protect-za.mimecast.com/s/vHYlCAnX51ijxzMJfMYsE2> >> --- >> You received this message because you are subscribed to the Google Groups >> "DSpace Community" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> To post to this group, send email to [email protected]. >> Visit this group at https://groups.google.com/group/dspace-community >> <https://protect-za.mimecast.com/s/UkOcCBgX56fvxO3JtvQ605>. >> For more options, visit https://groups.google.com/d/optout >> <https://protect-za.mimecast.com/s/nggFCDRZ58iX7kQrcBQkJ2>. >> Disclaimer - University of Cape Town This email is subject to UCT >> policies and email disclaimer published on our website at >> http://www.uct.ac.za/main/email-disclaimer or obtainable from +27 21 650 >> 9111. If this email is not related to the business of UCT, it is sent by >> the sender in an individual capacity. Please report security incidents or >> abuse via https://csirt.uct.ac.za/page/report-an-incident.php. >> > -- All messages to this mailing list should adhere to the DuraSpace Code of Conduct: https://duraspace.org/about/policies/code-of-conduct/ --- You received this message because you are subscribed to the Google Groups "DSpace Community" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/dspace-community. For more options, visit https://groups.google.com/d/optout.
