Sorry. Maybe I should make it clearer: PubMed has a field called "Affiliation." (http://www.ncbi.nlm.nih.gov/pubmed/advanced) We only want to harvest publications affiliated with our university.
I can get the results by querying the website, but I don't know how to harvest this set of data to our DSpace. Thanks!! Sophie -----Original Message----- From: Deng, Sai Sent: Wednesday, May 23, 2012 11:02 AM To: 'Tim Donohue' Cc: [email protected] Subject: RE: [Dspace-tech] Harvesting PubMed Thank you, Tim!! There is no way to harvest for a specific institution (publications affiliated to one institution), right? I looked at the PubMed instructions and it looks like their sets are defined by Publishers. Here is my testing result: OAI Provider: http://www.pubmedcentral.nih.gov/oai/oai.cgi OAI Set id: aac (*This is for aac set. Seems no way to harvest for an institution.) Metadata format: Simple Dublin Core Content being harvested: Harvest metadata only. Last Harvest Result: Harvest from http://www.pubmedcentral.nih.gov/oai/oai.cgi sucessful on 2012-05-22 16:18:43.386 I am thinking whether the harvesting interface can be include more options. Is it possible to harvest only data from one university? Vika from Boston posted these two questions before: - Where in the harvesting settings can I specify things like the dates from/until which I want to harvest? - If my collection has an accept/reject step, is there a way to make harvested items completely invisible until they are accepted? Does the answer to this depend on whether I'm harvesting only metadata, or also full-text files? Thank you for any insight! Sophie -----Original Message----- From: Tim Donohue [mailto:[email protected]] Sent: Wednesday, May 23, 2012 10:58 AM To: Deng, Sai Cc: [email protected] Subject: Re: [Dspace-tech] Harvesting PubMed Sophie, I forgot to include the link to the DSpace documentation on how to harvest external content via OAI-PMH: https://wiki.duraspace.org/display/DSDOC18/XMLUI+Configuration+and+Customization#XMLUIConfigurationandCustomization-HarvestingItemsfromXMLUIviaOAIOREorOAIPMH That may also be helpful! - Tim On 5/23/2012 10:55 AM, Tim Donohue wrote: > Sophie, > > I've never tried this before, but it looks like PubMed supports > OAI-PMH harvesting, so you should be able to configure DSpace to > harvest content from PubMed via OAI-PMH. Here's the details from the PubMed > website: > > http://www.ncbi.nlm.nih.gov/pmc/tools/oai/ > > It says the base URL you'd want to use is > http://www.pubmedcentral.nih.gov/oai/oai.cgi > > It also has some examples of OAI requests to PubMed: > http://www.ncbi.nlm.nih.gov/pmc/tools/oai_examples/ > > Hopefully that will help you out. > > - Tim > > On 5/22/2012 3:47 PM, Deng, Sai wrote: >> Hi, >> >> Can anyone give me an example of harvesting PubMed publications from >> a specific institution? In other words, could you show me how to >> configure the auto harvesting under "Collection-Harvesting-Content >> Source": >> Content source: This collection harvests its content from an external >> source OAI Provider:______________________ OAI Set id: Specific >> sets_____________ Metadata Format: Simple Dublin Core [or] DSpace >> Intermediate Metadata >> >> Content being harvested: Harvest metadata and bitstreams (requires >> ORE >> support) >> >> We've been downloading xml data directly from the PubMed website and >> transform it to DCXML using some local VBscript. Then we export the >> DCXML file to Excel, transform Excel to SIP packages using >> BloomaMohan's program. We add several additional fields to the data >> set and do quite some editing in the Excel file. I have been >> wondering whether the auto harvesting will be a much better option. >> Any opinion or suggestion? What's your experience? >> >> Thank you for your reply! >> Sophie >> >> --------------------------------------------------------------------- >> --------- >> >> Live Security Virtual Conference >> Exclusive live event will cover all the ways today's security and >> threat landscape has changed and how IT managers can respond. >> Discussions will include endpoint security, mobile security and the >> latest in malware threats. >> http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ >> _______________________________________________ >> DSpace-tech mailing list >> [email protected] >> https://lists.sourceforge.net/lists/listinfo/dspace-tech ------------------------------------------------------------------------------ Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ _______________________________________________ DSpace-tech mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dspace-tech

