Sorry. Maybe I should make it clearer:
PubMed has a field called "Affiliation." 
(http://www.ncbi.nlm.nih.gov/pubmed/advanced)
We only want to harvest publications affiliated with our university.

I can get the results by querying the website, but I don't know how to harvest 
this set of data to our DSpace.
Thanks!!
Sophie

-----Original Message-----
From: Deng, Sai 
Sent: Wednesday, May 23, 2012 11:02 AM
To: 'Tim Donohue'
Cc: [email protected]
Subject: RE: [Dspace-tech] Harvesting PubMed

Thank you, Tim!!
There is no way to harvest for a specific institution (publications affiliated 
to one institution), right? I looked at the PubMed instructions and it looks 
like their sets are defined by Publishers. 
Here is my testing result:
OAI Provider: http://www.pubmedcentral.nih.gov/oai/oai.cgi
OAI Set id: aac              (*This is for aac set. Seems no way to harvest for 
an institution.)
Metadata format: Simple Dublin Core
Content being harvested: Harvest metadata only.
Last Harvest Result: Harvest from http://www.pubmedcentral.nih.gov/oai/oai.cgi 
sucessful on 2012-05-22 16:18:43.386

I am thinking whether the harvesting interface can be include more options. Is 
it possible to harvest only data from one university?
Vika from Boston posted these two questions before:
- Where in the harvesting settings can I specify things like the dates 
from/until which I want to harvest?

- If my collection has an accept/reject step, is there a way to make harvested 
items completely invisible until they are accepted?  Does the answer to this 
depend on whether I'm harvesting only metadata, or also full-text files?

Thank you for any insight!
Sophie


-----Original Message-----
From: Tim Donohue [mailto:[email protected]]
Sent: Wednesday, May 23, 2012 10:58 AM
To: Deng, Sai
Cc: [email protected]
Subject: Re: [Dspace-tech] Harvesting PubMed

Sophie,

I forgot to include the link to the DSpace documentation on how to harvest 
external content via OAI-PMH:

https://wiki.duraspace.org/display/DSDOC18/XMLUI+Configuration+and+Customization#XMLUIConfigurationandCustomization-HarvestingItemsfromXMLUIviaOAIOREorOAIPMH

That may also be helpful!

- Tim

On 5/23/2012 10:55 AM, Tim Donohue wrote:
> Sophie,
>
> I've never tried this before, but it looks like PubMed supports 
> OAI-PMH harvesting, so you should be able to configure DSpace to 
> harvest content from PubMed via OAI-PMH. Here's the details from the PubMed 
> website:
>
> http://www.ncbi.nlm.nih.gov/pmc/tools/oai/
>
> It says the base URL you'd want to use is 
> http://www.pubmedcentral.nih.gov/oai/oai.cgi
>
> It also has some examples of OAI requests to PubMed:
> http://www.ncbi.nlm.nih.gov/pmc/tools/oai_examples/
>
> Hopefully that will help you out.
>
> - Tim
>
> On 5/22/2012 3:47 PM, Deng, Sai wrote:
>> Hi,
>>
>> Can anyone give me an example of harvesting PubMed publications from 
>> a specific institution? In other words, could you show me how to 
>> configure the auto harvesting under "Collection-Harvesting-Content
>> Source":
>> Content source: This collection harvests its content from an external 
>> source OAI Provider:______________________ OAI Set id: Specific 
>> sets_____________ Metadata Format: Simple Dublin Core [or] DSpace 
>> Intermediate Metadata
>>
>> Content being harvested: Harvest metadata and bitstreams (requires 
>> ORE
>> support)
>>
>> We've been downloading xml data directly from the PubMed website and 
>> transform it to DCXML using some local VBscript. Then we export the 
>> DCXML file to Excel, transform Excel to SIP packages using 
>> BloomaMohan's program. We add several additional fields to the data 
>> set and do quite some editing in the Excel file. I have been 
>> wondering whether the auto harvesting will be a much better option.
>> Any opinion or suggestion? What's your experience?
>>
>> Thank you for your reply!
>> Sophie
>>
>> ---------------------------------------------------------------------
>> ---------
>>
>> Live Security Virtual Conference
>> Exclusive live event will cover all the ways today's security and 
>> threat landscape has changed and how IT managers can respond.
>> Discussions will include endpoint security, mobile security and the 
>> latest in malware threats.
>> http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
>> _______________________________________________
>> DSpace-tech mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/dspace-tech

------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to