Hi dear list,
Can anyone give me an example of harvesting PubMed publications from a specific
institution to DSpace? Could you help me to configure the harvesting setting
under "Collection-Harvesting-Content Source" in DSpace:
Content source: This collection harvests its content from an external source.
OAI Provider:______________________?? (PubMed)
OAI Set id: Specific sets_____________?? (for a specific institution)
Metadata Format: Simple Dublin Core
[or] DSpace Intermediate Metadata
Content being harvested: Harvest metadata and bitstreams (requires ORE support)
By the way, we've been downloading xml data directly from the PubMed website
and transform it to DCXML using some local VBscript. Then we export the DCXML
file to Excel, transform Excel to SIP packages using BloomaMohan's program. We
add several additional fields to the data set and do quite some editing in the
Excel file. I have been wondering whether the DSpace built-in harvesting will
be a much better option.
Thank you for any idea or help!
Sophie