#543: BibHarvest: multiple harvests of records
------------------------+--------------------
Reporter: jlavik | Owner: jlavik
Type: defect | Status: new
Priority: major | Milestone:
Component: BibHarvest | Version:
Keywords: |
------------------------+--------------------
When the OAI Harvester is configured to harvest records from multiple sets
in one go, the same record may be harvested more then once.
From the OAI-PMH spec
([http://www.openarchives.org/OAI/openarchivesprotocol.html#Set]):
An item may be organized in more than one set; meaning that different
setSpec arguments may return the same record(s).
This can cause some issues during the harvesting workflow that should be
avoided. Firstly, it causes BibUpload to fail the second time a record is
inserted. Secondly, the post-processing may be done repeatedly for the
same record.
To fix this, I suggest that the OAI Harvester can keep track of which
records that have already been harvested through its OAI-PMH identifier.
--
Ticket URL: <http://invenio-software.org/ticket/543>
Invenio <http://invenio-software.org>