On Wed, 20 Apr 2011, Peter J. Halliday wrote:
> For arXiv, obviously we'll have both daily additions and daily
> changes. Ideally, it would be best to have this all in the same file
> intermingled.  However, that depends on Invenio being able to
> distinguish between an insert and a replace.  Is that the case, or do
> I need to split them into separate files and do separate operations on
> them?

Nope, you don't have to split them: you can use `bibupload -ir file.xml'
to perform inserts and updates in one go.  BibUpload looks at each
incoming record and tries to match it against existing records using
the following MARC tags:

   001 (=recID) 
   970 (=CFG_BIBUPLOAD_EXTERNAL_SYSNO_TAG) 
   035 (=CFG_BIBUPLOAD_EXTERNAL_OAIID_TAG)
   909CO or 0248 (=CFG_OAI_ID_FIELD) 

in that order, to detect whether it should do an update or an insert.
For more details, see `retrieve_rec_id()' in bibupload.py.
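
To make the matching order concrete, here is a rough Python sketch.  It
is not the real `retrieve_rec_id()' code, only a simplified illustration:
the names `MATCH_ORDER', `find_existing_record' and `decide_operation'
are made up, the existing records are modelled as a plain dictionary,
and the tag list assumes the configuration values quoted above (with
only 909CO standing in for CFG_OAI_ID_FIELD).

```python
# Hypothetical illustration only; NOT Invenio's actual retrieve_rec_id().
# Tags are tried in the order given in the list above.
MATCH_ORDER = ["001", "970", "035", "909CO"]


def find_existing_record(incoming, index):
    """Return the recID of the first matching identifier, or None.

    incoming: dict mapping a MARC tag to the identifier value it carries.
    index:    dict mapping (tag, value) to the recID of a stored record;
              a stand-in for the real database lookup.
    """
    for tag in MATCH_ORDER:
        value = incoming.get(tag)
        if value is None:
            continue  # record does not carry this tag, try the next one
        rec_id = index.get((tag, value))
        if rec_id is not None:
            return rec_id
    return None


def decide_operation(incoming, index):
    """Return ("update", recID) on a match, else ("insert", None)."""
    rec_id = find_existing_record(incoming, index)
    if rec_id is not None:
        return ("update", rec_id)
    return ("insert", None)
```

The real function handles more cases than this (error conditions in
particular), so treat bibupload.py as the reference.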

So if you prepare daily `file.xml' with nice persistent identifiers that
would allow for exact matching, you can safely use the same file.
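
As a sketch of what preparing such a file could look like, the Python
snippet below writes new and changed records into one MARCXML
collection, each carrying its OAI identifier in 035.  Several things
here are assumptions on my side: the helper name `build_collection' is
made up, the identifier is placed in subfield `a' with blank indicators
(check CFG_BIBUPLOAD_EXTERNAL_OAIID_TAG on your installation), and the
sample identifiers and titles are invented.

```python
import xml.etree.ElementTree as ET

MARC_NS = "http://www.loc.gov/MARC21/slim"


def build_collection(records):
    """Build a MARCXML collection from (oai_id, title) pairs.

    New and changed records go into the same collection; no distinction
    is made between them here. Returns the XML as a string.
    """
    ET.register_namespace("", MARC_NS)
    collection = ET.Element("{%s}collection" % MARC_NS)
    for oai_id, title in records:
        record = ET.SubElement(collection, "{%s}record" % MARC_NS)
        # Assumption: identifier in 035 $a, title in 245 $a.
        for tag, value in (("035", oai_id), ("245", title)):
            field = ET.SubElement(
                record,
                "{%s}datafield" % MARC_NS,
                {"tag": tag, "ind1": " ", "ind2": " "},
            )
            subfield = ET.SubElement(
                field, "{%s}subfield" % MARC_NS, {"code": "a"}
            )
            subfield.text = value
    return ET.tostring(collection, encoding="unicode")


if __name__ == "__main__":
    daily = [
        ("oai:arXiv.org:1104.0001", "A paper added today"),
        ("oai:arXiv.org:1003.0002", "An older paper, revised today"),
    ]
    with open("file.xml", "w", encoding="utf-8") as handle:
        handle.write(build_collection(daily))
```

The resulting `file.xml' would then be loaded with `bibupload -ir
file.xml' as described above.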

Best regards
-- 
Tibor Simko
