On Wed, 20 Apr 2011, Peter J. Halliday wrote:
> For arXiv, obviously we'll have both daily additions and daily
> changes. Ideally, it would be best to have this all in the same file
> intermingled. However, that depends on Invenio being able to
> distinguish between an insert and a replace. Is that the case, or do
> I need to split them into separate files and do separate operations on
> them?

Nope, you don't have to split them: you can use `bibupload -ir file.xml'
to perform inserts and updates in one go. BibUpload looks at each
incoming record and tries to match the following MARC tags, in this
order, to detect whether it should do an update or an insert:

  001            (=recID)
  970            (=CFG_BIBUPLOAD_EXTERNAL_SYSNO_TAG)
  035            (=CFG_BIBUPLOAD_EXTERNAL_OAIID_TAG)
  909CO or 0248  (=CFG_OAI_ID_FIELD)

For more details, see `retrieve_rec_id()' in bibupload.py.

So if you prepare the daily `file.xml' with nice persistent identifiers
that allow for exact matching, you can safely use the same file.

Best regards
--
Tibor Simko
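
For illustration, the matching order described above can be sketched in a few lines of Python. This is a simplified sketch and not the actual `retrieve_rec_id()' from bibupload.py: the function name `decide_action', the dictionary-based record and index, and the sample identifiers are all assumptions made for the example, and the real code handles more cases than shown here.

```python
# Illustrative sketch only: NOT the real retrieve_rec_id() from bibupload.py.
# The tag order follows the email; the data structures are hypothetical.

# CFG_OAI_ID_FIELD is configurable (909CO or 0248); 0248 is assumed here.
OAI_ID_TAG = "0248"

# Tags are tried in this order; the first one that matches wins.
MATCH_ORDER = ["001", "970", "035", OAI_ID_TAG]


def decide_action(record, index):
    """Return ("update", recid) on the first identifier match,
    otherwise ("insert", None).

    record: dict mapping a MARC tag to the identifier value found
            in the incoming record
    index:  dict mapping (tag, value) to the recID of a record that
            already exists in the database
    """
    for tag in MATCH_ORDER:
        value = record.get(tag)
        if value is None:
            continue  # the incoming record does not carry this tag
        recid = index.get((tag, value))
        if recid is not None:
            return ("update", recid)
    return ("insert", None)


# Hypothetical sample data: one record already known under its OAI id.
existing = {("035", "oai:arXiv.org:1104.0001"): 42}

print(decide_action({"035": "oai:arXiv.org:1104.0001"}, existing))
print(decide_action({"035": "oai:arXiv.org:1104.9999"}, existing))
```

The first call finds the identifier in the index and reports an update of record 42; the second finds no match and reports an insert. This is why a single daily file can safely mix new and changed records, as long as every record carries a stable identifier in one of the tags above.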
