This is an automated notification sent by LCG Savannah.
It relates to:
task #9838, project CDS Invenio
==============================================================================
LATEST MODIFICATIONS of task #9838:
==============================================================================
Update of task #9838 (project cdsware):
Status: None => Done
Percent Complete: 0% => 30%
_______________________________________________________
Follow-up Comment #1:
BibUpload --delete is now taking care of updating bibxxx tables correctly
==============================================================================
OVERVIEW of task #9838:
==============================================================================
URL:
<http://savannah.cern.ch/task/?9838>
Summary: BibUpload: --delete mode should update bibxxx
properly
Project: CDS Invenio
Submitted by: simko
Submitted on: 2009-05-13 19:03
Should Start On: 2009-05-13 00:00
Should be Finished on: 2009-05-13 00:00
Category: BibUpload
Priority: 7 - High
Status: Done
Privacy: Public
Percent Complete: 30%
Assigned to: skaplun
Open/Closed: Open
Discussion Lock: Any
Effort: 0.00
_______________________________________________________
This task cumulates some bibupload problems together, notably (a) doubling of
970 when appending MARCXML snippets not containing 001 (hence matched via
970); (b) not upadating bibxxx tables in --delete mode (but MARCXML is fine).
Here are steps to reproduce the problems:
1) Take record 1, add some 970 to it, and bibupload -a it back.
Here is the example z1.xml file:
| <?xml version="1.0" encoding="UTF-8"?>
| <collection xmlns="http://www.loc.gov/MARC21/slim">
| <record>
| <controlfield tag="001">1</controlfield>
| <datafield tag="970" ind1=" " ind2=" ">
| <subfield code="a">0003717PHOPHO</subfield>
| </datafield>
| </record>
| </collection>
2) Prepare z2.xml of the content quoted below, then bibupload -a, then
you will see 970 doubled. This is the first problem. (970 should
be used for matching when 001 is not there, it should not get
doubled when -a mode is used.)
Here is the example z2.xml file:
| <?xml version="1.0" encoding="UTF-8"?>
| <collection xmlns="http://www.loc.gov/MARC21/slim">
| <record>
| <datafield tag="959" ind1=" " ind2=" ">
| <subfield code="a">foo1</subfield>
| </datafield>
| <datafield tag="959" ind1=" " ind2=" ">
| <subfield code="a">foo2</subfield>
| </datafield>
| <datafield tag="970" ind1=" " ind2=" ">
| <subfield code="a">0003717PHOPHO</subfield>
| </datafield>
| </record>
| </collection>
3) Take the same file and do bibupload -c, then things will get back to
shape, which is good.
4) Take the same file and do bibupload -d. Then check MARCXML, it is
good. Then check MARC, it is not good. Fields 959 stayed there.
Apparently the entries in the bibxxx tables were not properly deleted. This
is the second problem.
5) The issue(s) may be present when matching via 035 too; I have not
checked. This may be the third problem.
6) The current regression test suite for --delete mode does not catch these
problems.
It should be updated.
7) Finally, we use 970 $a for external SYSNOs, but the demo
site data use still 909COo-something. So we are not consistent.
_______________________________________________________
Follow-up Comments:
-------------------------------------------------------
Date: 2009-05-18 14:08 By: Samuele Kaplun <skaplun>
BibUpload --delete is now taking care of updating bibxxx tables correctly
_______________________________________________________
Carbon-Copy List:
CC Address | Comment
------------------------------------+-----------------------------
2195 | -COM-
1576 | -SUB-
==============================================================================
This item URL is:
<http://savannah.cern.ch/task/?9838>
_______________________________________________
Message sent via/by LCG Savannah
http://savannah.cern.ch/