This is an automated notification sent by LCG Savannah.
It relates to:
                task #9838, project CDS Invenio

==============================================================================
 OVERVIEW of task #9838:
==============================================================================

URL:
  <http://savannah.cern.ch/task/?9838>

                 Summary: BibUpload: --delete mode should update bibxxx
properly
                 Project: CDS Invenio
            Submitted by: simko
            Submitted on: 2009-05-13 19:03
         Should Start On: 2009-05-13 00:00
   Should be Finished on: 2009-05-13 00:00
                Category: BibUpload
                Priority: 7 - High
                  Status: None
                 Privacy: Public
        Percent Complete: 0%
             Assigned to: skaplun
             Open/Closed: Open
         Discussion Lock: Any
                  Effort: 0.00

    _______________________________________________________


This task cumulates some bibupload problems together, notably (a) doubling of
970 when appending MARCXML snippets not containing 001 (hence matched via
970); (b) not upadating bibxxx tables in --delete mode (but MARCXML is fine).
 Here are steps to reproduce the problems:

1) Take record 1, add some 970 to it, and bibupload -a it back.
   Here is the example z1.xml file:
   | <?xml version="1.0" encoding="UTF-8"?>
   | <collection xmlns="http://www.loc.gov/MARC21/slim";>
   | <record>
   |   <controlfield tag="001">1</controlfield>
   |   <datafield tag="970" ind1=" " ind2=" ">
   |     <subfield code="a">0003717PHOPHO</subfield>
   |   </datafield>
   | </record>
   | </collection>

2) Prepare z2.xml of the content quoted below, then bibupload -a, then
   you will see 970 doubled.  This is the first problem.  (970 should
   be used for matching when 001 is not there, it should not get
   doubled when -a mode is used.)

   Here is the example z2.xml file:
   | <?xml version="1.0" encoding="UTF-8"?>
   | <collection xmlns="http://www.loc.gov/MARC21/slim";>
   | <record>
   |   <datafield tag="959" ind1=" " ind2=" ">
   |     <subfield code="a">foo1</subfield>
   |   </datafield>
   |   <datafield tag="959" ind1=" " ind2=" ">
   |     <subfield code="a">foo2</subfield>
   |   </datafield>
   |   <datafield tag="970" ind1=" " ind2=" ">
   |     <subfield code="a">0003717PHOPHO</subfield>
   |   </datafield>
   | </record>
   | </collection>

3) Take the same file and do bibupload -c, then things will get back to
shape, which is good.

4) Take the same file and do bibupload -d.  Then check MARCXML, it is
   good.  Then check MARC, it is not good.  Fields 959 stayed there. 
Apparently the entries in the bibxxx tables were not properly deleted. This
is the second problem.

5) The issue(s) may be present when matching via 035 too; I have not
   checked.  This may be the third problem.

6) The current regression test suite for --delete mode does not catch these
problems.
   It should be updated.

7) Finally, we use 970 $a for external SYSNOs, but the demo
   site data use still 909COo-something.  So we are not consistent.



    _______________________________________________________

Carbon-Copy List:

CC Address                          | Comment
------------------------------------+-----------------------------
1576                                | -SUB-




==============================================================================

This item URL is:
  <http://savannah.cern.ch/task/?9838>

_______________________________________________
  Message sent via/by LCG Savannah
  http://savannah.cern.ch/

Reply via email to