XMLSERIALIZE operator doesn't follow SQL/XML spec in some areas when 
serializing a sequence.
--------------------------------------------------------------------------------------------

                 Key: DERBY-1759
                 URL: http://issues.apache.org/jira/browse/DERBY-1759
             Project: Derby
          Issue Type: Bug
    Affects Versions: 10.2.1.0, 10.2.2.0, 10.3.0.0
            Reporter: A B


The SQL/XML specification dictates that, when serializing a sequence of XML 
items, the XMLSERIALIZE operator must first "normalize" the sequence based on 
the rules defined here:

  http://www.w3.org/TR/xslt-xquery-serialization/#serdm

The current Derby implementation doesn't perform such normalization, which 
leads to two ways in which the results of an XMLSERIALIZE operator may not 
agree with the required behavior:

  1. Adjacent atomic values in a sequence are serialized with no
     space between them, but step 3 of the normalization rules at
     the above link requires a single space between them.

  2. Derby will allow serialization of a sequence even if it has
     a top-level Attribute node in it, but the rules of normalization
     dictate that an error should be thrown instead (step 7).
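
To make the two rules concrete, here is a rough Java sketch of just those two 
checks. It is not Derby code and it does not implement the full normalization 
procedure; the class SequenceNormalizationSketch, the Item type and the choice 
of IllegalStateException are all hypothetical names invented for illustration.

```java
import java.util.List;

/** Hypothetical sketch: only the two normalization rules cited above. */
final class SequenceNormalizationSketch {

    /** Hypothetical stand-in for one item of an XML sequence. */
    static final class Item {
        enum Kind { ATOMIC, ATTRIBUTE, OTHER_NODE }

        final Kind kind;
        final String text;

        private Item(Kind kind, String text) {
            this.kind = kind;
            this.text = text;
        }

        static Item atomic(String value)    { return new Item(Kind.ATOMIC, value); }
        static Item attribute(String value) { return new Item(Kind.ATTRIBUTE, value); }
        static Item otherNode(String xml)   { return new Item(Kind.OTHER_NODE, xml); }
    }

    /**
     * Serializes a sequence, rejecting top-level attribute nodes and
     * putting a single space between adjacent atomic values.
     */
    static String serialize(List<Item> sequence) {
        StringBuilder out = new StringBuilder();
        Item previous = null;
        for (Item item : sequence) {
            // Rule 2 in the list above: a top-level attribute node is an error.
            if (item.kind == Item.Kind.ATTRIBUTE) {
                throw new IllegalStateException(
                    "cannot serialize a sequence with a top-level attribute node");
            }
            // Rule 1 in the list above: adjacent atomic values get one space.
            if (previous != null
                    && previous.kind == Item.Kind.ATOMIC
                    && item.kind == Item.Kind.ATOMIC) {
                out.append(' ');
            }
            out.append(item.text);
            previous = item;
        }
        return out.toString();
    }
}
```

With this sketch, a sequence of the two atomic values 48 and 1900-02-08 
serializes with a space between them, while a sequence containing an attribute 
item is rejected before anything is returned.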

Both of these behaviors can be seen with the following query.

values
  xmlserialize(
    xmlquery('/ageinfo/@*' passing by ref
      xmlparse(
        document '<ageinfo age="48" birthdate="1900-02-08"/>'
        preserve whitespace
      )
      empty on empty
    )
    as char(50)
  )

Derby will currently return the following result from this statement:

1
--------------------------------------------------
481900-02-08

This result does not abide by the SQL/XML specification because a) Derby allowed 
serialization of a sequence having a top-level attribute node (actually, the 
sequence had two), and b) the atomic values produced from the attributes were 
displayed without a space between them.

The correct behavior for the above example is to return a serialization error 
caused by the presence of an Attribute node in the sequence.

If the example were rewritten as, say:

-    xmlquery('/ageinfo/@*' passing by ref
+    xmlquery('fn:data(/ageinfo/@*)' passing by ref

then the attribute nodes are no longer present--we only have their atomic 
values, which is allowed.  Thus the correct result should then be:

1
--------------------------------------------------
48 1900-02-08

Note, though, that Xalan doesn't appear to support the "fn:data" function, so 
this rewritten query won't actually work.  I tried using Xalan's built-in 
string function, as follows:

-    xmlquery('/ageinfo/@*' passing by ref
+    xmlquery('string(/ageinfo/@*)' passing by ref

but Xalan only returns the first attribute in that case; it doesn't return the 
second one.  So part of this Jira issue is probably going to involve figuring 
out how to allow a user to retrieve a sequence of attribute *values* (as 
opposed to attribute nodes) using Xalan and still abide by the SQL/XML rules.
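
One possible direction, sketched below in Java, is to evaluate the path as a 
node set and then pull the value out of each node, joining the values with 
single spaces. This is only an illustration, not a proposed Derby change: it 
uses the standard javax.xml.xpath API from the JDK rather than calling Xalan 
directly, on the assumption that the underlying XPath 1.0 behavior is 
comparable, and the class and method names are made up. The order in which 
attribute nodes come back is implementation-dependent.

```java
import java.io.StringReader;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathExpressionException;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

/** Hypothetical sketch: attribute values instead of attribute nodes. */
final class AttributeValuesSketch {

    /** Wraps the path in string(), which yields only the first node's value. */
    static String firstValueOnly(String xml, String path) {
        try {
            XPath xpath = XPathFactory.newInstance().newXPath();
            return xpath.evaluate("string(" + path + ")",
                    new InputSource(new StringReader(xml)));
        } catch (XPathExpressionException e) {
            throw new IllegalArgumentException(e);
        }
    }

    /** Evaluates the path as a node set and joins each node's value with a space. */
    static String joinedValues(String xml, String path) {
        try {
            XPath xpath = XPathFactory.newInstance().newXPath();
            NodeList nodes = (NodeList) xpath.evaluate(path,
                    new InputSource(new StringReader(xml)),
                    XPathConstants.NODESET);
            StringBuilder out = new StringBuilder();
            for (int i = 0; i < nodes.getLength(); i++) {
                if (i > 0) {
                    out.append(' ');
                }
                // For attribute nodes, getNodeValue() is the attribute's value.
                out.append(nodes.item(i).getNodeValue());
            }
            return out.toString();
        } catch (XPathExpressionException e) {
            throw new IllegalArgumentException(e);
        }
    }
}
```

Calling joinedValues with the ageinfo document from the example and the path 
/ageinfo/@* returns both attribute values separated by a space, whereas 
firstValueOnly returns just one of them, matching what I saw with string().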

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
