Tony Graham wrote:
> [EMAIL PROTECTED] wrote at 27 Sep 2002 16:44:32 -0300:
>  > Out of the XML recomendation,section 2.2:
>  > 
>  >         A character is an atomic unit of text as specified by ISO/IEC 
>  > 10646 [ISO10646]. Legal characters are tab,
>  > carriage return, line feed, and the legal graphic characters of Unicode 
>  > and ISO/IEC 10646.
> XML 1.0 Second Edition removed "graphic" (which I always found
> confusing but which is good ISO-speak).
>  > or, more clearly:
>  > 
>  >         Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | 
>  > [#x10000-#x10FFFF]
>  >         /* any Unicode character, excluding the surrogate blocks, FFFE, 
>  > and FFFF. */
>  > 
>  > 
>  > That means  "-", "#12235" , etc are characters, while "'1'" is not. 
> ⿋ is a character reference.  '#12235' is how you talk about a
> character's code point, although the hexadecimal representation is
> usually preferable.
> In XSL terms, "'1'" is a one-character string literal, but while you
> could claim that it is one character, there's no XSL conversion from a
> string to a character, so <fo:character character="'1'"/> should fail.


I don't think this gets us out of difficulty.  A casual inspection 
reveals no conversion, either, from an NCName to a character.  So an 
attribute value assignment of
will, I think, parse (in the parser implied by the grammar of XSL 
expressions) as an NCName (whereas
will parse as an unadorned MINUS sign.)  So how do I represent a character?

Furthermore, Section 5.11 has
     A single Unicode character.
     A sequence of characters.

If an attribute value assignment of
   "'a sequence of characters'"
assigns a sequence of characters, then
must assign a sequence of one character.

What's the difference between "a single Unicode character" and "a 
sequence of one character"?  Well, one is a sequence, and therefore a 
string, and there's no XSL conversion, etc.

So how do I represent a character?

To me, the cleanest, least ambiguous way is to represent a <character> 
attribute assignment value with "'<character>'" - a string literal of 
length 1.

"Lord, to whom shall we go?"

To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, email: [EMAIL PROTECTED]

Reply via email to