Sort of...
Internally, they are stored in what is known as UV-UTF8, a slight variant
of UTF8, but essentially the
same.
We (ok, IBM does now :-) use the encoding scheme to encode the mark
characters and SQL NULL
into unique code positions that don't conflict with the other UTF8 encoded
entities. It does limit the
total number of extended byte sequences that can be represented, however
Basic theory:
If the byte is of the form 0xxx, it's an ascii byte
If the byte is of the form 10xx xxxx, it's multi-byte 'trail' byte
If the byte is of the form 110x xxxx or 1110 xxxx, it's a lead byte of a 2
or 3 byte multi-byte sequence
If the byte is of the form 1111 0xxx, it's a mark character, where the xxx
determines which one.
If the byte is of the form 1111 1111, it's a SQL NULL
Etc...
Dave
At 05:47 PM 7/13/2004 +1000, you wrote:
>Ray Wurlod wrote:
> > By default, all "standard" functions work in characters, not bytes.
> >
> > With NLS enabled, you have three possibilities; characters, bytes (only
> for length) and display positions (for example Chinese has some
> double-width characters; these need two display positions even though
> there is only one character). The query engine uses the display position
> functions rather than the character functions.
> >
> > Examples ("standard" (character), then bytes, then display positions):
> >
> > LEN BYTELEN LENDP
> > LENS n/a LENSDP
> > FMT n/a FMTDP
>
>Encyclopedic as usual Ray :)
>
>As a supplementary question, am I right in thinking that NLS strings in
>basic will always be UTF8 encoded?
>
>thanks,
>
>
>Craig
>-------
>u2-users mailing list
>[EMAIL PROTECTED]
>To unsubscribe please visit http://listserver.u2ug.org/
========================================================================
David T. Meeks || "All my life I'm taken by surprise
Architect, Technology Office || I'm someone's waste of time
Ascential Software || Now I walk a balanced line
[EMAIL PROTECTED] || and step into tomorrow" - IQ
========================================================================
-------
u2-users mailing list
[EMAIL PROTECTED]
To unsubscribe please visit http://listserver.u2ug.org/