https://en.wikipedia.org/wiki/UTF-8
B'0.......' is a single-byte (8-bit) ASCII character.
B'110.....' is the lead byte of a 2-byte (16-bit) UTF-8 sequence.
B'1110....' is the lead byte of a 3-byte (24-bit) UTF-8 sequence.
B'11110...' is the lead byte of a 4-byte (32-bit) UTF-8 sequence.
B'111110..' would lead a 5-byte (40-bit) sequence (in the original UTF-8 definition, not valid since RFC 3629).
B'1111110.' would lead a 6-byte (48-bit) sequence (in the original UTF-8 definition, not valid since RFC 3629).
B'11111110' would lead a 7-byte (56-bit) sequence (never defined; X'FE' does not occur in UTF-8).
B'11111111' would lead an 8-byte (64-bit) sequence (never defined; X'FF' does not occur in UTF-8).
B'10......' is a continuation byte following a lead byte.
B'10000000' is an ordinary continuation byte whose six data bits are all zero; it is not padding and must not be removed.
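
A minimal sketch of how the leading-bit patterns above could be tested, assuming Python 3 and the RFC 3629 limit of four bytes per character. The function name classify_utf8_byte and its return labels are made up for illustration; they are not from any library. It looks at one byte in isolation and does not validate whole sequences.

```python
def classify_utf8_byte(b: int) -> str:
    """Classify one byte value (0..255) by its leading bits.

    Hypothetical helper for illustration only. It does not check that
    the right number of continuation bytes actually follows a lead byte.
    """
    if not 0 <= b <= 0xFF:
        raise ValueError("expected a byte value in 0..255")
    # X'C0' and X'C1' could only start overlong 2-byte forms, and
    # X'F5'..X'FF' would encode values past U+10FFFF or use the 5-byte
    # and longer patterns, so none of them occur in valid UTF-8.
    if b in (0xC0, 0xC1) or b >= 0xF5:
        return "invalid"
    if b < 0x80:
        return "ascii"          # B'0.......'
    if b < 0xC0:
        return "continuation"   # B'10......'
    if b < 0xE0:
        return "lead2"          # B'110.....'
    if b < 0xF0:
        return "lead3"          # B'1110....'
    return "lead4"              # B'11110...'


# Show the byte classes for a few sample characters.
for ch in ("A", "\u00e9", "\u20ac", "\U0001F600"):
    encoded = ch.encode("utf-8")
    print(encoded.hex(), [classify_utf8_byte(b) for b in encoded])
```

The last sample, U+1F600, encodes as X'F09F9880', so its final byte is B'10000000' acting as a normal continuation byte.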


On Thu, Dec 10, 2015 at 4:17 PM, Tony Harminc <[email protected]> wrote:
> On 9 December 2015 at 23:04, Shmuel Metz (Seymour J.)
> <[email protected]> wrote:
>
>> Tony Harminc <[email protected]> said:
>>
>> >UTF-8 *is* Unicode.
>>
>> No, UTF-32 is Unicode.
>>
>> >It's just a Transform Format,
>>
>> Aside from that, Mrs Lincoln, how was the play?
>
> Shmuel, you know as well as I do that UTF-32 is not a privileged
> representation of Unicode. From the top of the very first page of
> their introduction:
> (http://www.unicode.org/standard/WhatIsUnicode.html)
>
> "What is Unicode?
>
> Unicode provides a unique number for every character,
> no matter what the platform,
> no matter what the program,
> no matter what the language."
>
> and later:
>
> "The Unicode Standard and ISO/IEC 10646 support three encoding forms
> (UTF-8, UTF-16, UTF-32) that use a common repertoire of characters. "
>
> UTF-8 *is* Unicode, as are UTF-16 and UTF-32. Mrs Lincoln doesn't come
> into it.
>
> Tony H.
>
> ----------------------------------------------------------------------
> For IBM-MAIN subscribe / signoff / archive access instructions,
> send email to [email protected] with the message: INFO IBM-MAIN



-- 
Mike A Schwab, Springfield IL USA
Where do Forest Rangers go to get away from it all?
