https://en.wikipedia.org/wiki/UTF-8 B'0.......' is a 8 bit ASCII characters. B'110.....' is a 16 bit UTF character. B'1110....' is a 24 bit UTF character. B'11110...' is a 32 bit UTF character. B'111110..' could be a 40 bit UTF character (none established). B'1111110.' could be a 48 bit UTF character (none established). B'11111110' could be a 56 bit UTF character (none established). B'11111111' could be a 64 bit UTF character (none established). B'10......' is a continuation UTF character after a previous leading character. B'10000000' is a padding UTF character and should be removed.
On Thu, Dec 10, 2015 at 4:17 PM, Tony Harminc <[email protected]> wrote: > On 9 December 2015 at 23:04, Shmuel Metz (Seymour J.) > <[email protected]> wrote: > >> Tony Harminc <[email protected]> said: >> >> >UTF-8 *is* Unicode. >> >> No, UTF-32 is Unicode. >> >> >It's just a Transform Format, >> >> Aside from that, Mrs Linclon, how was the play. > > Shmuel, you know as well as I do that UTF-32 is not a privileged > representation of Unicode. From the top of the very first page of > their introduction: > (http://www.unicode.org/standard/WhatIsUnicode.html) > > "What is Unicode? > > Unicode provides a unique number for every character, > no matter what the platform, > no matter what the program, > no matter what the language." > > and later: > > "The Unicode Standard and ISO/IEC 10646 support three encoding forms > (UTF-8, UTF-16, UTF-32) that use a common repertoire of characters. " > > UTF-8 *is* Unicode, as are UTF-16 and UTF-32. Mrs Lincoln doesn't come into > it. > > Tony H. > > ---------------------------------------------------------------------- > For IBM-MAIN subscribe / signoff / archive access instructions, > send email to [email protected] with the message: INFO IBM-MAIN -- Mike A Schwab, Springfield IL USA Where do Forest Rangers go to get away from it all? ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to [email protected] with the message: INFO IBM-MAIN
