UTF-8 is not ASCII. ASCII is a 7-bit charset (0-127), UTF-8 is a 8-bit, multibyte, NULL byte clean, encoding of Unicode.
Unicode is not "multi-byte strings". Unicode is the standard that defines which codepoint (number) is which glyph (character). There are different ways to represent Unicode in software. UTF-8 is one, UCS-16 or UCS-32 or UTF-16 are another. UCS-32 is the simplest 32-bit number to Unicode glyph mapping. --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "pylons-discuss" group. To post to this group, send email to pylons-discuss@googlegroups.com To unsubscribe from this group, send email to [EMAIL PROTECTED] For more options, visit this group at http://groups.google.com/group/pylons-discuss?hl=en -~----------~----~----~----~------~----~------~--~---