On 1/13/07, James Gardner <[EMAIL PROTECTED]> wrote:
>
> I'm not trying to be pedantic here either but I'm afraid I didn't want
> to leave your email as the last in the thread in case it confuses anyone
> following on.


Thanks for the clarification. It does indeed help.

I suppose if I could edit my post, the key thing I wanted to remind people
is that utf-8 encoded text is not unicode... it's unicode represented in a
data structure.

And to the other poster, Damjan, you're right of course: ascii is 7bits
only. Not sure how I could have forgotten that :-P

Referencing the excellent article that James posted,
"Using UTF-8, every code point from 0-127 is stored in a single byte. Only
code points 128 and above are stored using 2, 3, in fact, up to 6 bytes."

So, when I said "unicode is just multi-byte strings" I should have said
"utf-8 is just multi-byte strings"

Thanks!

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"pylons-discuss" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to [EMAIL PROTECTED]
For more options, visit this group at 
http://groups.google.com/group/pylons-discuss?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to