Dave Rolsky wrote: > 97-111-251-116 That's not a well-formed UTF-8 sequence. It *is* correct Latin-1 or Unicode, encoding a sequence of four characters.
> IS UTF8 That refers to the internal UTF-8 flag on the string. That means it's internally represented using UTF-8, which usually means that the string being represented is Unicode, not UTF-8. -zefram