On Mon, Jul 28, 2003 at 07:04:12PM +0200, Eric Cholet wrote:
> Dave Rolsky wrote:
[...]
> >But don't latin-1 and utf8 overlap for the first 255 chars anyway?
>
> No they don't. Common misconception it seems.

To clarify, UTF-8 and ASCII overlap, but in UTF-8 any byte with the high bit set indicates part of a multi-byte character, so the extended characters in Latin-1 (0x80-0xFF) cannot match byte for byte. The first 256 code points in Unicode do coincide with Latin-1, but that is not really helpful, since you are using either UTF-8 or a fixed-width multi-byte encoding and need to add the extra bytes either way...

-ben
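
A quick way to see the difference is to encode the same characters both ways. This is only a sketch, written in Python for illustration (nothing in this thread uses it), and the helper name encodings() is made up for the example:

```python
def encodings(ch):
    """Return (code point, Latin-1 bytes, UTF-8 bytes) for one character."""
    return ord(ch), ch.encode("latin-1"), ch.encode("utf-8")


def is_valid_utf8(data):
    """True if the byte string decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# ASCII range: both encodings produce the same single byte.
a_point, a_latin1, a_utf8 = encodings("A")

# Latin-1 "extended" range: same code point number, different bytes.
e_point, e_latin1, e_utf8 = encodings("\u00e9")  # e with acute accent

print(a_point, a_latin1, a_utf8)
print(e_point, e_latin1, e_utf8)

# A lone Latin-1 high byte is not a complete UTF-8 sequence.
print(is_valid_utf8(e_latin1))
```

So the code point numbers agree up to 0xFF, but the bytes on the wire only agree up to 0x7F.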
