On Thu, 3 Dec 2009, Andy Koppe wrote:

2009/12/3 Linda Walsh:
C.UTF_8 doesn't exist.
...
You can't have "C" and "UTF-8", because C means no encoding (default).
UTF-8 IS an encoding, so they are mutually exclusive.

From http://www.opengroup.org/onlinepubs/009695399/basedefs/xbd_chap07.html,
ยง7.2:

"The tables in Locale Definition describe the characteristics and
behavior of the POSIX locale for data consisting entirely of
characters from the portable character set and the control character
set. For other characters, the behavior is unspecified."

This means that characters 0..127 have to be treated as ASCII, but
beyond that an implementation can do what it wants. And on Cygwin 1.7,
plain "C" actually does imply UTF-8, which happily is
backward-compatible with ASCII.

That's an interpretation that so far hasn't been blessed by the standards
people.  Any discussion of this topic should mention that, as a caveat.

ymmv

--
Thomas E. Dickey
http://invisible-island.net
ftp://invisible-island.net
--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://x.cygwin.com/docs/
FAQ:                   http://x.cygwin.com/docs/faq/

Reply via email to