> From: Nick Roberts <[EMAIL PROTECTED]> > Date: Mon, 26 Feb 2007 10:10:43 +1300 > Cc: [email protected] > > > > 1) Why was this change made? > > > > Probably to make it easier to avoid incorrect conversions when > > extracting elements. We don't want to get negative numbers > > for byte values above 127. > > Yes, that bit is self-evident but when are eight bit strings needed? > Mathieu Lacage stated on gdb-patches: > > ....a lot of people (the first > which comes to my mind is libxml2) decided to use "unsigned char *" to > identify utf-8 encoded strings in C. > > Would this also be the case for Emacs?
No, Emacs never uses utf-8 in Lisp strings (at least not in Emacs 22 and earlier). However, the emacs-mule internal encoding of characters stores 8-bit bytes for some codepoints (see the commentary in coding.c for details). I think byte code is also sometimes held in strings, and can include bytes with the 8th bit set. _______________________________________________ emacs-pretest-bug mailing list [email protected] http://lists.gnu.org/mailman/listinfo/emacs-pretest-bug
