John Delacour skribis 2007-10-18 20:24 (+0100): > >They are "characters outside the latin-1 range". > Latin-1 has nothing to do with it.
Blocks of characters have names in Unicode. One of those names is "Latin-1 Supplement". It has a lot to do with it. However, I was mistaken: "latin-1" in Unicode is U+0080..U+00FF, thus excluding ASCII part, which is called "Basic Latin" in Unicode. > There are countless legacy character sets that use the code points > from 32 to 255, and besides, what maquerades as Latin-1 in various > environments rarely is strict iso-8859-1 The latin-1 here is not an alias for iso-8859-1, though I do wish to point out that iso-8859-1 was redefined as a Unicode encoding in 1997. That is, byte 0x80 is defined as the Unicode character U+0080. -- Met vriendelijke groet, Kind regards, Korajn salutojn, Juerd Waalboer: Perl hacker <[EMAIL PROTECTED]> <http://juerd.nl/sig> Convolution: ICT solutions and consultancy <[EMAIL PROTECTED]>