On Friday 05 April 2002 12:14, Markus Kuhn wrote: > When I enter a Unicode character (Mozilla 0.9.9 nicely supports > UTF-8 cut&paste from xterm) into a bugzilla bug description, then > the resulting web page shows these characters as human-readable > numeric character references. Example: > > http://bugzilla.mozilla.org/show_bug.cgi?id=135762 ... > Please try this with other web interfaces! > > Markus
Viewing the bug page in Mozilla 0.9.9 on x86 Mandrake Linux 8.1: Case 1: View Encoding set to Western code page 1252 or ISO-8859-1, Mozilla font preference set to font adobe-courier-iso8859-1 for Western monospace. I can see two of the characters, double right quotation marks and Euro sign. (So some part of some software is doing something other than specified.) The other three characters come through as numeric references. In "View Source" I see approximately greek capital sigma = 'Σ' element of sign = '∈' double right quotation marks = '"' infinity = '∞' euro sign = 'EUR' The quotation character and the Euro sign display correctly in the Source window. Somehow in copy and paste the Euro sign was changed to the three-letter code. So the browser is displaying what it says it was sent. Case 2. UTF-8, mutt-clearlyu-iso10646-1 font Quote character and Euro sign initially appeared as question marks. The text displayed as above after reloading. No change in behavior of text in Source window accessed after reloading. My experience thus confirms what Markus noted later. >------- Additional Comment #1 From Markus Kuhn 2002-04-05 12:05 ------- > >Strangely, the double-right-quotation-mark and the euro-sign got >through intact onto this web page, but the other test characters got >indeed converted into numeric character references. -- Edward Cherlin Generalist "A knot! Oh, do let me help to undo it!" --Alice in Wonderland -- Linux-UTF8: i18n of Linux on all levels Archive: http://mail.nl.linux.org/linux-utf8/
