Начало переадресованного сообщения:

> От: David Carlisle <dav...@nag.co.uk>
> Дата: 17 сентября 2010 г. 14:28:33 Тихоокеанское летнее время
> Кому: Alexey Proskuryakov <a...@webkit.org>
> Тема: Ответ: [webkit-dev] Fwd: Fwd: HTML5 & MathML3 entities
> 
> On 17/09/2010 21:12, Alexey Proskuryakov wrote:
>> 
>> 17.09.2010, в 12:49, Alexey Proskuryakov написал(а):
>> 
>>> math fonts in uniocde layout typically use the opposite choice for
>>> the old character, and math renderers (including if I remember
>>> correctly mozilla's original mathml support) that map unicode slots
>>> to "legacy" 8 bit math font encodings (eg the TeX or mathematica
>>> fonts) also rendered these things to match the math operators.
>> 
>> I'd say that this was a misuse of the character point,
> 
> No the code point is in the math symbols block and was always intended
> for math usage. Some time after the code point was added (I think, I
> don't have the data to hand) it got added a canonical mapping to to 3xxx
> block, that was an error that the unicode consortium is now trying to
> correct (or at least back when unicode 3.x added this new character)
> 
>> and thus a bug in the font. Unicode fonts aren't supposed to use
> glyphs with different meanings in the same code points (although some
> may claim that CJK unification crosses the border in that respect).
>> 
>>> So given, as you say, that some change was inevitable, taking the
>>> math choice is I still think the right one
>> 
>> 
>> It just seems that Unicode consortium and W3C basically decided to
>> make opposite choices on this,
> 
> No the change was explicitly at the suggestion of the UTC who wanted to
> deprecate the use of the old character and clarify that (because the
> character was always intended for math use) that any uses of the old one
> are replaced by the new one,
> 
> > which was not very helpful.&rang used to be a synonym to U+232A, but > now 
> > the former is a CJK character, and the latter is a math one. I
> > think that consistency with Unicode consortium's choice is an
> > important consideration.
> 
> I think you have "former" and "latter" swapped in the above?
> 
> &rang used to map to 232A  which was originally a math character but then 
> erroneously given a canonical mapping to 3009
> 
> It now maps to 27E9; which was a character added by the UTC explicitly
> to be a replacement for the deprecated 232A, which is only deprecated because 
> it had been given an erroneous NFC mapping,
> 
> As I said above the definition as given in the spec _is_ the choice of the 
> Unicode consortium (or at least the choice of those members of the UTC who 
> corresponded with me)
>> 
> 
>> But if I understood you correctly, there is a significant amount of
>> MathML documents that use rang/lang, is that accurate?
> 
> Oh yes angle brackets are very common notation.
> 
> > Practical compatibility requirements are more important than potential
> > issues that I cited, at least as long as they remain potential. If I
> > had an example of brokenness,
> 
> > I'd say that HTML content that followed specs outweighs MathML content
> > that violated them.
> 
> MathML did not violate any specs.
> 
> the lang and rang entity names come from the ISO math entity to denote
> math angle brackets. These sets and these names predate Unicode and predate 
> HTML, it's unfortunate that after the names were mapped to unicode a 
> canonical mapping to a different character was added, but the only fix the 
> UTC suggest for that is just not using 2329 at all and use 27E8 instead. 
> Which is what the entity spec recommends.
>> 
>> - WBR, Alexey Proskuryakov
>> 
>> 
> David

- WBR, Alexey Proskuryakov


_______________________________________________
webkit-dev mailing list
webkit-dev@lists.webkit.org
http://lists.webkit.org/mailman/listinfo.cgi/webkit-dev

Reply via email to