On 8/17/2017 7:24 AM, Mike FABIAN wrote:
Asmus Freytag via Unicode <unicode@unicode.org> さんはかきました:

On 8/16/2017 6:26 AM, Mike FABIAN via Unicode wrote:

     EastAsianWidth.txt contains:
3248..324F;A # No [8] CIRCLED NUMBER TEN ON BLACK SQUARE..CIRCLED NUMBER EIGHTY ON BLACK SQUARE i.e. it classifies the width of the characters at codepoints
     between 3248 and 324F as ambiguous.
Is this really correct? Shouldn’t they be “W”, i.e. wide? In most fonts these characters seem to be square shaped wide characters.

"W" not only implies display width, but also a different treatment in the 
context of line
breaking and vertical layout of text.

"W" characters behave more like Ideographs, for the most part, while "N" are 
treated as
forming words (for the most part).
Most emoji now have "W", for example:

1F600..1F64F;W   # So    [80] GRINNING FACE..PERSON WITH FOLDED HANDS

That seems correct because emoji behave more like Ideographs.

Isn’t this the same for “CIRCLED NUMBER TEN ON BLACK SQUARE”?
This seems to me also more like an Ideograph.

"A" means, you get to decide whether to treat these as "W" or "N" based on 
context. If
used in a non ideographic context, they behave like all other symbols (but 
happen to fill
an EM square).

"A" means, you get to decide whether to treat these as "W" or "N" based on 
context.

There's really not strong need to change an "A" towards "W", because "A" doesn't get in 
your way if you decided that "W" works better for you.

Remember that all the EAW properties ares supposed to be "resolved" down to W 
or N. For some, like Na that resolution is deterministic, for A it is context/application 
dependent, but when you finally process your data, only W(ide) or N(arrow) remain after 
resolution.

A./


A./


Reply via email to