Re: On the lack of a SQUARE TB glyph

2019-09-27 Thread Ken Whistler via Unicode

Fred,

2 hours and 33 minutes from now (today). But you don't need to try to 
synch a proposal like this to a particular script ad hoc meeting. That 
group meets roughly once a month, and any new proposal coming in right 
now wouldn't be on the Unicode 13.0 train, even if the UTC immediately 
agreed to it. So there isn't an immediately urgent deadline for new 
proposals.


--Ken

On 9/26/2019 10:15 PM, Fred Brennan via Unicode wrote:

When does the Script Ad Hoc meet next?


Re: On the lack of a SQUARE TB glyph

2019-09-27 Thread Yifán Wáng via Unicode
> * Does the existence of the legacy Adobe encoding Adobe-Japan1-6 shift the
> balance? It has a SQUARE TB at CID+8306.

The code point suggests that it was already there as early as
Adobe-Japan1-1 in 1993. The fact makes it seem certainly weird why it
hasn't been included like its fellows nearby. Perhaps it could be used
as a rationale that this character is erroneously missing.

> * Should I also propose SQUARE PB up to SQUARE YB? At the very least SQUARE PB
> in datacenter settings seems useful.

It does not however imply that squared *B things have some general
needs. The situation is very different than that of Reiwa. U+32FF
wasn't added for commemorative purpose, but is a critical requirement
of some legacy systems that hardcoded those squared eras as time
control characters, lest they fall into a Y2019 problem without a new
era symbol (that's why Unicode announced the code point far before the
real era name came out). From what I heard, even the Japanese
government hadn't recognized the problem in the first place but MS or
somebody listened to their clients and choosed to encode it. So far I
don't believe there's a real need because I don't know a case that KB,
MB... are used outside representational purpose, but I may be wrong.

2019年9月27日(金) 14:17 Fred Brennan via Unicode :

>
> I'm sorry to write twice to the list but after some discussion on Twitter it
> is certain I'm going to write a request.
>
> I only have two lingering questions.
>
> * Does the existence of the legacy Adobe encoding Adobe-Japan1-6 shift the
> balance? It has a SQUARE TB at CID+8306.
>
> https://www.adobe.com/content/dam/acom/en/devnet/font/pdfs/5078.Adobe-Japan1-6.pdf
>
> * Should I also propose SQUARE PB up to SQUARE YB? At the very least SQUARE PB
> in datacenter settings seems useful.
>
> (I emailed Dr. Ken Lunde with these questions, but he's on vacation until the
> next meeting of the UTC. Does anyone know how much time I have before the
> agenda closes? When does the Script Ad Hoc meet next?)
>
>
>



Re: On the lack of a SQUARE TB glyph

2019-09-27 Thread Julian Bradfield via Unicode
On 2019-09-27, David Starner via Unicode  wrote:
> On Thu, Sep 26, 2019 at 8:57 PM Fred Brennan via Unicode
> wrote:
[snip]
>> There is no sequence of glyphs that could be logically mapped, unless you're
>> telling me to request that the sequence T  B be recommended for general
>> interchange as SQUARE TB? That's silly.
>
> Why is that silly? You've got an unbounded set of these; even the base
> prefixes EPTGMkhdmμnp (and da) crossed with bBmglWsAKNJCΩT (plus a
> bunch more), which is over 200 combinations without all the units, and
> there's some exponents encoded, so some of those will need to be
> encoded with exponents. And that's far from a complete list of what
> people might want as squares.

Wouldn't T  B 
be a better sequence? 
In fact, it would have been nice (expecially for mathematicians) if
all combining marks could have been applied to character sequences, by
means of some "high precedence ZWJ" that binds more tightly than
combination.
(Playing devil's advocate here, since I don't think maths is plain
text:)

Or one could allow IDS to have leaf components that are any
characters, not just ideographic characters, and then one could have
all sorts of fun.


Re: On the lack of a SQUARE TB glyph

2019-09-27 Thread David Starner via Unicode
On Thu, Sep 26, 2019 at 8:57 PM Fred Brennan via Unicode
 wrote:
> The purpose of Unicode is plaintext encoding, is it not? The square TB form is
> fundamentally no different than the square form of Reiwa, U+32FF ㋿, which was
> added in a hurry. The difference is that SQUARE TB's necessity and use is a
> slow thing which happened over years, not all of a sudden via one announcement
> of the Japanese government.

Defining whether a pair of characters gets squeezed into one square is
hardly a plaintext issue.

The square form of Reiwa is a bit different, given its use in printing
time, where there may have been an expectation that it takes up one
square. It's also a new member of a tiny set, as opposed to SQUARE TB,
which people have been using already in various ways.

> New emoji are still being encoded. The existence of SQUARE GB leads to its
> use, which then leads to people wanting SQUARE TB and resorting to hacks to
> get it done. If you didn't want people to request more square forms you
> shouldn't have encoded any at all. It's too late for that.

It's unlikely that not encoding wouldn't have stopped the requests
from coming, and it's not too late for them to dismiss those requests.

Unicode, in order to become the one character set, had to become
backward compatible with all the major legacy character sets out
there. Unicode has piles and piles of frustrating compromises because
of that, but it was felt that was the cost that had to be paid.

> There is no sequence of glyphs that could be logically mapped, unless you're
> telling me to request that the sequence T  B be recommended for general
> interchange as SQUARE TB? That's silly.

Why is that silly? You've got an unbounded set of these; even the base
prefixes EPTGMkhdmμnp (and da) crossed with bBmglWsAKNJCΩT (plus a
bunch more), which is over 200 combinations without all the units, and
there's some exponents encoded, so some of those will need to be
encoded with exponents. And that's far from a complete list of what
people might want as squares.

-- 
Kie ekzistas vivo, ekzistas espero.



Re: On the lack of a SQUARE TB glyph

2019-09-27 Thread James Kass via Unicode



On 2019-09-27 5:15 AM, Fred Brennan via Unicode wrote:

I only have two lingering questions.

* Does the existence of the legacy Adobe encoding Adobe-Japan1-6 shift the
balance? It has a SQUARE TB at CID+8306.

https://www.adobe.com/content/dam/acom/en/devnet/font/pdfs/5078.Adobe-Japan1-6.pdf


That character set also has other items not in Unicode such as numbers 
enclosed in squares from "0" and "00" through "100" and fractions like 
3/7 and 10/11.  It was published in 2008, so it might not be considered 
as "legacy".