For script aficionados:
N2372 Revised proposal for encoding the Tai Le script in the BMP of the UCS
http://www.dkuug.dk/jtc1/sc2/wg2/docs/n2372.pdf
N2378 Final proposal to encode Aegean scripts in the UCS
http://www.dkuug.dk/jtc1/sc2/wg2/docs/n2378.pdf
--
Michael Everson
* Michael Everson
|
| N2372 Revised proposal for encoding the Tai Le script in the BMP of the UCS
| http://www.dkuug.dk/jtc1/sc2/wg2/docs/n2372.pdf
This script is not the same as Tai Lue, it seems.
The proposal says:
Tai Nüa, Dehong Dai, Tai Mau, Tai Kong, and Chinese Shan are
At 1:42 PM +0100 10/8/01, Michael Everson wrote:
N2378 Final proposal to encode Aegean scripts in the UCS
http://www.dkuug.dk/jtc1/sc2/wg2/docs/n2378.pdf
I note that this includes a proposal for encoding the symbols from the PHAISTOS DISK.
I thought these had been rejected on the
At 11:21 -0400 2001-10-08, Elliotte Rusty Harold wrote:
At 1:42 PM +0100 10/8/01, Michael Everson wrote:
N2378 Final proposal to encode Aegean scripts in the UCS
http://www.dkuug.dk/jtc1/sc2/wg2/docs/n2378.pdf
No, no, no, that's a serious mistake. My file includes Linear A and
At 11:21 AM -0400 10/8/01, Elliotte Rusty Harold wrote:
I note that this includes a proposal for encoding the symbols from
the PHAISTOS DISK. I thought these had been rejected on the grounds
that they were idiosyncratic (i.e. didn't show up anywhere except
the PHAISTOS DISK). Has that changed?
At 09:44 -0600 2001-10-08, John H. Jenkins wrote:
No, it wasn't rejected. Its official status with UTC is not
approved. Basically, there was no strong reason to go either way
on it. It appears to be a real script (or a board game), and there
are people who want to be able to work with the
Hi Dennis,
Your problem is understandable, but isn't due to the properties of the
character. U+0020 (normal space) has the bidirectional category of
whitespace. If your edit control worked properly, the insertion of a
whitespace character would not, in and of itself, change the directionality
of
For Java, the support for supplementary characters is actually better than one might
think.
It is true that the char type and the Character class only support 16-bit code units.
However, storing UTF-16 strings in String objects and char[] arrays and passing code
points as int's in non-JDK
Nick, et al -- You mentioned:
> In Classical scholarship (and I suspect, beyond it), all
> four possible corner brackets are routinely used as punctuation
> to delimit text in some way ---
I saw your examples of these the other day in Greek text. The upper corners also occur widely. For
Nick Nicholas asked:
I'm wondering if I could get some clarification on the Unicode corner
brackets.
Unicode encodes the following sets:
2308, 2309, 230A, 230B: LEFT CEILING, RIGHT CEILING, LEFT FLOOR, RIGHT
FLOOR
231C, 231D, 231E, 231F: TOP LEFT CORNER, TOP RIGHT CORNER, BOTTOM LEFT
10 matches
Mail list logo