I wrote a parser to take the rules from section 6 (Line Breaking
Algorithm) of UAX14 and generate something much line the pairwise
chart given in section 7. I couldn't help but notice that my row
column for class WJ was all wrong; but after looking at the ruleset in
section 6, I wasn't
Raymond Mercier suggested...
http://wwwold.dkuug.dk/jtc1/sc2/wg2/docs/n2422.pdf
And these 6 Sogdian letters were accepted and do appear in Unicode 4.0.
http://www.gengo.l.u-tokyo.ac.jp/~hkum/pdf/SIE3.pdf
That documnet is apparently in some non-standard encoding and the French
accented
- Original Message -
From: John Cowan [EMAIL PROTECTED]
To: Miikka-Markus Alhonen [EMAIL PROTECTED]
Cc: [EMAIL PROTECTED]
Sent: Thursday, August 07, 2003 7:38 PM
Subject: Re: Colourful scripts
Miikka-Markus Alhonen scripsit:
Anyone interested in preparing an encoding proposal?
At 17:46 +0100 2003-08-08, [EMAIL PROTECTED] wrote:
I'm reasonably sure that this question reflects my own ignorance, rather
than some problem with the standard, but nonetheless, I am confused.
Read the text. Don't just read the code charts.
--
Michael Everson * * Everson Typography * *
Philippe,
Just look at musical notations where a upper horizontal parenthesis
is used to group some elements (sorry I don't know how you name
it exactly in English or Italian), despite there's a measure break
in the middle, which may span to the other musical line: you end
up with two parts
On Tuesday, August 05, 2003 1:52 AM, Kenneth Whistler [EMAIL PROTECTED] wrote:
Peter,
The carrier for a combining mark that is to display in isolation
without a base character is U+0020 SPACE. If you want to also
indicate the absence of a line break opportunity, then the
carrier is
On 09/08/2003 13:41, John Cowan wrote:
Peter Kirk scripsit:
The gap may not be large, but Philippe, John H and I have identified a
real gap. Why this antagonism against filling it?
What you have identified is a set of implementation defects, not problems
with the Unicode Standard. The
While creating a new version of the document you can consult in
http://www.flagspot.net/flags/bib_main.html , I noticed that IE5
managed to do something I found quite strange while trying to display
the HTML sequence «... Z#770;itni ...». Somehow, the font engine
managed to know that U+0302 is a
On 08/08/2003 17:27, Kenneth Whistler wrote:
Philippe continued:
On Saturday, August 09, 2003 12:49 AM, Michael Everson wrote:
At 14:22 -0700 2003-08-08, Kenneth Whistler wrote:
Philippe, you are tilting at windmills, here. There is no chance
that the UTC is going to consider
On 05/08/2003 16:59, Curtis Clark wrote:
on 2003-08-05 15:31 Peter Kirk wrote:
Thank you, Mark. This helps to clarify things, but still doesn't
explicitly answer my question of how to encode a sentence like In
this language the diacritic ^ may appear above the letters ..., but
instead of ^ I
Hi!
Some time ago there was discussion about whether there are scripts using colour
as a distinctive feature or not. I just came across the following pages:
http://www.alphabets-world.com/edo_color.html
http://www.library.cornell.edu/africana/Writing_Systems/Edo.html
Anyone interested in
Ken Whistler wrote on 08/06/2003 03:19:34 PM:
Again, why should not a, ring above, cgj, dot below be canonically
equivalent to a, dot below, cgj, ring above, when a, ring above,
dot below is canonically equivalent to a, dot below, ring above?
And I want a design answer, not a formal
On Monday, August 04, 2003 11:59 PM, Kenneth Whistler [EMAIL PROTECTED] wrote:
The function I think you have in mind is not isolated display of
a combining mark, but rather trying to find a mechanism for
getting around the conformance strictures of the standard, to
get a combining mark to
My knowledge of Aramaic script is a little scanty, but my understanding is
more or less the same as Peter's.
Which leads me to suggest that encoding Aramaic separately would be a bit
like encoding Old Akkadian (Cuneiform) separately from NeoAssyrian
(Cuneiform). Which would be a bit silly (and
On 08/08/2003 12:35, John Cowan wrote:
Peter Kirk scripsit:
What if there is a line break between the two characters joined by a
double width combining character?
That would be unbelievably atrocious typography. Double-width CCs are a
hack, but a useful hack. Creating a factitious
15 matches
Mail list logo