In UAX14, WJ class doesn't appear to have an explicit rule

2003-08-09 Thread David E. Hollingsworth
I wrote a parser to take the rules from section 6 (Line Breaking Algorithm) of UAX14 and generate something much line the pairwise chart given in section 7. I couldn't help but notice that my row column for class WJ was all wrong; but after looking at the ruleset in section 6, I wasn't

Re: Which ancestral links

2003-08-09 Thread Rick McGowan
Raymond Mercier suggested... http://wwwold.dkuug.dk/jtc1/sc2/wg2/docs/n2422.pdf And these 6 Sogdian letters were accepted and do appear in Unicode 4.0. http://www.gengo.l.u-tokyo.ac.jp/~hkum/pdf/SIE3.pdf That documnet is apparently in some non-standard encoding and the French accented

Pigpen/Masonic/Poundex

2003-08-09 Thread Chris Jacobs
- Original Message - From: John Cowan [EMAIL PROTECTED] To: Miikka-Markus Alhonen [EMAIL PROTECTED] Cc: [EMAIL PROTECTED] Sent: Thursday, August 07, 2003 7:38 PM Subject: Re: Colourful scripts Miikka-Markus Alhonen scripsit: Anyone interested in preparing an encoding proposal?

Re: Newbie Question - what are all those duplicated charactersFOR?

2003-08-09 Thread Michael Everson
At 17:46 +0100 2003-08-08, [EMAIL PROTECTED] wrote: I'm reasonably sure that this question reflects my own ignorance, rather than some problem with the standard, but nonetheless, I am confused. Read the text. Don't just read the code charts. -- Michael Everson * * Everson Typography * *

Re: Conflicting principles

2003-08-09 Thread Kenneth Whistler
Philippe, Just look at musical notations where a upper horizontal parenthesis is used to group some elements (sorry I don't know how you name it exactly in English or Italian), despite there's a measure break in the middle, which may span to the other musical line: you end up with two parts

Re: Questions on ZWNBS - for line initial holam plus alef

2003-08-09 Thread Philippe Verdy
On Tuesday, August 05, 2003 1:52 AM, Kenneth Whistler [EMAIL PROTECTED] wrote: Peter, The carrier for a combining mark that is to display in isolation without a base character is U+0020 SPACE. If you want to also indicate the absence of a line break opportunity, then the carrier is

Re: Questions on ZWNBS - for line initial holam plus alef

2003-08-09 Thread Peter Kirk
On 09/08/2003 13:41, John Cowan wrote: Peter Kirk scripsit: The gap may not be large, but Philippe, John H and I have identified a real gap. Why this antagonism against filling it? What you have identified is a set of implementation defects, not problems with the Unicode Standard. The

IE5 displaying U+005A U+0302

2003-08-09 Thread Anto'nio Martins-Tuva'lkin
While creating a new version of the document you can consult in http://www.flagspot.net/flags/bib_main.html , I noticed that IE5 managed to do something I found quite strange while trying to display the HTML sequence «... Z#770;itni ...». Somehow, the font engine managed to know that U+0302 is a

Re: Questions on ZWNBS - for line initial holam plus alef

2003-08-09 Thread Peter Kirk
On 08/08/2003 17:27, Kenneth Whistler wrote: Philippe continued: On Saturday, August 09, 2003 12:49 AM, Michael Everson wrote: At 14:22 -0700 2003-08-08, Kenneth Whistler wrote: Philippe, you are tilting at windmills, here. There is no chance that the UTC is going to consider

Re: Display of Isolated Nonspacing Marks (was Re: Questions on ZWNBS...)

2003-08-09 Thread Peter Kirk
On 05/08/2003 16:59, Curtis Clark wrote: on 2003-08-05 15:31 Peter Kirk wrote: Thank you, Mark. This helps to clarify things, but still doesn't explicitly answer my question of how to encode a sentence like In this language the diacritic ^ may appear above the letters ..., but instead of ^ I

Colourful scripts

2003-08-09 Thread Miikka-Markus Alhonen
Hi! Some time ago there was discussion about whether there are scripts using colour as a distinctive feature or not. I just came across the following pages: http://www.alphabets-world.com/edo_color.html http://www.library.cornell.edu/africana/Writing_Systems/Edo.html Anyone interested in

RE: Questions on ZWNBS - for line initial holam plus alef

2003-08-09 Thread Peter_Constable
Ken Whistler wrote on 08/06/2003 03:19:34 PM: Again, why should not a, ring above, cgj, dot below be canonically equivalent to a, dot below, cgj, ring above, when a, ring above, dot below is canonically equivalent to a, dot below, ring above? And I want a design answer, not a formal

Re: Questions on ZWNBS

2003-08-09 Thread Philippe Verdy
On Monday, August 04, 2003 11:59 PM, Kenneth Whistler [EMAIL PROTECTED] wrote: The function I think you have in mind is not isolated display of a combining mark, but rather trying to find a mechanism for getting around the conformance strictures of the standard, to get a combining mark to

Re: Colourful scripts and Aramaic

2003-08-09 Thread Karljürgen Feuerherm
My knowledge of Aramaic script is a little scanty, but my understanding is more or less the same as Peter's. Which leads me to suggest that encoding Aramaic separately would be a bit like encoding Old Akkadian (Cuneiform) separately from NeoAssyrian (Cuneiform). Which would be a bit silly (and

Re: Conflicting principles

2003-08-09 Thread Peter Kirk
On 08/08/2003 12:35, John Cowan wrote: Peter Kirk scripsit: What if there is a line break between the two characters joined by a double width combining character? That would be unbelievably atrocious typography. Double-width CCs are a hack, but a useful hack. Creating a factitious