On 08/15/2002 06:41:59 AM "William Overington" wrote: >>In essence, though not formally, U+FFF9..U+FFFC are non-characters as >>well, and the Unicode "semantics" just tells what programs *may* find them >>useful for. Unicode 4.0 editors: it might be a good idea to emphasize >>the close relationship of this small repertoire with the non-characters. > >That is not what the specification says.
William, John knows what he is talking about, and is exactly correct: in essense, though not formally, FFF9..FFFC are non-characters. No, the Standard doesn't say that; that's why he said, "not formally". The use intended by the Standard is, however, exactly comparable to the non-characters at FDD0..FDEF. If they had been defined in the Standard as non-characters, the world would not be different in any meaningful way. >It appears to me that the use of the annotation characters in document >interchange is never forbidden and is strongly discouraged only where there >is no prior agreement between the sender and the receiver, and that that >strong discouragement is because the content may be misinterpreted >otherwise. So, if there is a prior agreement, then there is no problem >about using them in interchanged documents. > >There appears to be nothing that suggests that U+FFFC cannot be used in an >interchanged document. Well, you've missed the intent of the authors of the Standard, and appear not to grasp the mindset. When it says interchange of IA characters may be OK given prior agreement, what's really in mind is that e.g. I've written code library A that handles some aspects of interlinear annotation, you've written code library B that handles different aspects of interlinear annotation, and we agree on certain interfaces so that my library can call yours or vice versa, and agree that strings passed by those interfaces can contain IA characters. That's the kind of thing that's in mind. It does *not* imply that anyone should consider create a document containing IA characters. >I know little about Bliss symbols, though I have seen a few of them and have >read a brief introduction to them, yet it seems to me that annotating Bliss >symbols with English or Swedish is entirely within the specification >absolutely and would be no more than strongly discouraged even if there is >no prior agreement between the sender and the receiver. Of course the Standard doesn't discourage anyone from annotating Bliss symbols with English or Swedish; it only discourages the use of IA characters as markup in documents. >Further, it seems to me from the published rules that these annotation >characters could possibly be used to provide a footnote annotation facility >within a plain text file That would not be a proposal worth pursuing; in fact, I'd say it's a very bad idea. The reason you DO NOT want to use IA characters in a document is that you do not know what someone's software will do with them. The characters have always been intended for use by software programmers, not by content authors. (Ditto for the object replacement character.) >An interesting point for consideration is as to whether the following >sequence is permitted in interchanged documents... >It seems to me that if that is indeed permissible that it could potentially >be a useful facility. On the whole, it would be very unwise to use these characters in documents for reasons I explained above. If two people agree to do this, nobody's going to send the Unicode police to stop them. But very few of us on this list are particularly interested in what is hypothetically possible for some pair of us to do. We're far more interested in how widely-used implementations should and do work, and in such implementations, FFF9..FFFC are assumed not to be use in content. - Peter --------------------------------------------------------------------------- Peter Constable Non-Roman Script Initiative, SIL International 7500 W. Camp Wisdom Rd., Dallas, TX 75236, USA Tel: +1 972 708 7485 E-mail: <[EMAIL PROTECTED]>

