>From: Roozbeh Pournader <[EMAIL PROTECTED]>
>Date: Sat, 15 Jun 2002 03:07:21 +0430 (IRST)
>
>On Fri, 14 Jun 2002 [EMAIL PROTECTED] wrote:
>
> > [...] Mr. Pournader had mentioned that he would summarize the discussion
> > points,
>
>Unfortunately I can't do that, since the discussions did not converge.


I am not sure what Roozbeh means by saying that the discussion did not 
converge. I think that I can converge it very conveniently as follows:

This discussion commenced when I queried why the glyph U+06C0 was rejected 
from the Persian IT standard. Two general objections were raised to its use: 
The first was that hamzeh changes its shape in Farsi composition; and the 
second was that U+06C0, as encoded in the Unicode standard, does not 
decompose correctly into its Farsi equivalents of <heh + hamzeh above>.

As regards the first objection, there are two points here: (1) Whether 
hamzeh changes its shape, and (2) whether that has any bearing on the 
discussion. I think that I have successfully argued that (1) hamzeh does not 
change its shape, and (2) that even if it did, that is irrelevant to the 
discussion. Hamzeh is just a shape. I has no intrinsic value or significance 
of its own. It assumes whatever significance we choose to give it by 
convention. Whether it changes its shape or not, that is irrelevant to the 
discussion of implementing <heh + hamzeh> in the IT standard.

As regards the second objection, that U+06C0 is not compatible with Farsi 
usage, that is a valid argument for not using this particular character in 
the Farsi standard, therefore it should not be used. That argument is 
settled. The question therefore is not whether we should use U+06C0 in the 
Farsi standard or not, but whether it is desirable to have an independent 
glyph in the Unicode standard with a unique code point value (call it 
U+XXXX) which DOES correctly decompose into its Farsi equivalents of <heh + 
hamzeh above>.

I believe that the answer to this question is a definitive yes, for exactly 
the same reasons that is desirable to have <alef + hamzeh above>, or <vav + 
hamzeh above> encoded as independent glyphs. Why is it that these shapes are 
encoded in Unicode as independent glyphs with unique code point values? Why 
don't they just enter them as two separate characters on the keyboard? There 
are at least two good reasons, as I have discussed before: (1) because of 
the difficulty of correctly representing them in font systems, and (2) 
because they are used so frequently that it is more efficient and economical 
to treat them as single characters which can be entered with single 
keystrokes rather than two. Exactly the same argument holds true for <heh + 
hamzeh above>. This glyph is used in Farsi so frequently (much more 
frequently than <alef + hamzeh> or <vav + hamzeh> for example) that it is 
desirable that it should be treated as a single character which can be 
entered by a single keystroke rather than two; and the same difficulty of 
representing them in font systems equally applies to this glyph. It is 
therefore equally desirable that it should be recognised in the Unicode 
charts as an independent glyph with its own unique code value U+XXXX.

The question therefore boils down to whether it is practical, and 
technically feasible, to encode such a glyph in the Unicode charts? I do not 
claim to be an expert on the Unicode standard, but as far as my 
understanding of the subject goes, there is no problem, therefore it should 
be implemented. As far as I know, the committee who drafted the Persian IT 
standard have never attempted to have it implemented in the Unicode 
standard. The question here is, Why have they not attempted to implement it, 
and whether they have any objections to its implementation? If they just 
forgot to do it at the time, or if it did not occur to them to do so, it is 
not too late to do so now. If they have some other objection to its 
implementation, I would like to know what that objection is. The 
recommendation in the Persian IT standard that the glyph can be entered by 
two keystrokes may be the easiest solution; but it is not it is neither the 
most logical, nor the most sensible, nor the most professional solution.

CONCLUSION
----------

I will here briefly summarise the main points of the discussion as follows:

1. It is desirable that the glyph < heh + hamzeh above> should be recognised 
in the Unicode standard as a single shape with a unique code point value so 
that it can treated as a single character both in fonts as well as on the 
Farsi keyboard.

2. The Unicode glyph U+06C0 is not suitable for that purpose because its 
decomposition in the Unicode standard is not compatible with Farsi usage, 
therefore it cannot be used.

3. It is therefore desirable that the implementation of a unique glyph in 
the Unicode charts which does decompose correctly into its Farsi equivalents 
of <heh + hamzeh above> be recommended to the Unicode consortium by the 
drafters of the Persian IT standard.

4. There does not appear to be any practical or technical difficulty in the 
way of its implementation, therefore it should be recommended to the Unicode 
consortium.

5. Do the committee who formulated the Persian IT standard have any 
objections to the implementation of this glyph in the Unicode standard? If 
so, what is their objection?

6. If they have no objection, do they intend to recommend it to the Unicode 
consortium?

I would like to obtain an official reply from the committee who drafted the 
IT standard to these questions.


_________________________________________________________________
MSN Photos is the easiest way to share and print your photos: 
http://photos.msn.com/support/worldwide.aspx

_______________________________________________
FarsiWeb mailing list
[EMAIL PROTECTED]
http://lists.sharif.edu/mailman/listinfo/farsiweb

Reply via email to