ZWNJ in MS Access
Can somebody help me to find out more efficient way to input ZWNJ (zero width non-joiner) U+200C into MS Access 2003 fields with Persian text under Windows XP? MS Access 2003 is very convenient for developing dictionaries, each field can be adjusted for one of the scripts, but there is no menu item like Insert/Symbol table in MS Word, so you have to insert some Unicode characters otherwise. To input ZWNJ I tried to press combination Shift + Space using Persian Keyboard (FA), as I did it in MS Word 2003. No result. Then I tried Keyman with other key combinations. No result. Tried typing ZWNJ in MS Word document and inputting it through the Office Clipboard. No result. Playing with MS Access macro SendKeys and AutoCorrect options was no good either. So I assumed that there is a special rejection filter for ZWNJ built into MS Access 2003 and I tried to deceive it. It turned out that you can deceive the filter, if you embed ZWNJ into a string of other symbols like X + ZWNJ + Y. You can copy that string from a Word document to the Clipboard and enter it from the Clipboard into MS Access 2003 fields. Then you delete X and Y and can enjoy ZWNJ in your Database. A similar problem seems to be with the Manual Line Break and may be some other non-printable symbols, but ZWNJ is most urgent for my work (sometimes I have to use it hundreds times a day). May be I have overlooked a simple way to do it? Can anybody show me how to input ZWNJ in less moves? Thank you in advance, Vladimir, [EMAIL PROTECTED]
Re: ZWNJ in MS Access
11.07.2004 18:11 Bob Hallissy wrote: I don't have MS Access handy, but I do know that the standard Farsi keyboard layout supplied with Windows XP permits direct entry of ZWJ and ZWNJ using ctl-shift-1 and ctl-shift-2. This should work in any application that permits such characters. Bob Both your key combinations are working fine in MS Word 2003 (you can check it by cursor movement and/or by Alt+X), but none of them has any effect on Persian script in my MS Access 2003. Thank you for your attention, Vladimir Ivanov, [EMAIL PROTECTED]
Persian sorting with Delphi
My program written in Delphi 7 opens a Word document and then sorts the paragraphs according to a binary table. The binary table is based on a tailored version 3.1.1 of Allkeys Table (http://www.unicode.org/reports/tr10/#AllKeys). The tailoring was done (according to the suggestion of Mark Davis) to interchange all the instances of Arabic letter Waw with He. Such a sorting, where Waw is preceding He, is useful for languages like Kurdish, Persian, Dari, Tajik, Pashto, Urdu etc. The program does right sorting of alpha-numeric and Arabic wide (Unicode) strings, but there are some inconveniences in usage: 1. I could not find in Delphi documentation any function that allows reading from the Word Application the position of the Selection (beginning and end) to set the range of sorting. Now I have to sort all the paragraphs in the document. 2. I could not find the way to preserve formatting of the sorted strings. They are coming out in default (Normal) style losing other applied fonts and styles. May be somebody can give me a hint or refer to a source where such questions are discussed. Thank you, Vladimir Ivanov
Re: Web Form: Old Russian charcaters
Please, have a look at the characters in the range 0460 - 0486 in the font Arial Unicode MS shipped with Office XP. May be those are the characters you are looking for. Can anyone on the Unicode list help? Thanks, Magda -Original Message- Date/Time:Thu Jan 16 13:11:06 EST 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback I am looking for a way to use old Russian charcaters that are no longer used in modern Russian langauage. Can you help or provide a direction in which to look?
Re: What characters have baseline?
Philipp Reichmuth wrote: I don't think it's fixed 27.5° in handwritten script, it varies quite considerably, partly depending on how much text has to fit in the line in calligraphy. In ordinary handwriting, the angle easily reaches 45° or more. I couldn't find any reference in books about such an angle. But it is an opinion of a Persian calligraph. It has something to do with the Golden Section (90°*0.618/2). The closer to this angle, the more beautiful is your handwriting considered to be (see Golden Section in Art). Vladimir Ivanov Zub3nst3.png Description: Binary data
Re: What characters have baseline?
Marco Cimarosti wrote: BTW, I heard strange things about baselines, such as that the baseline for Indic fonts should be near the top of letters, while for good Arabic typography it should be oblique. Can someone confirm these notions? The baseline of Arabic (in general meaning) typographical fonts (Nasx) is horizontal and must be aligned to the Latin-Cyrillic-Greek baseline. Beside that handwritten scripts like Nastaliq or Tahriri have an oblique baseline for each word. The angle of that secondary baseline is 27.5º, like this: Vladimir Ivanov attachment: clip_image002.jpg
Re: Private Use Agreements and Unapproved Characters
We should be encoding important stuff that will be a boost to endangered cultures and languages - to preserve those cultures, languages, and literatures; and to have a long-lasting importance. Rick McGowan How do you rescue a script that has a proposal but hasn't had any action in 5 years? David Starner Old Persian and Avestan are closely related ancient languages that usually go side by side. If a linguist refers to an Old Persian example, he must show its Avestan form or his work would be considered to be incomplete (see, for instance, Rastorgueva V.S., Edelman J.I. Etymological Dictionary of the Iranian Languages, vol. 1, ISBN 5-02-018124-2, Moscow, 2000, 1000 copies). The Avesta as a holy book began it's formation approximately 3000 years ago, when Prophet Zoroaster appeared to spread the wisdom of Ahura Mazda - God of the good. Mazdaism was the main religion of Ancient Persian Empire (558-330 BC.). In 330 BC. Alexander the Great burned Persepolis - its capital. Contemporary followers of Zoroaster (Parsis), who live in small communities in Iran, India and Pakistan, believe that in that fire the main copy of the Avesta was destroyed, which, according to the exaggerated accounts of antiquity, was said to number 25,000 parchment pages. Zoroastrianism was the main religion of the two following Persian Empires. The Avesta was passed on from generation to generation in oral-singing form, until in the 3rd century AD it was revived in written form. For that purpose a special Aramaic-based alphabet was invented. You can see those letters in Michael Everson's Proposal. The Avesta was then virtually destroyed in the 7th century AD by the Muslims in their victorious invasion. Most of the Zoroastrians were then compelled to accept the Qur'an (Koran), the sacred scriptures of Islam; many, however, fled to India for refuge and took with them what was left of their sacred writings. A few of the faithful remained behind in Persia and, although persecuted, they continued to practice their religion. These two groups, about 80,000 persons in India and 18,000 in Persia, were responsible for the preservation of the Avesta in its present form. (Microsoft® Encarta® Reference Library 2002. © 1993-2001 Microsoft Corporation. All rights reserved). What is the critical number of users to cause an encoding of a script? This January we organized a workshop on Persian. We were surprised to know that more than 100 students and teachers from all over Russia wanted to attend the classes of Old Persian and Avestan. A group from Ukraine and Moldova came to participate as well. We couldn't prepare a decent computer-written handout for the audience. Xeroxed pages of one century old books were used. Most of the time lecturers were working with a chalk at the blackboard. The Parsis community in Iran nowadays is considered to be a religious minority and therefore has its own representative in Parliament. They publish various books dedicated to the Avesta like Bahrami E. Dictionary of the Avesta in 4 volumes, Tehran, 1980, 3000 copies. English equivalents and Persian explanations are represented in typographical form, but Avestan words are hand-written in such books. Many ordinary members of the community say that they can neither read nor write in Avestan. To learn prayers they use printed books with Persian transliteration, which to my mind is inadequate for the purpose: only 3 of 14 Avestan vowels can be shown systematically in such a way. Does anyone happen to know, what technique is used in India and Pakistan in Parsis communities to avoid the difficulties of hand-writing? How do they learn their prayers? Is there any Avestan education there? Thank you, Vladimir Ivanov
Re: Keyboard Layouts for Office XP in WIndows 98
Peter Constable wrote: I'll revise my summary in relation to MS apps and Win9x/Me Unicode characters that can be input using Keyman 5Keyman 6 (when released) -- -- Word 2000limited by limited by Windows codepages Windows codepages Word 2000 w/ WordLinkall of BMP all (planes 0 - 16) other Office 2000 apps limited by limited by Windows codepages Windows codepages Word 2002limited by all (planes 0 - 16) Windows codepages Word 2002 w/ WordLinkall of BMP all (planes 0 - 16) Publisher 2002 all of BMP all (planes 0 - 16) other Office XP apps limited by limited by Windows codepages Windows codepages May I ask you to give some more explicit explanations concerning the following scripts? If I build an Avestan font according to Michael Everson's specifications (Avestan characters should be in BMP, see http://www.evertype.com/standards/iso10646/pdf/avestan.pdf) and a Keyman keyboard layout, would I be able to type Avestan texts in: Word 2002, Publisher 2002, Access 2002 under Windows 2000/XP? Is it necessary to add some special characters to Keyman keyboard to tell Uniscribe that Avestan is a right-to-left script? Or should this be done through the VOLT (Avestan font needs some right-to-left kerning)? Generally, how can an application know that a certain range of BMP belongs to a right-to-left script? (BTW it seems to me that in Table XXX page 1 of the Avestan proposal Column xx4 should be shifted 1 cell downward to meet the description on page 2). Should we wait for Keyman 6 to type Old Persian in the same applications because this script is in Plane 1? Thank you in advance, Vladimir Ivanov
Re: Keyboard Layouts for Office XP in WIndows 98
Tom Gewecke wrote: One way to possibly type Old Persian (not yet in Unicode but incorporated into at least one font in the Plane 15 PUA) is with the vitual keyboard at http://home.att.net/~jameskass/screenkeyindex.htm It may require Opera 6 and Win2k or XP to work right, I'm not sure. Worth a try perhaps. Thank very much for the link. I will try it.
Re: Keyboard Layouts for Office XP in WIndows 98
Stefan Persson wrote: Where in the UCS do you find the Old Persian characters? I'm referring to Michael Everson's Proposal to encode Old Persian Cuneiform in Plane 1 of ISO/IEC 10646-2 Anyways, if you can't use the code point for a Plane 1 character, simply use the code points for the surrogate pair. Thank you for the advice. I'll try to experiment with it, though it will take some time. Vladimir Ivanov
Re: Month names (was: Re: Standard Conventions and euro)
Patrick Andries wrote: I believe Russian also gives numeral names to most week days (but Eastern European languages are not my forte). (*) E.g. Monday = segunda-feira. 3 days of the week are related to numerals in Russian: $B'S'd'`'b'_'Z'\(B 'the second one' Tuesday $B'i'V'd'S'V'b'T(B 'the forth one' Thursday $B'a'q'd'_'Z'h'Q(B 'the fifth one' Friday Vladimir Ivanov
Re: Initials
Title: הודעה Many thanks to all of you for discussion of this topic. It gave some planned results, like better understanding of syntax of the initials in English. It also gave some unexpected but very valuable and interesting results, like the way the initials are read in Herbrew, or that people in Denmark avoid writing initials at all. Special thanks to Stefan Probst, who asked: what the long threats about R(o|u)mania, Canada, California, Yankees, and Initials in various countries.. have to do with Unicode? Sceptical opinions induce to better formulation :). All these cases deal with transliteration. And transliteration as far as I could understand is one of the fields covered by Unicode. At least I saw here many times discussions on such issues like transliteration algorithm, difference between trancription and transliteration etc. The task of transliteration is to substitute the letters of one script by another. But the practical goal of script conversion is to help people who do not know such a script to find desired structures and patterns in foreign text in order to operate with them. References and Bibliography is a good example of this. If due to some cultural or orthographical reasons they get strange or unrecognizable syntax of the target pattern, the practical goal of transliteration wouldn't be achieved. Now, after this discussion is over, I can clearly see that initials can be one of such difficulties. But the question: to complicate or not to complicate the transliteration algorithm with syntax rules can be another topic of discussion. Thank you once more, Vladimir Ivanov
Initials
By this message I would like to begin a discussion (if it would be found appropriate here and hasn’t been done earlier) about the rules of writing initials in various scripts. To make it clearer I’ll try to describe our everyday problems for Russian names and their transliterations in References and Bibliographies. I would be very grateful if someone could answer me then any of the following questions: 1) What are the rules for writing initials in your own language? 2) Are there any differences in rules for writing initials of foreign and domestic names? 3) Do you use different types of transliteration in different types of documents? 4) Does your grammar prohibit Russian-like rules for writing initials (i.e. may we apply our rules for initials when we write articles in your language)? 5) Is the problem of line breaking and “soft space” (as described below) relevant for your language? Definition for Russian initials: An initial is the first letter of the first name with a period after it like “A.” in “A.Sokolov”. Initials are: the first letter of the first name and the first letter of the patronymic name with a period after each of them like “A.V.” in “A.V.Sokolov”. (Letter N will be used to show format like N.N.Nn…n). These rules are applied not only to Cyrillic, but to other transliteration scripts too. Variants like N. N. Nn…n, N.N. Nn…n with spaces between initials and the last name can be found. But nowadays spaceless form N.N.Nn…n is used more often. Thus we get to N.[ ][N.[ ]]Nn…n. Some people with rare names like Vsevolod prefer two or more letters of their names to be shown in order not to be mixed up with frequent names like Vladimir, e.g. Vs.Safronov. It gives us N[n].[ ][N[n].[ ]]Nn…n. In Bibliographies the books are sorted by last names, so there we can see Nn…n[,] N[n].[ ][N[n].[ ]] with optional commas after the last name. It is not allowed to break the line after the first initial like this: N. N.Nn…n. But it is common to break the line after the second one like this: N.N. Nn…n. The last case can be simulated as N[n].[ ][N[n].[ |CR]]Nn…n. In such cases hyphens adjacent to CR are not allowed. Foreign names are usually initialed in the same way. Tomas Alva Edison becomes T.A.Edison. The exception is letter “J” because it represents two phonemes from the point of view of Russian phonetics. John Reed is shortened to Дж.Рид in Cyrillic script. I suspect that something else is meant by initials in English. My Webster's dictionarysays that initial is: a) the first letter of a name; b) pl: the first letter of each word in a full name. Nothing is mentioned about spaces and periods, no examples are provided. During registration some applications ask the user to enter initials. In this field usually “N.N.” is not accepted neither in Cyrillic, nor in Latin (I have not tried Arabic). It is difficult to guess what format is expected. Are there English standards for initials? Are there international rules for that? Format “NN” in applications is accepted instead, but that’s a mistake from the point of view of Russian rules. In diplomatic and consular affairs Russian names must be transliterated in a French-oriented way, otherwise they are English transliterated. For instance, the name “Ч.Айтматов” can look like “Tch.Aytmatov” in foreign passport, which will be normally equal to “Ch.Aytmatov” in Bibliography. It is convenient to put after Russian initials some zero width control character that can be called “Soft Space”. It must act like Soft Hyphen (i.e. not effective in the middle of a line, but allowing line breaking), and must be invisible at the edge of the line. Is there such a character in Unicode? Can it be found in text processors? Now we have to do the required line breaking manually. It is not wise to overload hyphenation rules with it, because language-independent direct control of this feature for all scripts is needed. Thank you in advance, Vladimir Ivanov
Re: spoof buddies
$B$m!;!;!;!;(B $B$m!;!;!;(B scripsit: Do there exist official Russian names of Unicode characters? They could probably be constructed with slightly less difficulty than English names. If these Russian names do not exist, maybe you can make some up! I know there are French names! I have not heard about official (i.e. approved by Unicode Consortium) Russian names of Unicode Characters. Of course, they could be constructed. But that implies that such names must be constructed for every official language in the world. It is a very huge task. My main idea, however, was that all versions of applications should be able to show English official names as reference. English for Unicode is like Latin for Medicine. Thank you for feedback, Vladimir Ivanov
Re: GRAPHEME JOINER vs. double diacritics
Eric Muller wrote: Is it correct that the sequences U+x U+0360 U+y and U+x U+034F U+y U+0303 should display the same? Would it be worth putting some words about those situations in section 13.2 of PDUTR #28? In what font can we find U+034F? In Arial Unicode MS just after U+0345 goes U+0360. Can we have the full path to section 13.2 of PDUTR #28? Thank you, Vladimir Ivanov
Re: An Azeri disk
Michael Everson wrote: I've got a small clay disc in Arabic script from Azerbaijan. I can't read it. I've tried to analyze the script, and if you want to help me puzzle it out, see http://www.evertype.com/standards/Arab/disc.html There are things on it which don't look like things I've seen in Unicode anyway. -- Michael Everson *** Everson Typography *** http://www.evertype.com The word on disk reads Muhammad. See Arabic Presentation Form U+FDF4. There are some additional ornamental elements as well. Sincerely, Vladimir Ivanov
Re: sample text ok?
Third form is a combination of the fourth form (from right to left) and Kashida U+0640. Vladimir Ivanov, [EMAIL PROTECTED] At 14:27 10/10/2001, Roozbeh Pournader wrote: No, it's not a fifth form. It's just the initial form. Since Heh is too similiar to the Arabic digit five, when refering to the letter, an initial form is usually used. The glyph in at U+0647 in Unicode charts is just that... I know it is based on the initial form, but I have seen a number of typefaces in which it is designed as a separate, variant glyph, the usual characteristic of which is that the left connecting stroke of the initial form is truncated. I've attached a small graphic showing the five forms. John Hudson 5heh.gif Description: Binary data
Re: microsoft font link
Mike Lischke wrote: I have linked to that page from my Unicode site www.delphi-unicode.net. 1) It seems to me that Persian transcription of Unicode (in the very first running string of this site) is written wrong. It should be either يونی کُد or يونيکُد . Roozbeh Pournader can say it for sure. 2) Does this library support calls to Uniscribe? Thank you, Vladimir Ivanov, [EMAIL PROTECTED]
Re: Unicode library (was: microsoft font link)
Roozbeh Pournader wrote: I don't know how hard that may be, but have you considered Pango http://www.pango.org/? Thank you, the project is worth reading. Vladimir Ivanov, [EMAIL PROTECTED]
How strong is rule 1?
According to the document http://anubis.dkuug.dk/JTC1/SC2/WG2/docs/N1502.doc ( III. Procedure for Encoding New Characters and Scripts SC 2/WG 2 Evaluation Procedure) Rule 1. Says: Do not encode a) If the proposed character is a (shape or other) variation of a character already encoded in ISO/IEC 10646 ... , or b) If the proposed character is a presentation form (glyph), variant, or ligature. Butthere isa whole block of Arabic presentation forms-A FB50-FDFF and Arabic presentation forms-B FE70-FEFF with glyphs and ligatures (contradiction to item b). Besides,there are letters which are variants of each other like Pashto letter Gaf 06AB (Arabic Letter Kaf with Ring) and 06AF (Arabic Letter Gaf). You can see in Pashto text either the first one (rather seldom) or the second one (normally), but not both at the same time (xoring attitude). That contradicts item a). Was such encoding done due to some historical reasons in the past? Could there be exceptions in the future in similar cases? Thank you, Vladimir Ivanov, [EMAIL PROTECTED]
Arabic based alphabets
Vladimir Ivanov wrote: Letter "dze" is represented in Unicode by U+0681 "Arabic letter heh with hamza above",though the sign above heh is not exactly hamza. John Hudson wrote: Does Pashto also make use of the regular hamza sign in other contexts? Yes, for instance in Arabic loan words like "shu'Un". Can you, or anyone, provide a visual example of this form? A picture of letter "dze" from Pashto-Russian dictionary is included in this message. Sicerely, Vladimir Ivanov, [EMAIL PROTECTED] Dze.tif
Russian Unicode Convertion
Vadim Snurnikov wrote: How can I read a text in Unicode (Russian) where every Russian letter is represented like that: D=B6 (or similar)? (The e-mail got transferred to this format.) What kind of software is used to get E-mail? I recommend Outlook Express 5.0 and above. It allows you to get Cyrillic messages both in Windows 1251 and/or Unicode. You'd better have Office 2000 on your machine. It installs correct set of default fonts to work with Cyrillic letters. Is there a tool to transfer this back into 2-byte-encoding or to any other readable form? I heard that they've invented something at Russian Sofware Club www.rusc.ru but you'd better put such a question directly to their Conference. Sincerely, Vladimir Ivanov, [EMAIL PROTECTED]
Arabic based alphabets
Marco Cimarosti wrote: Pournander and N.R. Liwal can help us: how is the "Arabic alphabet" called in Farsi, Urdu, and Pashtun?Roozbeh Pournader wrote: Persian speakers call it "alefbaa-ye faarsi". The same set of letters in Tajikistanis officially called "alifbo-i niyokon" - alphabet of ancestors. The same set of letters in Afghanistan is officially called "alefbA-ye dari" - Dari alphabet. In some textbooks and dictionaries 2 additional variants for so called majhul vowels in Dari can be found: vAv-e majhul U+FBD9 (long /o/) and yA-ye majhul U+FBE4 (long /e/), but they are rather rare. So you can consider all 3 alphabets identical. N.R.Liwal wrote: In Pashto they call it "Pashto Alefbe" having 15 Extra Characters. My Pashto informants call it "dI paxto alifbe", saying it has 10 extra letters. Letter "dze" is represented in Unicode by U+0681 "Arabic letter heh with hamza above", though the sign above heh is not exactly hamza. It is a zigzag-like sign of the same height as hamza, but they are well distinguished. My informants could not recall any special name for it. If you use "heh with hamza above", people usually accept it as a substitute, saying that "computer is not able to build a real Pashto letter" (?!). I could not find such a letter in Unicode. I would be glad to hear some comments on it. Sicerely, Vladimir Ivanov
Farsi keyboard
Roozbeh Pournader Wrote: We have just finished fixing Windows 2000's Farsi keyboard... The fixed version solves many of the annoying problems of Windows 2000keyboard:1. It has Persian digits instead of Latin ones. Persian digits is no problem in Windows 2000 if you work with Word 2000 and/or XP. We use Tools/Options/Right-to-Left/Numeral: Context. It switches digits automatically to Persian when you type in Persian context. What was really annoying for us - 2 different final "Ya"s: One without dots on "D", and the other one with unnecessary dots on Shift+X. Does the new version fix it? Sincerely, Vladimir Ivanov
Persian letters Kaf and Heh with Yeh above
Persian letter Kaf 06A9 (entered from Persian keyboard in Word 2000 or Word XP under Windows 2000 or Whistler Beta 2 Build 2462) seems to be handled like a delimiter. If you try to select the whole word KETA:B (see Attachment, Line 1) by double clicking, only last three letters could be checked. It may also be the reason for incorrect dictionary sorting of such words. In Windows 2000 there was no built in method to enter Heh with Yeh above 06C0 in Persian texts. Now, thanks to Whistler, you can do that by pressing Shift+G. Unfortunately: 1) It substituted the method for entering diacritic Sukun 0652, so that you can no longer enter it from Persian keyboard. It is a minor inconvenience because Sukun is used in Persian rather rarely to eliminate ambiguity and it can be entered with a self-made shortcut key. 2) Times New Roman font has no such symbol. Entering Heh with Yeh above 06C0 automatically switches the font to Tahoma. So,if you don't pay any attention, the rest of your text will be in Tahoma (see Lines 2 and 3). 3) Heh with Yeh above 06C0 is not linked to the preceding letter (see Line 3). Thank you, Vladimir Ivanov, [EMAIL PROTECTED] KafHamza.zip
Re: Using hex numbers considered a geek attitude
On Thu, 3 May 2001, James Kass wrote: So, tried using Windows 2000 character map feature. It's possible to select a single character this way, but for some reason a CR-LF is added when copying with Control-V. (This is true even with displayable characters or character strings, unfortunately.) My colleagues do that in the same way. They will put a ZWNJ in the cllipboard, and then use Shift-Insert for putting it in the text. The main problem is they will lose that as soon as they want to do cut and paste, so they should put it in the buffer again! I do it much easier. In Word XP try the following: Insert/Symbol/Special Characters Choose No-Width Optional Break Then press the button Shortcut key and associate with it whatever combination you like. I use Alt+J because it is easy to remember (it contains J for joiner). I used a similar thing in Word 2000, but I do not remember exact terms. Vladimir Ivanov, [EMAIL PROTECTED]
Old Persian
In James Kass' Plane One font Code2001.ttf (built according to Unicode 3.1 standard and given for testing at http://home.att.net/~jameskass/code2001.htm) one can find surrogates for Old Persian Cuneiform signs. Surrogate U+D800DF90, that must represent sound "di", has 3 short horizontal and 3 long vertical wedges. According to Roland G.Kent's "Old Persian", American Oriental Society, New Haven, 1953, p.12 (a scanned picture of the table is included in this message) it must have only 2 verticals (see column 2). Can anyone give me the reference for the cuneiform signs in the range of U+D800DFC0 - U+D800DFDE? In the same package with Code2001.ttf I found a text file "Plane1.txt" with a fragment of an Old Persian citation. As far as I understand Old Persian, character #5 in the 2nd word (or char #11 from the beginning of the fragment) is "ji" U+D800DF8A and it must be changed to "va" U+D800DF9E in order to give "Darayavaush" - the name of the Persian King - Darius. Sincerely, Vladimir Ivanov, [EMAIL PROTECTED] OldPersianCuniform.tif