ZWNJ in MS Access

2004-07-11 Thread Vladimir Ivanov





Can 
somebody help me to find out more efficient way to input ZWNJ (zero width 
non-joiner) U+200C into MS Access 2003 fields with Persian text under Windows 
XP?
MS 
Access 2003 is very convenient for developing dictionaries, each field can be 
adjusted for one of the scripts, but there is no menu item like Insert/Symbol 
table in MS Word, so you have to insert some Unicode characters 
otherwise.
To 
input ZWNJ I tried to press combination Shift + Space using Persian Keyboard 
(FA), as I did it in MS Word 2003. No 
result.
Then 
I tried Keyman with other key combinations. No 
result.
Tried 
typing ZWNJ in MS Word document and inputting it through the Office Clipboard. 
No result.
Playing with MS Access macro SendKeys and AutoCorrect 
options was no good either.
So I 
assumed that there is a special rejection filter for ZWNJ built into MS Access 
2003 and I tried to deceive it. It turned out that you can deceive the filter, 
if you embed ZWNJ into a string of other symbols like X + ZWNJ + Y. You can copy 
that string from a Word document to the Clipboard and enter it from the 
Clipboard into MS Access 2003 fields. Then you delete X and Y and can enjoy ZWNJ 
in your Database.
A 
similar problem seems to be with the Manual Line Break and may be some other 
non-printable symbols, but ZWNJ is most urgent for my work (sometimes I have to 
use it hundreds times a day).
May 
be I have overlooked a simple way to do it? Can anybody show me how to input 
ZWNJ in less moves?
Thank 
you in advance,
Vladimir,
[EMAIL PROTECTED]


Re: ZWNJ in MS Access

2004-07-11 Thread Vladimir Ivanov



11.07.2004 18:11 Bob Hallissy wrote:
I don't have MS Access handy, but I do 
know that the standard Farsi keyboard layout supplied with Windows XP permits 
direct entry of ZWJ and ZWNJ using ctl-shift-1 and ctl-shift-2. This should work 
in any application that permits such characters. Bob

Both your key combinations are working fine in MS Word 2003 
(you can check it by cursor movement and/or by Alt+X), but none of them has any 
effect on Persian script in my MS Access 2003.

Thank you for your attention,
Vladimir Ivanov,
[EMAIL PROTECTED]


Persian sorting with Delphi

2003-02-20 Thread Vladimir Ivanov




My 
program written in Delphi 7 opens a Word document and then sorts the 
paragraphs according to a binary table. The binary table is based on a tailored 
version 3.1.1 of Allkeys Table (http://www.unicode.org/reports/tr10/#AllKeys). 
The tailoring was done (according to the suggestion of Mark Davis) to 
interchange all the instances of Arabic letter Waw with He. Such a sorting, 
where Waw is preceding He, is useful for languages like Kurdish, Persian, Dari, 
Tajik, Pashto, Urdu etc. The program does right sorting of alpha-numeric and 
Arabic wide (Unicode) strings, but there are some inconveniences in 
usage:
1. I 
could not find in Delphi documentation any function that allows reading 
from the Word Application the position of the Selection (beginning and end) to 
set the range of sorting. Now I have to sort all the paragraphs in the 
document.
2. I 
could not find the way to preserve formatting of the sorted strings. They are 
coming out in default (Normal) style losing other applied fonts and 
styles.
May 
be somebody can give me a hint or refer to a source where such questions are 
discussed.

Thank 
you,
Vladimir 
Ivanov


Re: Web Form: Old Russian charcaters

2003-02-07 Thread Vladimir Ivanov
Please, have a look at the characters in the range 0460 - 0486 in the font
Arial Unicode MS shipped with Office XP. May be those are the characters you
are looking for.

 Can anyone on the Unicode list help?
 Thanks,
 Magda


  -Original Message-
 
  Date/Time:Thu Jan 16 13:11:06 EST 2003
  Contact:  [EMAIL PROTECTED]
  Report Type:  Other Question, Problem, or Feedback
 
  I am looking for a way to use old Russian charcaters that are
  no longer used in modern Russian langauage.
 
  Can you help or provide a direction in which to look?





Re: What characters have baseline?

2002-04-19 Thread Vladimir Ivanov

Philipp Reichmuth wrote:

 I don't think it's fixed 27.5° in handwritten script, it varies quite
 considerably, partly depending on how much text has to fit in the line
 in calligraphy. In ordinary handwriting, the angle easily reaches 45°
 or more.

I couldn't find any reference in books about such an angle. But it is an
opinion of a Persian calligraph. It has something to do with the Golden
Section (90°*0.618/2). The closer to this angle, the more beautiful is your
handwriting considered to be (see Golden Section in Art).

Vladimir Ivanov




Zub3nst3.png
Description: Binary data


Re: What characters have baseline?

2002-04-17 Thread Vladimir Ivanov

Marco Cimarosti wrote:


 BTW, I heard strange things about baselines, such as that the baseline for
 Indic fonts should be near the top of letters, while for good Arabic
 typography it should be oblique. Can someone confirm these notions?



The baseline of Arabic (in general meaning) typographical fonts (Nasx) is
horizontal and must be aligned to the Latin-Cyrillic-Greek baseline.

Beside that handwritten scripts like Nastaliq or Tahriri have an oblique
baseline for each word. The angle of that secondary baseline is 27.5º, like
this:





Vladimir Ivanov


attachment: clip_image002.jpg


Re: Private Use Agreements and Unapproved Characters

2002-03-17 Thread Vladimir Ivanov

We should be encoding important stuff that will be a boost to endangered
cultures and languages - to preserve those cultures, languages, and
literatures; and to have a long-lasting importance.

Rick McGowan



How do you rescue a script that has a proposal but hasn't had any action in
5 years?

David Starner



Old Persian and Avestan are closely related ancient languages that usually
go side by side. If a linguist refers to an Old Persian example, he must
show its Avestan form or his work would be considered to be incomplete (see,
for instance, Rastorgueva V.S., Edelman J.I. Etymological Dictionary of the
Iranian Languages, vol. 1, ISBN 5-02-018124-2, Moscow, 2000, 1000 copies).

The Avesta as a holy book began it's formation approximately 3000 years ago,
when Prophet Zoroaster appeared to spread the wisdom of Ahura Mazda - God of
the good. Mazdaism was the main religion of Ancient Persian Empire (558-330
BC.). In 330 BC. Alexander the Great burned Persepolis - its capital.
Contemporary followers of Zoroaster (Parsis), who live in small communities
in Iran, India and Pakistan, believe that in that fire the main copy of the
Avesta was destroyed, which, according to the exaggerated accounts of
antiquity, was said to number 25,000 parchment pages.

Zoroastrianism was the main religion of the two following Persian Empires.
The Avesta was passed on from generation to generation in oral-singing form,
until in the 3rd century AD it was revived in written form. For that purpose
a special Aramaic-based alphabet was invented. You can see those letters in
Michael Everson's Proposal.

The Avesta was then virtually destroyed in the 7th century AD by the
Muslims in their victorious invasion. Most of the Zoroastrians were then
compelled to accept the Qur'an (Koran), the sacred scriptures of Islam;
many, however, fled to India for refuge and took with them what was left of
their sacred writings. A few of the faithful remained behind in Persia and,
although persecuted, they continued to practice their religion. These two
groups, about 80,000 persons in India and 18,000 in Persia, were responsible
for the preservation of the Avesta in its present form. (Microsoft®
Encarta® Reference Library 2002. © 1993-2001 Microsoft Corporation. All
rights reserved).

What is the critical number of users to cause an encoding of a script? This
January we organized a workshop on Persian. We were surprised to know that
more than 100 students and teachers from all over Russia wanted to attend
the classes of Old Persian and Avestan. A group from Ukraine and Moldova
came to participate as well. We couldn't prepare a decent computer-written
handout for the audience. Xeroxed pages of one century old books were used.
Most of the time lecturers were working with a chalk at the blackboard.

The Parsis community in Iran nowadays is considered to be a religious
minority and therefore has its own representative in Parliament. They
publish various books dedicated to the Avesta like Bahrami E. Dictionary of
the Avesta in 4 volumes, Tehran, 1980, 3000 copies. English equivalents and
Persian explanations are represented in typographical form, but Avestan
words are hand-written in such books. Many ordinary members of the community
say that they can neither read nor write in Avestan. To learn prayers they
use
printed books with Persian transliteration, which to my mind is inadequate
for the purpose: only 3 of 14 Avestan vowels can be shown systematically in
such a way.

Does anyone happen to know, what technique is used in India and Pakistan in
Parsis communities to avoid the difficulties of hand-writing? How do they
learn their prayers? Is there any Avestan education there?



Thank you,

Vladimir Ivanov








Re: Keyboard Layouts for Office XP in WIndows 98

2002-03-10 Thread Vladimir Ivanov

Peter Constable wrote:

 I'll revise my summary in relation to MS apps and Win9x/Me

  Unicode characters that can be input using
  Keyman 5Keyman 6 (when released)
 --
--
 Word 2000limited by  limited by
  Windows codepages   Windows codepages

 Word 2000 w/ WordLinkall of BMP  all (planes 0 - 16)

 other Office 2000 apps   limited by  limited by
  Windows codepages   Windows codepages

 Word 2002limited by  all (planes 0 - 16)
  Windows codepages

 Word 2002 w/ WordLinkall of BMP  all (planes 0 - 16)

 Publisher 2002   all of BMP  all (planes 0 - 16)

 other Office XP apps limited by  limited by
  Windows codepages   Windows codepages

May I ask you to give some more explicit explanations concerning the
following scripts?
If I build an Avestan font according to Michael Everson's specifications
(Avestan characters should be in BMP, see
http://www.evertype.com/standards/iso10646/pdf/avestan.pdf) and a Keyman
keyboard layout, would I be able to type Avestan texts in:
Word 2002, Publisher 2002, Access 2002 under Windows 2000/XP?
Is it necessary to add some special characters to Keyman keyboard to tell
Uniscribe that Avestan is a
right-to-left script? Or should this be done through the VOLT (Avestan font
needs some right-to-left kerning)? Generally, how can an application know
that a certain range of BMP belongs to a right-to-left script?
(BTW it seems to me that in Table XXX page 1 of the Avestan proposal Column
xx4 should be shifted 1 cell downward
to meet the description on page 2).
Should we wait for Keyman 6 to type Old Persian in the same applications
because this script is in Plane 1?

Thank you in advance,
Vladimir Ivanov








Re: Keyboard Layouts for Office XP in WIndows 98

2002-03-10 Thread Vladimir Ivanov

Tom Gewecke wrote:

 One way to possibly type Old Persian (not yet in Unicode but incorporated
 into at least one font in the Plane 15 PUA) is with the vitual keyboard at

 http://home.att.net/~jameskass/screenkeyindex.htm

 It may require Opera 6 and Win2k or XP to work right, I'm not sure.  Worth
 a try perhaps.

Thank very much for the link. I will try it.





Re: Keyboard Layouts for Office XP in WIndows 98

2002-03-10 Thread Vladimir Ivanov

Stefan Persson wrote:

 Where in the UCS do you find the Old Persian characters?

I'm referring to Michael Everson's Proposal to encode Old Persian Cuneiform
in Plane 1 of ISO/IEC 10646-2

 Anyways, if you can't use the code point for a Plane 1 character, simply
use
 the code points for the surrogate pair.

Thank you for the advice. I'll try to experiment with it, though it will
take some time.

Vladimir Ivanov





Re: Month names (was: Re: Standard Conventions and euro)

2002-03-03 Thread Vladimir Ivanov
Patrick Andries wrote:

I believe Russian also gives numeral names to most week days
 (but Eastern European languages are not my forte).
 
 (*) E.g. Monday = segunda-feira.

3 days of the week are related to numerals in Russian:
$B'S'd'`'b'_'Z'\(B 'the second one' Tuesday
$B'i'V'd'S'V'b'T(B 'the forth one' Thursday
$B'a'q'd'_'Z'h'Q(B 'the fifth one' Friday

Vladimir Ivanov


Re: Initials

2002-02-28 Thread Vladimir Ivanov
Title: הודעה



Many thanks to all of you for 
discussion of this topic.
It gave some planned results, like 
better understanding of syntax of the initials in English.
It also gave some unexpected but very 
valuable and interesting results, like the way the initials are read in Herbrew, 
or that people in Denmark avoid writing initials at all.
Special thanks to Stefan Probst, who 
asked:
what the long threats about 
R(o|u)mania, Canada, California, Yankees, and Initials in various 
countries.. have to do with Unicode?
Sceptical opinions induce to better 
formulation :).
All these cases deal with 
transliteration. And transliteration as far as I could understand is one of the 
fields covered by Unicode. At least I saw here many times discussions on such 
issues like transliteration algorithm, difference between trancription and 
transliteration etc.
The task of transliteration is to 
substitute the letters of one script by another. But the practical goal of 
script conversion is to help people who do not know such a script to find 
desired structures and patterns in foreign text in order to operate with them. 
References and Bibliography is a good example of this.
If due to some cultural or 
orthographical reasons they get strange or unrecognizable syntax of the target 
pattern, the practical goal of transliteration wouldn't be achieved. Now, after 
this discussion is over, I can clearly see that initials can be one of such 
difficulties. But the question: to complicate or not to complicate the 
transliteration algorithm with syntax rules can be another topic of 
discussion.

Thank you once more,
Vladimir 
Ivanov


Initials

2002-02-23 Thread Vladimir Ivanov



By this message I 
would like to begin a discussion (if it would be found appropriate here and 
hasn’t been done earlier) about the rules of writing initials in various 
scripts. To make it clearer I’ll try to describe our everyday problems for 
Russian names and their transliterations in References and Bibliographies. I 
would be very grateful if someone could answer me then any of the following 
questions:
1) What are the 
rules for writing initials in your own language?
2) Are there any 
differences in rules for writing initials of foreign and domestic 
names?
3) Do you use 
different types of transliteration in different types of 
documents?
4) Does your 
grammar prohibit Russian-like rules for writing initials (i.e. may we apply our 
rules for initials when we write articles in your 
language)?
5) Is the problem 
of line breaking and “soft space” (as described below) relevant for your 
language?

Definition for 
Russian initials: An initial is the first letter of the first name with a period 
after it like “A.” in “A.Sokolov”. Initials are: the first letter of the first 
name and the first letter of the patronymic name with a period after each of 
them like “A.V.” in “A.V.Sokolov”. (Letter N will be used to show format like 
N.N.Nn…n). These rules are applied not only to Cyrillic, but to other 
transliteration scripts too.
Variants like N. 
N. Nn…n, N.N. Nn…n with spaces between initials and the last name can be found. 
But nowadays spaceless form N.N.Nn…n is used more often. Thus we get to N.[ 
][N.[ ]]Nn…n.
Some 
people with rare names like Vsevolod prefer two or more letters of their names 
to be shown in order not to be mixed up with frequent names like 
Vladimir, e.g. Vs.Safronov. It gives us 
N[n].[ ][N[n].[ ]]Nn…n.
In Bibliographies 
the books are sorted by last names, so there we can see Nn…n[,] N[n].[ ][N[n].[ 
]] with optional commas after the last name.
It is not allowed 
to break the line after the first initial like 
this:
N.
N.Nn…n.
But it is common 
to break the line after the second one like this:
N.N.
Nn…n.
The last case can 
be simulated as N[n].[ ][N[n].[ |CR]]Nn…n. In such cases hyphens adjacent to CR 
are not allowed.
Foreign 
names are usually initialed in the same way. Tomas Alva Edison becomes 
T.A.Edison. The exception is letter “J” because it represents two phonemes from 
the point of view of Russian phonetics. John Reed is shortened to Дж.Рид in 
Cyrillic script.
I suspect that something else is meant by initials in 
English. My Webster's dictionarysays that initial 
is:
a) the first letter of a 
name;
b) pl: the first letter of each word in a full 
name.
Nothing is mentioned about spaces and periods, no 
examples are provided.
During registration some applications ask the user to 
enter initials. In this field usually “N.N.” is not accepted neither in 
Cyrillic, nor in Latin (I have not tried Arabic). It is difficult to guess what 
format is expected. Are there English standards for initials? Are there 
international rules for that? Format “NN” in applications is accepted instead, 
but that’s a mistake from the point of view of Russian 
rules.
In diplomatic and 
consular affairs Russian names must be transliterated in a French-oriented way, 
otherwise they are English transliterated. For instance, the name “Ч.Айтматов” can look like 
“Tch.Aytmatov” in foreign passport, which will be normally equal to 
“Ch.Aytmatov” in Bibliography.
It is convenient to put after Russian initials some zero 
width control character that can be called “Soft Space”. It must act like Soft 
Hyphen (i.e. not effective in the middle of a line, but allowing line breaking), 
and must be invisible at the edge of the line. Is there such a character in 
Unicode? Can it be found in text processors? Now we have to do the required line 
breaking manually. It is not wise to overload hyphenation rules with it, because 
language-independent direct control of this feature for all scripts is 
needed.
Thank 
you in advance,
Vladimir 
Ivanov


Re: spoof buddies

2002-02-12 Thread Vladimir Ivanov
$B$m!;!;!;!;(B $B$m!;!;!;(B scripsit:

 Do there exist official Russian names of Unicode characters? They could
 probably be constructed with slightly less difficulty than English names.

 If these Russian names do not exist, maybe you can make some up! I know
 there are French names!

I have not heard about official (i.e. approved by Unicode Consortium)
Russian names of Unicode Characters. Of course, they could be constructed.
But that implies that such names must be constructed for every official
language in the world. It is a very huge task.
My main idea, however, was that all versions of applications should be able
to show English official names as reference. English for Unicode is like
Latin for Medicine.

Thank you for feedback, Vladimir Ivanov


Re: GRAPHEME JOINER vs. double diacritics

2002-01-04 Thread Vladimir Ivanov

Eric Muller wrote:

 Is it correct that the sequences U+x U+0360 U+y and U+x U+034F U+y
 U+0303 should display the same? Would it be worth putting some words
 about those situations in section 13.2 of PDUTR #28?

In what font can we find U+034F? In Arial Unicode MS just after U+0345 goes
U+0360.
Can we have the full path to section 13.2 of PDUTR #28?

Thank you,
Vladimir Ivanov





Re: An Azeri disk

2001-12-08 Thread Vladimir Ivanov

Michael Everson wrote:

 I've got a small clay disc in Arabic script from Azerbaijan. I can't
 read it. I've tried to analyze the script, and if you want to help me
 puzzle it out, see http://www.evertype.com/standards/Arab/disc.html

 There are things on it which don't look like things I've seen in
 Unicode anyway.
 --
 Michael Everson *** Everson Typography *** http://www.evertype.com

The word on disk reads Muhammad. See Arabic Presentation Form U+FDF4. There
are some additional ornamental elements as well.

Sincerely, Vladimir Ivanov






Re: sample text ok?

2001-10-11 Thread Vladimir Ivanov

Third form is a combination of the fourth form (from right to left) and
Kashida U+0640.

Vladimir Ivanov, [EMAIL PROTECTED]


 At 14:27 10/10/2001, Roozbeh Pournader wrote:

 No, it's not a fifth form. It's just the initial form. Since Heh is too
 similiar to the Arabic digit five, when refering to the letter, an
initial
 form is usually used. The glyph in at U+0647 in Unicode charts is just
 that...

 I know it is based on the initial form, but I have seen a number of
 typefaces in which it is designed as a separate, variant glyph, the usual
 characteristic of which is that the left connecting stroke of the initial
 form is truncated. I've attached a small graphic showing the five forms.

 John Hudson



5heh.gif
Description: Binary data


Re: microsoft font link

2001-08-05 Thread Vladimir Ivanov

Mike Lischke wrote:

I have linked to that page from my Unicode site www.delphi-unicode.net.

1) It seems to me that Persian transcription of Unicode (in the very first
running string of this site) is written wrong.
It should be either يونی کُد or يونيکُد . Roozbeh Pournader can say it 
for
sure.
2) Does this library support calls to Uniscribe?

Thank you,
Vladimir Ivanov, [EMAIL PROTECTED]





Re: Unicode library (was: microsoft font link)

2001-08-05 Thread Vladimir Ivanov

Roozbeh Pournader wrote:

 I don't know how hard that may be, but have you considered Pango
 http://www.pango.org/?

Thank you, the project is worth reading.

Vladimir Ivanov, [EMAIL PROTECTED]





How strong is rule 1?

2001-07-18 Thread Vladimir Ivanov



According to the document http://anubis.dkuug.dk/JTC1/SC2/WG2/docs/N1502.doc 
( 
III. Procedure for Encoding 
New Characters and Scripts
SC 
2/WG 2 Evaluation Procedure)

Rule 1. Says: Do not encode
a) If the proposed character is a (shape or other) variation 
of a character already encoded in ISO/IEC 10646 ... , or
b) If 
the proposed character is a presentation form (glyph), variant, or 
ligature.

Butthere isa whole block of Arabic 
presentation forms-A FB50-FDFF and Arabic presentation forms-B FE70-FEFF with 
glyphs and ligatures (contradiction to item b).
Besides,there are letters which are variants 
of each other like Pashto letter Gaf 06AB (Arabic Letter Kaf with Ring) and 06AF 
(Arabic Letter Gaf). You can see in Pashto text either the first one (rather 
seldom) or the second one (normally), but not both at the same time (xoring 
attitude). That contradicts item a).
Was such encoding done due to some historical 
reasons in the past?
Could there be exceptions in the future in similar 
cases?

Thank you,
Vladimir Ivanov, [EMAIL PROTECTED]


Arabic based alphabets

2001-06-16 Thread Vladimir Ivanov



Vladimir Ivanov wrote:

Letter "dze" is represented in Unicode by U+0681 "Arabic letter heh 
with hamza above",though the sign above heh is not exactly 
hamza.

John Hudson wrote:

 Does Pashto also make use of the regular hamza sign in other 
contexts?

Yes, for instance in Arabic loan words like 
"shu'Un".

 Can you, or anyone, provide a visual example of this form?

A picture of letter "dze" from Pashto-Russian 
dictionary is included in this message.

Sicerely,
Vladimir Ivanov, [EMAIL PROTECTED]
 Dze.tif


Russian Unicode Convertion

2001-06-12 Thread Vladimir Ivanov



Vadim Snurnikov wrote:

 How 
can I read a text in Unicode (Russian) where every Russian letter is 
represented like that: D=B6 (or similar)? (The e-mail got 
transferred to this format.)

What kind of software is used to get E-mail? 
I recommend Outlook Express 5.0 and above. It allows you to get Cyrillic 
messages both in Windows 1251 and/or Unicode. You'd better have Office 2000 on 
your machine. It installs correct set of default fonts to work with Cyrillic 
letters.

 Is 
there a tool to transfer this back into 2-byte-encoding or to any other 
readable form?

I heard 
that they've invented something at Russian Sofware Club www.rusc.ru but you'd better put such a question 
directly to their Conference.

Sincerely,
Vladimir Ivanov, [EMAIL PROTECTED]



Arabic based alphabets

2001-06-09 Thread Vladimir Ivanov



Marco Cimarosti wrote:
 Pournander and N.R. Liwal can help us: how is the "Arabic 
alphabet" called in Farsi, Urdu, and Pashtun?Roozbeh Pournader wrote:
Persian speakers call it "alefbaa-ye faarsi".

The same set of letters in Tajikistanis officially called "alifbo-i 
niyokon" - alphabet of ancestors.
The same set of letters in Afghanistan is officially called "alefbA-ye 
dari" - Dari alphabet.
In some textbooks and dictionaries 2 additional variants for so called 
majhul vowels in Dari can be found:
vAv-e majhul U+FBD9 (long /o/) and yA-ye majhul U+FBE4 (long /e/), but they 
are rather rare. So you can consider all 3 alphabets identical.

N.R.Liwal wrote:
In Pashto they call it "Pashto Alefbe" having 15 Extra 
Characters.
My Pashto informants call it "dI paxto alifbe", saying it has 10 extra 
letters.
Letter "dze" is represented in Unicode by U+0681 "Arabic letter heh with 
hamza above",
though the sign above heh is not exactly hamza. It is a zigzag-like sign of 
the same height as hamza, but they are well distinguished. My informants could 
not recall any special name for it.
If you use "heh with hamza above", people usually accept it as a 
substitute, saying that "computer is not able to build a real Pashto letter" 
(?!).
I could not find such a letter in Unicode. I would be glad to hear some 
comments on it.

Sicerely,
Vladimir Ivanov


Farsi keyboard

2001-06-09 Thread Vladimir Ivanov



Roozbeh Pournader Wrote:

We 
have just finished fixing Windows 2000's Farsi keyboard...
The 
fixed version solves many of the annoying problems of Windows 
2000keyboard:1. It has Persian digits instead of Latin 
ones.
Persian digits is no 
problem in Windows 2000 if you work with Word 2000 and/or 
XP.
We use Tools/Options/Right-to-Left/Numeral: 
Context.
It switches digits automatically to Persian 
when you type in Persian context.

What was really 
annoying for us - 2 different final "Ya"s:
One without dots on "D", and the other one 
with unnecessary dots on Shift+X.
Does the new version fix it?

Sincerely,
Vladimir Ivanov



Persian letters Kaf and Heh with Yeh above

2001-05-09 Thread Vladimir Ivanov



Persian letter Kaf 06A9 (entered from Persian 
keyboard in Word 2000 or Word XP under Windows 2000 or Whistler Beta 2 Build 
2462) seems to be handled like a delimiter. If you try to select the whole word 
KETA:B (see Attachment, Line 1) by double clicking, only last three letters 
could be checked. It may also be the reason for incorrect dictionary sorting of 
such words.
In Windows 2000 there was no built in method 
to enter Heh with Yeh above 06C0 in Persian texts. Now, thanks to Whistler, you 
can do that by pressing Shift+G. Unfortunately:
1) It substituted the method for entering 
diacritic Sukun 0652, so that you can no longer enter it from Persian keyboard. 
It is a minor inconvenience because Sukun is used in Persian rather rarely to 
eliminate ambiguity and it can be entered with a self-made shortcut 
key.
2) Times New Roman font has no such symbol. 
Entering Heh with Yeh above 06C0 automatically switches the font to Tahoma. 
So,if you don't pay any attention, the rest of your text will be in Tahoma 
(see Lines 2 and 3).
3) Heh with Yeh above 06C0 is not linked to 
the preceding letter (see Line 3).

Thank you, Vladimir Ivanov, [EMAIL PROTECTED]
 KafHamza.zip


Re: Using hex numbers considered a geek attitude

2001-05-04 Thread Vladimir Ivanov


 On Thu, 3 May 2001, James Kass wrote:

  So, tried using Windows 2000 character map
  feature.  It's possible to select a single character this way,
  but for some reason a CR-LF is added when copying with
  Control-V.  (This is true even with displayable characters
  or character strings, unfortunately.)

 My colleagues do that in the same way. They will put a ZWNJ in the
 cllipboard, and then use Shift-Insert for putting it in the text. The main
 problem is they will lose that as soon as they want to do cut and paste,
 so they should put it in the buffer again!

I do it much easier. In Word XP try the following:
Insert/Symbol/Special Characters
Choose No-Width Optional Break
Then press the button Shortcut key and associate with it whatever
combination you like.
I use Alt+J because it is easy to remember (it contains J for joiner).
I used a similar thing in Word 2000, but I do not remember exact terms.

Vladimir Ivanov, [EMAIL PROTECTED]





Old Persian

2001-04-02 Thread Vladimir Ivanov



In James Kass' Plane One font Code2001.ttf 
(built according to Unicode 3.1 standard and given for testing at http://home.att.net/~jameskass/code2001.htm) 
one can find surrogates for Old Persian Cuneiform signs.

Surrogate U+D800DF90, that must represent 
sound "di", has 3 short horizontal and 3 long vertical wedges.
According to Roland G.Kent's "Old Persian", 
American Oriental Society, New Haven, 1953, p.12 (a scanned picture of the table 
is included in this message) it must have only 2 verticals (see column 
2).

Can anyone give me the reference for the 
cuneiform signs in the range of U+D800DFC0 - U+D800DFDE?

In the same package with Code2001.ttf I found 
a text file "Plane1.txt" with a fragment of an Old Persian 
citation.
As far as I understand Old Persian, character 
#5 in the 2nd word (or char #11 from the beginning of the fragment) is "ji" 
U+D800DF8A and it must be changed to "va" U+D800DF9E in order to give 
"Darayavaush" - the name of the Persian King - Darius.

Sincerely, Vladimir Ivanov, [EMAIL PROTECTED]
 OldPersianCuniform.tif