Re: Combining Marks and Variation Selectors

2020-02-01 Thread Roozbeh Pournader via Unicode
They are actually allowed on combining marks of ccc=0. We even define one such variation sequence for Myanmar, IIRC. On Sat, Feb 1, 2020, 2:12 PM Richard Wordingham via Unicode < unicode@unicode.org> wrote: > Why are variation selectors not allowed for combining marks? I can see > a reason for t

Re: Proposal to add Roman transliteration schemes to ISO 15924.

2019-12-02 Thread Roozbeh Pournader via Unicode
You don't need an ISO 15924 script code. You need to think in terms of BCP 47. Sanskrit in Latin would be sa-Latn. Now, if you want to distinguish the different transcription systems for writing Sanskrit in Latin, you can apply to registry a BCP 47 variant. There are also BCP 47 extension T, which

Re: Aquaφοβία

2017-12-11 Thread Roozbeh Pournader via Unicode
Jonathan, I've been trying to gather a list of the Arabic marks that actually happen in Hebrew for a while now, but don't have sources. I want to add them to ScriptExtensions data in Unicode. Do you know of a source that lists them? On Sat, Dec 9, 2017 at 12:56 PM, Jonathan Rosenne via Unicode <

Re: Emoji Variation Sequences: relaxing VS15/16

2016-08-22 Thread Roozbeh Pournader
I agree that this should be relaxed for VS15 and VS16. For example, the current draft version of UTR #51 even suggests that this be done for three sequences that can not get variation sequences defined until Unicode 10.0. Would you please write a proposal document or send a note through http://www

Re: Running text requirement?

2016-07-23 Thread Roozbeh Pournader
In my experience, no such requirement is a binary yes/no. If you have a good character candidate, run it by the list, or just write a proposal. UTC tends to look at all the merits together, instead of a list of things that should all be there or else there won't be a character. On Sat, Jul 23, 201

Re: Mongolian Joining Type

2015-08-05 Thread Roozbeh Pournader
These were the original proposals: http://www.unicode.org/L2/L2012/12202-shaping.txt http://www.unicode.org/L2/L2012/12360-mong-shaping.txt (with considerable UTC discussions). A good trick is going through the posted UTC minutes and searching for the topic you are interested in. Or just do Google

Re: Android 5.1 ships with support for several minority scripts

2015-03-14 Thread Roozbeh Pournader
Yes, the one and only ₾ (sent from my Android phone). On Mar 14, 2015 3:16 PM, "Doug Ewell" wrote: > Roozbeh Pournader wrote: > > (Android 5.0, released last year, had already added the Georgian lari, >> > > The one that's going to be officially standardiz

Re: Android 5.1 ships with support for several minority scripts

2015-03-14 Thread Roozbeh Pournader
On Sat, Mar 14, 2015 at 2:18 PM, Andrew Cunningham wrote: > Testing on Thai Tham will occur ... I was curious as to what the original > design parameters forvthe font was. It is easier to evaluate a fonts > language support knowing what was originally indended. > I unfortunately don't know that.

Re: Android 5.1 ships with support for several minority scripts

2015-03-14 Thread Roozbeh Pournader
; Akhar Srak . Western Cham script remains unsupported. > > Which languages was the Thai Tham font designed to support? And which > variety of the script? > > Andrew > > > On Saturday, 14 March 2015, Roozbeh Pournader wrote: > > Android 5.1, released earlier this week, ha

Re: Android 5.1 ships with support for several minority scripts

2015-03-14 Thread Roozbeh Pournader
ita > > > > On Fri, Mar 13, 2015 at 7:00 PM, Roozbeh Pournader > wrote: > >> Android 5.1 >> <http://officialandroid.blogspot.com/2015/03/android-51-unwrapping-new-lollipop.html>, >> released earlier this week, has added support for 25 minority scripts. The &

Android 5.1 ships with support for several minority scripts

2015-03-13 Thread Roozbeh Pournader
Android 5.1 , released earlier this week, has added support for 25 minority scripts. The wide coverage can be reproduced by almost everybody for free, thanks to the Noto and HarfBu

Re: Is there an IBM group mark symbol?

2015-01-30 Thread Roozbeh Pournader
There may be something like it in the math symbols sets, but if there's not, please feel free to submit a proposal. On Jan 30, 2015 8:59 AM, "Ken Shirriff" wrote: > I'm writing about the IBM 1401 and there's one character from its > character set that I couldn't find in Unicode: the group mark. T

Re: Western Cham in Akhar Jawi

2014-10-27 Thread Roozbeh Pournader
This is the first time I'm seeing the character. I suggest writing a Unicode proposal. On Oct 26, 2014 10:42 PM, "Andrew Cunningham" wrote: > Hi all, > > > When Western Cham is written in the Arabic script, there is regional > variation in the Arabic characters used. Two varieties I am looking at

Re: Christian Palestinian Aramaic

2014-10-09 Thread Roozbeh Pournader
Two things: 1. You should be able to get the behavior you want through 'calt' or 'dlig' OpenType features. You'd need to add more lookups to your font than a typical Syriac font, but that's to be expected, considering that you are working with slightly atypical material. 2. Based on the exact sit

Re: Proposals for Arabic honorifics

2014-10-09 Thread Roozbeh Pournader
Hi, I'm the author of L2/14-147. Nice to see you interested in the topic. It would be great to have more samples of the usage of these characters. Feel free to send the samples you can find to me or as a separate proposal document for the UTC, whichever way you prefer. On Oct 9, 2014 7:32 AM, "Na

Re: Transliterate Ugaritic aleph letters

2014-09-08 Thread Roozbeh Pournader
What Michael said, assuming the i loses its dot when the half-ring is displayed over it. If it keeps its dot, you should use instead. On Mon, Sep 8, 2014 at 10:33 AM, Michael Everson wrote: > On 8 Sep 2014, at 17:42, Miller Prosser wrote: > > > 10380 Ugaritic letter alpha > > 1039B Ugaritic le

Re: Parsers for the UnicodeSet notation?

2014-07-23 Thread Roozbeh Pournader
On Wed, Jul 23, 2014 at 3:23 PM, Eric Muller wrote: > I would like to work with the exemplarCharacters data in the CLDR. That > uses the UnicodeSet notation. Is there somewhere a parser for that > notation, that would return me just the list of characters in the set? Note that it's a set of str

Re: Noto adds CJK, plus new user-facing website

2014-07-15 Thread Roozbeh Pournader
d mapping). > > Let's just hope that OSes will support all these new scripts (Windows has > always been leaving users behind if they did not use the lastest version > whose linguistic support was frozen at least 2 years before the last > release, with few extensions with OS or Office ser

Re: Thai unalom symbol

2014-07-01 Thread Roozbeh Pournader
I think this is a very good candidate for encoding. I would recommend writing a proposal for UTC and including the discussion about potential location. On Tue, Jul 1, 2014 at 8:10 PM, James Clark wrote: > One of the most pervasive religious symbols in traditional Thailand > culture is the "unal

RE: Bidi reordering of soft hyphen

2014-04-01 Thread Roozbeh Pournader
Adding Behdad for his insight on the rendering stack. But as for user requirements and expectations, the first option, with the hyphen on the right side of "car" as "car-" is what a good publisher would want to print in his magazine or book. The second option is harder to decipher for an RTL read

Re: The Ruble sign has been approved

2013-12-11 Thread Roozbeh Pournader
Fortunately, we have it already encoded at U+0554 ;-) On Wed, Dec 11, 2013 at 7:51 AM, Leo Broukhis wrote: > The board of directors of the Central Bank of Russia has [finally] > approved the de facto standard ruble sign. > > http://lenta.ru/news/2013/12/11/symbol/ > > Leo >

Behdad Esfahbod won an O'Reilly Open Source Award!

2013-07-29 Thread Roozbeh Pournader
Some of you probably have heard the news already, but in case you haven't, Behdad won the prestigious O'Reilly Open Source Award, announced last Friday. Here's the announcement: http://www.oscon.com/oscon2013/public/schedule/detail/29956 Selected quotes: "The O’Reilly Open Source Awards recogniz

Re: Arabic quoting characters

2013-06-14 Thread Roozbeh Pournader
They are unified with the double angle quotation marks. Persian also uses the round version (and if if I remember correctly, Greek too). On Fri, Jun 14, 2013 at 10:45 AM, Michael Fayez wrote: > Hi all, > I noticed that double small parentheses that are used in professional > printing in Arabic p

Re: RLI and "bdi", and how to get an update of changes

2013-01-15 Thread Roozbeh Pournader
On Tue, Jan 15, 2013 at 4:38 PM, Jukka K. Korpela wrote: > Code points 2066, 2067, and 2068 are unassigned. > Not anymore. See the pipeline and the latest proposed update of UAX#9: http://www.unicode.org/alloc/Pipeline.html http://www.unicode.org/reports/tr9/proposed.html

Re: Compiling a list of Semitic transliteration characters

2012-09-05 Thread Roozbeh Pournader
On 2012/9/5 Bill Poser wrote: > It is also at least logically possible for there to be transliterations > from Semitic writing systems to non-Roman writing systems. I'm not aware of > such a thing, but one can imagine, for example, Russian work using a > Cyrillic-based transliteration. Even if su

Re: Unicode 6.2 to Support the Turkish Lira Sign

2012-05-29 Thread Roozbeh Pournader
2012/5/29 Doug Ewell > I was specifically, and only, referring to a character proposal—any > proposal—being dubbed "urgent" on the basis that a font hack has been > identified. > Just look what happened when the Japanese did their own font/character set hack. The backslash/yen problem is still w

Re: Cantilation Fathatan, dammatan and kasratan

2012-04-23 Thread Roozbeh Pournader
There are two different versions of these in the Unicode Standard. The normal ones are at U+064B, U+064C, and U+064D. The open ones (a new addition to the standard) are at U+08F0, U+08F1, and U+08F2. Roozbeh 2012/4/23 Escape Landsome > Hello all, > > I want to type some quranic verses, but the

Re: Detailed description of every Arabic character in Unicode

2012-02-28 Thread Roozbeh Pournader
For that, you need to find the proposals that led to the encoding of the character. It's sometimes an art to find the right document in WG2 and UTC archives. For the Arabic year sign, it seems that the public proposal itself is available publicly among the WG2 documents at: http://std.dkuug.dk/jtc

Re: Proposed new characters updated in Pipeline Table

2011-08-12 Thread Roozbeh Pournader
On 8/12/2011 6:41 PM, mmarx wrote: I attach a Garshuni document from Beit al-Qammar showing Arab vowel marks -- just as the Syriac communmties are using Syriac vowel marks in Arabic script text -- and (in the second line on the left) wasla above olaph. So whatever the status of wasla mark will b

Re: Proposed new characters updated in Pipeline Table

2011-08-12 Thread Roozbeh Pournader
On Fri, 2011-08-12 at 17:43 -0500, Bob Hallissy wrote: > For Arabic, I've been trying to keep this page up-to-date, and it may > help you: > > http://scriptsource.org/entry/hqr6rc9md5 And I try to keep a rather updated list of still-unproposed/undecided stuff at: http://goo.gl/tlfgx Roozbeh

Re: Rendering of U+0653 ARABIC MADDAH ABOVE

2011-07-21 Thread Roozbeh Pournader
On Wed, 2011-07-20 at 20:55 -0500, Bob Hallissy wrote: > And for further clarification: when does U+06E4 ARABIC SMALL HIGH MADDA > get used? Yes. In the Korans only. It tends to be rare, but it is used. Roozbeh

Re: Rendering of U+0653 ARABIC MADDAH ABOVE

2011-07-20 Thread Roozbeh Pournader
On Wed, 2011-07-20 at 14:42 -0500, Bob Hallissy wrote: > Wondering if anyone knows (or has an educated guess): > > When an Arabic combining character sequence includes both > * U+0653 ARABIC MADDAH ABOVE and > * an Arabic vowel mark above (e.g., U+064E ARABIC FATHA or maybe >

My take on the Unicode 6.0 release

2010-10-12 Thread Roozbeh Pournader
Here is the tailored announcement I wrote for the Persian computing community: http://www.advogato.org/person/roozbeh/diary/163.html Roozbeh

Re: Pashto yeh characters

2010-10-01 Thread Roozbeh Pournader
This is a rather late reply, but I think this document should be useful: http://www.evertype.com/standards/af/af-locales.pdf The first few pages discuss and recommend various Yeh forms to be used, and a recommendation for avoiding some in certain forms. Roozbeh On Thu, 2010-07-22 at 12:17 -0500

Re: Pashto yeh characters

2010-10-01 Thread Roozbeh Pournader
On Thu, 2010-07-29 at 09:56 +0200, André Szabolcs Szelp wrote: > I seem to remember, that Persian used kaf with three dots above (like > your Moroccan example) at least in the 19th century. No idea when they > switched to the double-lined version. (and I can well imagine how the > three dots would

Re: Meaning of a.o. Persian

2003-07-30 Thread Roozbeh Pournader
> But is ZWJ actually required > within text, or only for display of joining forms in isolation? A simple > example would be of interest. Yes, it is used in normal text. But the only frequent use is after a single Heh, to make it appear in the initial form (which is by tradition used to distingu

Re: Meaning of a.o. Persian

2003-07-29 Thread Roozbeh Pournader
On Tue, 2003-07-29 at 21:06, Patrick Andries wrote: > UTR 20, table 4.1 writes that ZWNJ and ZWJ are needed for « a.o. Persian ». > > What does the abbreviation « a.o. » mean ? Arabic O..? I have no clue about that, but ... > Is this current Farsi or some historical Persian ? ZWNJ and ZWJ are r

Re: I am not in India

2003-07-19 Thread Roozbeh Pournader
On Sat, 2003-07-19 at 02:46, Doug Ewell wrote: > I got something titled " Re: Coptic II?" (note leading space) from > "[EMAIL PROTECTED]", which I am pretty sure is not Roozbeh > Pournader. I definitely now *nothing* about Coptic but that's it's related to Greek to some degree. roozbeh

Re: Problem with Arial Unicode MS font for BOLD/ITALICS in PDF

2003-06-20 Thread Roozbeh Pournader
On Sat, 2003-06-21 at 02:04, Michael Everson wrote: > Roozbeh informs me that "oblique [Naskh] is a standard things > nowadays, specially since it can usually be done automatically in > software. Both slanted and backslanted." Certainly I saw italic > signage in Kabul. Just to confirm. BTW, o

[OT] No more IE for Mac

2003-06-14 Thread Roozbeh Pournader
For those who were worried when is Microsoft going to implement good Unicode support for Mac OS's IE, there is now an answer: Never. Read it yourself: http://news.com.com/2100-1045_3-1017126.html roozbeh

RE: Persian or Farsi? (was RE: Decimal separator with more than o ne c haracter?)

2003-06-01 Thread Roozbeh Pournader
On Sat, 24 May 2003, Rick Cameron wrote: > When I was looking at the President's site, I tried to find a link to > something like the Academy, without success - thanks for providing it. One > problem: the home page of the site is all in Persian! (See how I've > assimilated? ;^) I visited a few oth

Unicode and Iranian Blogs

2003-06-01 Thread Roozbeh Pournader
How Unicode is helping Iranians express themselves: http://www.wired.com/news/culture/0,1284,58976,00.html roozbeh PS: Well, it calls the language 'Farsi' :(

Re: Unicode 4.0 in ICU demos

2003-05-29 Thread Roozbeh Pournader
On Mon, 28 Apr 2003, Mark Davis wrote: > BTW, the ICU demos have been all upgraded to Unicode 4.0, on > http://oss.software.ibm.com/icu/demo/. > They include: > > [...] > > IDNA Demo > This simple demo performs IDNA transformations as described in RFC 3490. But isn't the IDNA repertoire limited

Re: Awful Unicode character names (was Re: I-Ching Hexagrams)

2003-04-09 Thread Roozbeh Pournader
On Tue, 8 Apr 2003, Rick McGowan wrote: > What are your favorites? Let's make a list for the FAQ! :-) U+262B FARSI SYMBOL that actually appears on my passport (Michael, Ken, Asmus and Cathy saw it in Tokyo). Neither Farsi, nor a symbol. In real life, it is the official emblem of the goverment of

RE: ANSI requires licence fees to use ISO language and country code?

2003-03-21 Thread Roozbeh Pournader
On Fri, 21 Mar 2003, Michael Everson wrote: > >However, if you load the list of country codes and names in a > >commercial product, thus giving an added value to your product, we > >consider it normal that ISO asks for the payment of a royalty fee. What about Open Source software like ICU (that i

Re: Surrogate supported in Mozilla 1.3

2003-03-16 Thread Roozbeh Pournader
On Sun, 16 Mar 2003, Tex Texin wrote: > Version 1.3 right? Downloaded and installed three different builds of Mozilla 1.3 for Linux (RH8 Xft version, RH8 gtk2 version, plain version). The first two don't even understand surrogates. The last one only displays question marks. Right to left Etruscan

Re: Surrogate supported in Mozilla 1.3

2003-03-16 Thread Roozbeh Pournader
On Sun, 16 Mar 2003, Tex Texin wrote: > Seems odd. It works on Windows and Mac os X. We will see what Simon says. > Version 1.3 right? Oh sorry! this is 1.3b I'm running. I'll try with 1.3 to see. roozbeh

Re: Surrogate supported in Mozilla 1.3

2003-03-15 Thread Roozbeh Pournader
On Sat, 15 Mar 2003, Tex Texin wrote: > I will update the Unicode example intro page to reflect that Mozilla 1.3 > supports this. Just for Windows, I guess. I have a Red Hat Linux 8.0 box, just installed code2001, but only see question marks on the page. The number of question marks looks to be

Re: Re: farsi calendar components

2003-03-13 Thread Roozbeh Pournader
On Thu, 13 Mar 2003 [EMAIL PROTECTED] wrote: > > Do you mean the Persian calendar (as used in Iran and Afghanistan, also > > known as the solar Islamic calendar or "Hejri-e Shamsi"), or a Gregorian > > the Persian one. Some C code is available with an LGPL license at: http://www.farsiw

Re: farsi calendar components

2003-03-13 Thread Roozbeh Pournader
On Thu, 13 Mar 2003, Paul Hastings wrote: > does anybody know of any java farsi calendar components? thanks. Do you mean the Persian calendar (as used in Iran and Afghanistan, also known as the solar Islamic calendar or "Hejri-e Shamsi"), or a Gregorian calendar translated to Persian (Farsi)? ro

Re: ZWNJ & Persian Collation

2003-03-12 Thread Roozbeh Pournader
On Tue, 11 Mar 2003, Markus Scherer wrote: > The Unicode Collation Algorithm (UCA) for which allkeys.txt is the > default weight table does treat ZWNJ and a number of other characters as > special. For these, they are completely ignored by the UCA - same as if > you stripped them from the text. W

Re: FAQ entry (was: Looking for information on the UnicodeData file)

2003-03-07 Thread Roozbeh Pournader
On Fri, 7 Mar 2003, John H. Jenkins wrote: > since different people speaking different languages > often have different perceptions of what a symbol is. Reminds me of ISIRI 3342 that officially considered symbol and character the same thing and used one word ("namaad", Noon, Meem, Alef, Dal) for

Re: Impossible combinations?

2003-03-02 Thread Roozbeh Pournader
On Sun, 2 Mar 2003, Kevin Brown wrote: > Does anyone know of a Latin-based language in which it is possible to > have a lowercase immediately followed by an uppercase in the SAME word? That happens in many common names, like McGowan. It will also be used in tech terms that need to avoid space for

Re: Unicode 4.0 BETA available for review

2003-02-28 Thread Roozbeh Pournader
On Thu, 27 Feb 2003, Mark Davis wrote: > The Unicode Standard *is* free of charge; the entire text is posted on > www.unicode.org. Well, free of charge to *read personally on the screen*, of course. You can't print the major versions yourself, Addison-Wesley must be asked for that ;) And you can'

Re: UTF-8 Editors? (Was XML and tags)

2003-02-23 Thread Roozbeh Pournader
On Sat, 22 Feb 2003, Jungshik Shin wrote: > > Arabic is not in vim yet. They are putting it in now that we're talking, > > and there have been a lot of discussions on something called 'cream' that > > is a vim distribution that has included the Arabic patch. > > You meant a standalone-gui vim

Re: UTF-8 Editors? (Was XML and tags)

2003-02-22 Thread Roozbeh Pournader
On Sat, 22 Feb 2003, Edward H Trager wrote: > It turns out that the version of vim that I have does indeed work under > xterm for an assortment of LTR languages (Indian languages not tested), > but not Arabic (the only RTL language tested) Arabic is not in vim yet. They are putting it in now that

Re: Panjabi or Punjabi?

2003-02-21 Thread Roozbeh Pournader
On Fri, 21 Feb 2003, Aditya Gokhale wrote: > It is like "Panama" where we have 'pa' strong sound of 'a' and "Sun" > where even though we have a sound of 'a' we still spell it with 'u'. So both > the spellings (Punjabi and Panjabi) are correct. As far as Punjabi / Panjabi > goes, in India it is

Panjabi or Punjabi?

2003-02-21 Thread Roozbeh Pournader
I just found a difference between online reference on official ISO 639 names of the "pa" language. mentions: pa Punjabi while says: Panjabi pendjabi pan pa Which sp

Re: Bidi overrides and chocolate paper

2003-02-17 Thread Roozbeh Pournader
On Mon, 17 Feb 2003, Doug Ewell wrote: > >http://www.farsiweb.info/unicode/chocolatepaper.jpg > > The number "100" isn't supposed to be RTL, is it? Or is that what you > meant by "override"? Exactly. If one puts the correct sequence of characters in a Right-to-Left override mode (like putt

Bidi overrides and chocolate paper

2003-02-17 Thread Roozbeh Pournader
Look how a Turkish chocolate-making company writes all Arabic in Bidi override mode: http://www.farsiweb.info/unicode/chocolatepaper.jpg roozbeh

Re: BOM's at Beginning of Web Pages?

2003-02-17 Thread Roozbeh Pournader
On Mon, 17 Feb 2003, Jungshik Shin wrote: > Incidentally, it just occurred to me that ftp/ssh clients may offer an > user-configurable option for the automatic removal of 'UTF-8 BOM' at > the beginning of a text file in UTF-8 when moving files from Windows to > non-Windows platforms (Unix/Uni

Re: BOM's at Beginning of Web Pages?

2003-02-17 Thread Roozbeh Pournader
On Mon, 2003-02-17 at 15:42, Michael Everson wrote: > I would like to repeat, all of this BOM discussion is all very well > and good, but the Unicode home page displays incorrectly on three OS > X browsers, and the keepers of that home page should delete the first > character before the HTML beg

Re: BOM's at Beginning of Web Pages? Mac IE's Euro

2003-02-17 Thread Roozbeh Pournader
On Mon, 2003-02-17 at 14:27, Tex Texin wrote: > > AFAICR, there is supposed to be no single non-ASCII character before that > > tag. > > I don't believe the standard says that. However, it is recommended that the > META content-type statement is placed as early as possible, [...] Yes, it was som

Re: BOM's at Beginning of Web Pages? Mac IE's Euro

2003-02-16 Thread Roozbeh Pournader
On Sun, 16 Feb 2003, Doug Ewell wrote: > The Unicode home page includes the following line, right where it should > be, in the section: > > > > Any User Agent that takes a page properly marked as UTF-8, as above, and > still tries to autodetect a local code page, is badly misguided. How > wou

Re: BOM's at Beginning of Web Pages? Mac IE's Euro

2003-02-16 Thread Roozbeh Pournader
On Sun, 16 Feb 2003, Doug Ewell wrote: > The mystery remains as to why U+FEFF (or the bytes 0xEF 0xBB 0xBF, > however interpreted) would be displayed as a Euro sign. Autodetection as some other codepage? roozbeh

Re: BOM's at Beginning of Web Pages?

2003-02-16 Thread Roozbeh Pournader
On Sun, 16 Feb 2003 [EMAIL PROTECTED] wrote: > The lack of the BOM in the 'white space' section of the specs may > just be an oversight. I like the idea. This looks practical to me. Ammending HTML 4 to consider this. > Since plain text files can have any kind of file extension, and the > *.TXT e

Re: BOM's at Beginning of Web Pages?

2003-02-15 Thread Roozbeh Pournader
On Sun, 16 Feb 2003 [EMAIL PROTECTED] wrote: > The W3C Mark Up Validation Service at: > http://validator.w3.org/ > > ...validates a UTF-8 web page with a BOM as valid HTML 4.01, > suggesting that the BOM is not at all illegal. Well, I found that, but mischievously tried to hide the fact ;) Acco

Re: BOM's at Beginning of Web Pages?

2003-02-15 Thread Roozbeh Pournader
On Sun, 16 Feb 2003 [EMAIL PROTECTED] wrote: > Let's deprecate those dang BOMs and require all character set > information to be indicated with Plane 14 tags... No, just let's recommend explicitly against BOM in UTF-8 instead of politely telling that it's OK to put a BOM only because somebody li

Re: BOM's at Beginning of Web Pages?

2003-02-15 Thread Roozbeh Pournader
On Sat, 15 Feb 2003, Michael (michka) Kaplan wrote: > And to whatever extent UTF-8 has a BOM, it would fall under the same > category. Certainly that is how processors that understand the UTF-8 > BOM deal with it. Well, that needs researching into what UTF-8 is in W3C and HTML 4.0 terms: What is

Re: BOM's at Beginning of Web Pages?

2003-02-15 Thread Roozbeh Pournader
Found it! It's forbidden to start a HTML 4.0 page with a UTF-8 BOM. Proof: 1. Open the main page of Unicode. You can see that the HTML header says: So, we are talking about HTML 4.0 here. The reference for HTML 4.0 is: http://www.w3.org/TR/1998/REC-html40-19980424/ The section about HT

Re: BOM's at Beginning of Web Pages?

2003-02-15 Thread Roozbeh Pournader
On Sat, 15 Feb 2003, Doug Ewell wrote: > But as I wrote earlier, a zero-width no-break space at the start of a > Web page should not disrupt the content or layout of the page in any > way. It's a space. It's zero-width. A Euro symbol is non-conformant > and just plain wrong; the page starts wit

Re: Everson Mono

2003-02-14 Thread Roozbeh Pournader
On Fri, 14 Feb 2003, Michael Everson wrote: > >Well, since many of us can't open that on a PC, would you tell us > >the number of glyphs so we can correct you if we found about any > >bigger monowidth font? > > 7,072 Wow! The next biggest monowidth non-CJK font I know, has just 5,013. It has L

Re: Everson Mono

2003-02-14 Thread Roozbeh Pournader
On Thu, 13 Feb 2003, Michael Everson wrote: > Anyway, while there are gaps in this font, of course there are gaps > in all the other fonts out there as well. Announcing, then, the > biggest monowidth font I'm aware of Please see > http://www.evertype.com/emono/ Well, since many of us can't

Re: Glyph of Pipeline Characters ?

2003-02-12 Thread Roozbeh Pournader
On Wed, 12 Feb 2003, Doug Ewell wrote: > Also, when you need to search the Unicode mailing list archives, the > search engine on the Unicode site is often more useful than Yahoo: > > http://www.unicode.org/mail-arch/ Thanks a lot! I was misdirected to the Yahoo archives from the mailing list pag

Re: Never say never

2003-02-12 Thread Roozbeh Pournader
On Wed, 12 Feb 2003, Doug Ewell wrote: > I would imagine that issues like this are also discussed on the > members-only UnicoRe list. It's not about discussions, but final decisions. Discussions happen, but from my personal experience I can tell you that what will be decided finally is not always

Re: Glyph of Pipeline Characters ?

2003-02-12 Thread Roozbeh Pournader
On Tue, 11 Feb 2003, Doug Ewell wrote: > > The new character code charts are available for review at > > http://www.unicode.org/charts/u40-beta.html > > (Note: It would be nice if the Unicode site had a link to the beta > charts somewhere. The Pipeline page still says, "Charts of the > character

RE: Never say never

2003-02-12 Thread Roozbeh Pournader
On Wed, 12 Feb 2003, Andy White wrote: > I do regularly check the Unicode site for proposed new characters and > scripts. I did not notice the ORIYA WA before. This must have only just > recently been added. You are right, it was accepted in last December's ISO/IEC JTC1/SC2/WG2 meeting, in Tokyo

Re: 4701

2003-02-04 Thread Roozbeh Pournader
On Tue, 4 Feb 2003, Andrew Cunningham wrote: > From memory, although my memory may be faulty, there are some slight > differences between the animals assigned in the Chinese calendars and > the animals assigned in the Vietnamese calendar. There's also a Turkic cycle of animals borrowed from th

Re: OT: Haikus for Unicode-Haters

2003-02-02 Thread Roozbeh Pournader
On Sun, 2 Feb 2003, Shlomi Tal wrote: > Arabic shaping > Difficult to implement > It's a complex script. I can't understand how Arabic suddenly appears in your list. The complexities are in the script itself, and not in Unicode. I have yet to see any sound standard for Arabic information intercha

Re: ARABIC LETTER KEHEH

2003-01-04 Thread Roozbeh Pournader
On Sat, 4 Jan 2003, Michael Everson wrote: > Someone just wrote me privately: Wondering who was that! r

Re: Coptic II?

2002-12-25 Thread Roozbeh Pournader
On Wed, 25 Dec 2002, John Cowan wrote: > In fact the glyph for Kurdish Q often looks like a large q, similarly to > Cyrillic h; this is an inappropriate glyph for Latin Q. This should be enough evidence. Any samples? roozbeh

Re: Farsi Keheh +06A9 vs. Arabic Kaf +0643 ??

2002-12-22 Thread Roozbeh Pournader
> > I'm trying to find out some information about the extended > > Arabic character KEHEH (06A9), which the code chart indicates > > is used in the Persian (Farsi) language. Staff familiar with > > Persian in the library use the Arabic letter KAF (0643) > > instead of this and the two charact

Re: Walid Patent

2002-11-23 Thread Roozbeh Pournader
On Fri, 22 Nov 2002, Michael Everson wrote: > Can there possibly be any truth in any of this? There was a lot of debates about this in the IDN mailing list. People finally agreed that the patent doesn't apply simply since there has been prior art by Martin Duerst (which was called UTF-5). rooz

Re: [OT] Curious about persian page numbers (was RE: ISIRI 6219:2002)

2002-11-19 Thread Roozbeh Pournader
On Tue, 19 Nov 2002, Marco Cimarosti wrote: > This is what the page numbers look like: > > ..."???", "?", "?", "?", "?" > ("Alif", "B", "T", "Th", "J"...) You are missing the "Peh" between "Beh" and "Teh". > I guess that the name of the letter "?" ("alif") is spelled out in full to > avoid conf

ISIRI 6219:2002

2002-11-19 Thread Roozbeh Pournader
/finalversion.pdf?download A paper copy may be acquired at the price of 4125 IRR from: Institute of Standards and Industrial Research of Iran PO Box 31585-163 Karaj, IRAN Fax: +98 (261) 280-7045 Roozbeh Pournader, for the FarsiWeb Project Group

Call for Papers: EuroTeX 2003

2002-11-07 Thread Roozbeh Pournader
>> Call for Papers! < Fourteenth European TeX Conference (EuroTeX'2003) "Back to Typography" June 24-27th, 2003 Brest, Brittany, France

RE: Header Reply-To

2002-11-06 Thread Roozbeh Pournader
On Mon, 4 Nov 2002, Marco Cimarosti wrote: > Using ` and ' as quotation marks is a long-standing Internet convention. > [...] Not a reply to your message really, but on this topic, I really recommend the following page. It's really well-researched: http://www.cl.cam.ac.uk/~mgk25/ucs/quo

RE: Hindi keyboard with the Microsoft Hindi font Mangal

2002-10-15 Thread Roozbeh Pournader
On Tue, 15 Oct 2002, John McConnell wrote: > Both the Persian (Farsi) keyboard and the Devanagari keyboard include > both ZWJ and ZWNJ. ZWJ is 1+Shift+Ctrl, while ZWNJ is 2+Shift+Ctrl. But these don't work in many applications, and most important of all, Microsoft Word. > The problem appears wi

Re: Hindi keyboard with the Microsoft Hindi font Mangal

2002-10-15 Thread Roozbeh Pournader
On Tue, 15 Oct 2002 [EMAIL PROTECTED] wrote: > > Shift-Control-1 will insert a ZWJ; and Shift-Control-2 will insert a ZWNJ. This > combination is present across all Indic keyboards that ship with Windows 2K and > later. > kr > But sadly, apart from the obvious pain in that certain place to t

Re: Hindi keyboard with the Microsoft Hindi font Mangal

2002-10-15 Thread Roozbeh Pournader
On Mon, 14 Oct 2002, Doug Ewell wrote: > I visited Microsoft's keyboard site at: > > http://www.microsoft.com/globaldev/keyboards/keyboards.asp > > and couldn't find any mappings for ZWJ or ZWNJ on the Hindi keyboard > either. Which also means that the layouts a bad reference for keyboard layo

Re: Capital Letter H with line below

2002-10-11 Thread Roozbeh Pournader
On Fri, 11 Oct 2002, Michael Everson wrote: > Well trying to load that page totally hangs my machine Then blame it first on your browser, then on your OS, and finally on yourself for choosing those, or worse, not updating them regularly. roozbeh ;)

Re: Scripts in Unicode 4.0

2002-08-15 Thread Roozbeh Pournader
On Wed, 14 Aug 2002, Doug Ewell wrote: > These flag characters were accepted and will probably find widespread > use in other contexts, but their politically charged origin is a bit > disturbing. Similar characters may already be anywhere else in Unicode. One interesting example is U+262B, the s

Re: An idea for keeping U+FFFC usable. (spins off from Re: Furigana)

2002-08-14 Thread Roozbeh Pournader
On Wed, 14 Aug 2002, James Kass wrote: > One, the use of *.html clearly violates the standard file naming > convention of eight uppercase ASCII letters followed by a period > followed by a *three* letter uppercase ASCII file name extension. I was wondering if the capitalization, "ASCII", is for

RE: Digraphs as Distinct Logical Units

2002-08-14 Thread Roozbeh Pournader
On Wed, 14 Aug 2002, Marco Cimarosti wrote: > Standing its usage in text, couldn't it be considered as a punctuation mark? No, I don't agree. More like a dingbat it looks to me, as far as you don't get very philosophical. > If this is the case, decomposing the mark into the Arabic letters it d

Re: Digraphs as Distinct Logical Units

2002-08-14 Thread Roozbeh Pournader
On Wed, 14 Aug 2002, Martin Kochanski wrote: > (1) Most software doesn't know what characters exist in any particular > font that the user happens to have chosen, and it doesn't want to know. > This is straightforward modular software design: some part of the > *operating system* is responsible f

Re: Digraphs as Distinct Logical Units

2002-08-13 Thread Roozbeh Pournader
On Tue, 13 Aug 2002, James Kass wrote: > A further note about the proposed "character", a user who is > expecting to see this presentation form would probably be > quite dismayed to find a missing glyph in its place. That may > well be a frequent occurence, especially at first. And it's also a

Re: Digraphs as Distinct Logical Units

2002-08-13 Thread Roozbeh Pournader
On Tue, 13 Aug 2002, Michael Everson wrote: > Doesn't matter where it's encoded. It is to be considered, if you > will pardon the term, as a kind of dingbat, if I understand correctly. I don't have anything against the term. Others may. > Because it isn't a "logo", is used officially and oblig

Re: Digraphs as Distinct Logical Units

2002-08-09 Thread Roozbeh Pournader
On Fri, 9 Aug 2002, Doug Ewell wrote: > StandardizedVariants.html doesn't list anything for vulgar fractions. I > assume they decided the distinction wasn't worth making. Maybe it's reserved for 4.0? Unfortunately there seems that variants are not listed in the pipeline, or it is so possibly be

Re: Digraphs as Distinct Logical Units

2002-08-09 Thread Roozbeh Pournader
On Fri, 9 Aug 2002, Doug Ewell wrote: > Re: Mixed up priorities > From: Michael Everson > Date: Sun Oct 24 1999 - 06:34:24 EDT > [...] > > (I just love that name, don't you? I could say it all day, if only I > knew how. !Xóõ !Xóõ !Xóõ.) > > -Doug Ewell > Fullerton, California which ma

Re: Digraphs as Distinct Logical Units

2002-08-09 Thread Roozbeh Pournader
On Fri, 9 Aug 2002, Doug Ewell wrote: > The horizontal-bar fractions can be mapped to the existing Unicode > fractions, and the only thing lost in round-tripping is the exact glyph > shape. Was there anything decided about using variant selectors for selecting exact shapes? roozbeh

  1   2   3   4   >