Re: About the European MES-2 subset (was: PUA Audio Description,Subtitle, Signing)

2003-07-18 Thread Michael Everson
At 00:57 +0200 2003-07-18, Philippe Verdy wrote: Why is row 03 so resticted? Shouldn't it include those accents and diacritics that are used by other characters once canonically decomposed? Or does it imply that MES-2 is only supposed to use strings if NFC form? Also, is this list under full

Re: About the European MES-2 subset (was: PUA Audio Description, Subtitle, Signing)

2003-07-18 Thread Philippe Verdy
On Friday, July 18, 2003 7:36 AM, Michael Everson [EMAIL PROTECTED] wrote: At 00:57 +0200 2003-07-18, Philippe Verdy wrote: Why is row 03 so resticted? Shouldn't it include those accents and diacritics that are used by other characters once canonically decomposed? Or does it imply that

Re: About the European MES-2 subset (was: PUA Audio Description,Subtitle, Signing)

2003-07-18 Thread Michael Everson
At 12:16 +0200 2003-07-18, Philippe Verdy wrote: Is there some work at CEN to align its MES-2 subset into a revized (MES-2.1 ???) which not only takes into consideration the ISO10646 reference but also its Unicode properties to make this set self-closed, and actually implementable, at least

RE: About the European MES-2 subset (was: PUA Audio Description, Subtitle, Signing)

2003-07-18 Thread Kent Karlsson
Philippe Verdy wrote: MES-2 is a collection of characters independant of their actual encoding. To support MES-2 in a Unicode-compliant application, extra characters need to be added, notably if the minimum requirement for information interchange is the NFC form used by XML and HTML related

Re: About the European MES-2 subset

2003-07-18 Thread Peter Kirk
On 18/07/2003 03:16, Philippe Verdy wrote: I still note that modern Hebrew and Arabic are excluded from MES-2, as they are not used in any official language in the European Union or EFTA, or future EU candidates. ... But they are used in official publications within the EU, those targeted at

Re: Putting Unicode to Work

2003-07-18 Thread Peter Kirk
On 17/07/2003 18:36, djinn wrote: Although some list members may already be aware of these pages, because there are still very few web sites today that present text using a variety of Unicode ranges for purposes other than 'display testing', I thought the entire list should know about the

Re: About the European MES-2 subset (was: PUA Audio Description, Subtitle, Signing)

2003-07-18 Thread Philippe Verdy
On Friday, July 18, 2003 12:42 PM, Michael Everson [EMAIL PROTECTED] wrote: At 12:16 +0200 2003-07-18, Philippe Verdy wrote: Is there some work at CEN to align its MES-2 subset into a revized (MES-2.1 ???) which not only takes into consideration the ISO10646 reference but also its

Re: About the European MES-2 subset

2003-07-18 Thread Philippe Verdy
On Friday, July 18, 2003 1:13 PM, Peter Kirk [EMAIL PROTECTED] wrote: On 18/07/2003 03:16, Philippe Verdy wrote: I still note that modern Hebrew and Arabic are excluded from MES-2, as they are not used in any official language in the European Union or EFTA, or future EU candidates. ...

Re: About the European MES-2 subset

2003-07-18 Thread Peter Kirk
On 18/07/2003 06:21, Philippe Verdy wrote: But for these Asian languages, I think it's best to have fonts designed to handle correctly their corresponding scripts, instead of a giant font poorly hinted for readability at small sizes, and without support of common ligatures. Agreed. Giant fonts

Re: About the European MES-2 subset

2003-07-18 Thread John Cowan
Peter Kirk scripsit: Agreed. Giant fonts have their uses, e.g. Arial Unicode MS and Code2000 let me get a flavour of complex script pages which I browse to on the Internet, often by mistake, without having to install special fonts for scripts I don't read. However, a font like Last

Re: Char Set Detector

2003-07-18 Thread Shanjian Li
Try universal detector in mozilla tree, most eastern asia character set encodings can be detected with high accuracy. http://lxr.mozilla.org/seamonkey/source/extensions/universalchardet/ shanjian Yogesh Kumar Ahuja wrote: Hi, Can any body give me the path/url to get a Char set Detector. which

Re: About the European MES-2 subset (was: PUA Audio Description,Subtitle, Signing)

2003-07-18 Thread Michael Everson
At 13:35 +0200 2003-07-18, Philippe Verdy wrote: I note that you prefer the European Multilingual Subset to MES-2. Is it an extended set that includes MES-2, and fills the holes by using all characters defined in blocks of some version of the Unicode set? It is script-based, not character

I am not in India

2003-07-18 Thread Michael Everson
Colleagues, Apparently some of you have got copies of mail I wrote in December 2002 entitled Coptic II? which has some virus attachment to it. This has been sent by [EMAIL PROTECTED] which is not me, and I didn't send it, and I use Mac OS X and Eudora so I don't have a virus. Thanks. --

Re: Thank you. (was Re: [Private Use Area] Audio Description,Subtitle, Signing)

2003-07-18 Thread Michael Everson
At 12:11 +0100 2003-07-18, William Overington wrote: Thank you for the list of code points for MES-2. I have already found that the DVB-MHP minimum set does not have some of them and that the DVB-MHP minimum set does have some which are not in MES-2, such as U+1EB0 to U+1EB5. If this is of

I am not in India II

2003-07-18 Thread Michael Everson
Your message has encountered delivery problems to the following recipient(s): [EMAIL PROTECTED] Delivery failed 554 delivery error: dd This user doesn't have a yahoo.co.in account ([EMAIL PROTECTED]) [-5] - mta104.mail.in.yahoo.com See? -- Michael Everson * * Everson Typography * *

Re: About the European MES-2 subset

2003-07-18 Thread Michael Everson
At 11:28 -0400 2003-07-18, John Cowan wrote: However, a font like Last Resort (the world's smallest giant font, as it were) does that just about as well. While I hate seeing the Last Resort font show up, I love seeing it when it does. :-) S much better than ?. -- Michael Everson * * Everson

Re: I am not in India II

2003-07-18 Thread Thomas M. Widmann
Michael Everson [EMAIL PROTECTED] writes: Your message has encountered delivery problems to the following recipient(s): [EMAIL PROTECTED] Delivery failed 554 delivery error: dd This user doesn't have a yahoo.co.in account ([EMAIL PROTECTED]) [-5] - mta104.mail.in.yahoo.com See?

Re: I am not in India

2003-07-18 Thread Doug Ewell
Michael Everson everson at evertype dot com wrote: Apparently some of you have got copies of mail I wrote in December 2002 entitled Coptic II? which has some virus attachment to it. This has been sent by [EMAIL PROTECTED] which is not me, and I didn't send it, and I use Mac OS X and Eudora so

RE: About the European MES-2 subset (was: PUA Audio Description,Subtitle, Signing)

2003-07-18 Thread Michael Everson
At 13:07 +0200 2003-07-18, Kent Karlsson wrote: This is not to say that the MESes are unproblematic. To mention just two points not already mentioned: none of the new math characters are included even in MES-3 (a, b), despite that all math characters were supposed to be included That isn't true.

Re: About the European MES-2 subset

2003-07-18 Thread Michael \(michka\) Kaplan
A question mark is a sign of a bad conversion from Unicode (to a code page that did not contain the character). This would likely happen on the Mac too rather than the Last Resort font, wouldn't it? On Windows, the cannot find a font for it situation is the NULL glyph. The Last Resort font is

Re: I am not in India II

2003-07-18 Thread Adam Twardoch
From: Michael Everson [EMAIL PROTECTED] [EMAIL PROTECTED] This merely means that somebody has a virus who had both Michael and Roozbeh in his/her address book. People who believe that e-mails with a particular name in the From field must come from that very person can be called, ehem, naiive.

Re: I am not in India II

2003-07-18 Thread Michael Everson
At 00:44 +0200 2003-07-19, Adam Twardoch wrote: From: Michael Everson [EMAIL PROTECTED] [EMAIL PROTECTED] This merely means that somebody has a virus who had both Michael and Roozbeh in his/her address book. People who believe that e-mails with a particular name in the From field must come from

Re: About the European MES-2 subset

2003-07-18 Thread Michael Everson
At 15:45 -0700 2003-07-18, Michael \(michka\) Kaplan wrote: A question mark is a sign of a bad conversion from Unicode (to a code page that did not contain the character). This would likely happen on the Mac too rather than the Last Resort font, wouldn't it? No, it wouldn't. A not a character

Re: About the European MES-2 subset

2003-07-18 Thread Michael \(michka\) Kaplan
I am pretty sure you have to be wrong here, Michael. Attend me: 1) API converts from Unicode to the wrong code page 2) API does some sort of work with the string 3) API tries to display the string How on earth could it from the Last Resort font, unless it is a generic glyph that contains no

Proposed Draft UTR #31: Identifier and Pattern Syntax

2003-07-18 Thread Mark Davis
There is a new proposed draft TR available for public comment on http://www.unicode.org/reports/tr31/. This document describes specifications for recommended defaults for the use of Unicode in the definitions of identifiers and in pattern-based syntax. It incorporates the Identifier section of

Re: About the European MES-2 subset

2003-07-18 Thread John Cowan
Michael (michka) Kaplan scripsit: In any case, Code2000 giving some glyph for more cases is still a better solution. In any case, if you cannot read any of the languages that use a given script, you are unlikely to care much what glyph appears, and if it turns out that you do care, the LR font

Re: I am not in India II

2003-07-18 Thread Curtis Clark
Michael Everson wrote: People who believe that e-mails with a particular name in the From field must come from that very person can be called, ehem, naiive. That's an interesting way of writing the diaeresis on nave, Adam. :-) It's a good thing it's soft-dotted! Or perhaps he meant nave. :-)