RE: Emoji boom?

2019-05-01 Thread Phillips, Addison via Unicode
languages digitally accessible. What's more, when implementations support emoji features, such as ZWJ sequences, variation selectors, etc., they are also building necessary mechanisms for supporting complex scripts (many of which are recently encoded or on the roadmap). Addison Phillips Sr

RE: Unicode in the Curriculum?

2015-12-30 Thread Phillips, Addison
the industry in thinking that educational establishments could do a better job of preparing developers with at least the basics of Unicode, character encodings, and internationalization. Addison Phillips Principal SDE, I18N Architect (Amazon) Chair (W3C I18N WG) Internationalization is n

RE: Concise term for non-ASCII Unicode characters

2015-09-20 Thread Phillips, Addison
I agree, although I note that sometimes the additional (redundant) specificity of "non-7-bit-ASCII characters" is needed when talking to people unclear on what "ASCII" means. Addison > -Original Message- > From: Unicode [mailto:unicode-boun...@unico

RE: Usage stats?

2015-03-27 Thread Phillips, Addison
What you might be looking for would be the CLDR project’s “exemplar sets” (see for example [1]), which describes which characters are customarily used for a given language and which are sometimes used. However, this is not the same thing as statistical distribution. One of the points of Unicode

RE: Unicode Sets in 'Unicode Regular Expressions'

2014-05-27 Thread Phillips, Addison
). Or at least this is how I have already read it. Addison -Original Message- From: Unicode [mailto:unicode-boun...@unicode.org] On Behalf Of Richard Wordingham Sent: Tuesday, May 27, 2014 3:18 PM To: unicode@unicode.org Subject: Unicode Sets in 'Unicode Regular Expressions' UTS#18

Re: Origin of Ellipsis (was: RE: Empty set)

2013-09-15 Thread Phillips, Addison
Not if the limit is counted in characters and not in bytes. Twitter, for example, counts code points in the NFC representation of a tweet. Doug Ewell d...@ewellic.org wrote: Andre Schappo wrote: U+2026 is useful for microblogs when one is looking to save characters Not if the microblog is in

Re: Origin of Ellipsis (was: RE: Empty set)

2013-09-15 Thread Phillips, Addison
Actually, that's my bad: I meant to type scalar value. Stephan Stiller stephan.stil...@gmail.com wrote: On 9/15/2013 3:07 PM, Phillips, Addison wrote: Not if the limit is counted in characters and not in bytes. Twitter, for example, counts code points in the NFC representation of a tweet

RE: Can a single text document use multiple character encodings?

2013-08-28 Thread Phillips, Addison
What kind of document do you mean? For Web formats (HTML, etc.), the answer is no. Addison Addison Phillips Globalization Architect (Amazon Lab126) Chair (W3C I18N WG) Internationalization is not a feature. It is an architecture. -Original Message- From: unicode-bou

RE: What to backup after corruption of code units?

2013-08-27 Thread Phillips, Addison
be pointed at DC00, which is a trailing (low) surrogate. The pointer needs to back up (decrement) by one to position 3 (0xD800) to find the start of the character (each of the other code units refers to a single code point). Addison Phillips Globalization Architect (Amazon Lab126) Chair (W3C I18N

RE: Latvian and Marshallese Ad Hoc Report (cedilla and comma below)

2013-07-03 Thread Phillips, Addison
in a Marshallese locale (or have another such external hint). The real issue is the need for language metadata in file formats and other places to enable language-specific rendering to take place. Lacking language metadata, you get what you get. Addison Addison Phillips Globalization Architect

RE: Text in composed normalized form is king, right? Does anyone generate text in decomposed normalized form?

2013-02-01 Thread Phillips, Addison
being dead, other normalization forms like NFD are useful for various kinds of processing. Addison Addison Phillips Globalization Architect (Lab126) Chair (W3C I18N WG) Internationalization is not a feature. It is an architecture. -Original Message- From: unicode-bou...@unicode.org

RE: RLI and bdi, and how to get an update of changes

2013-01-15 Thread Phillips, Addison
. Addison Addison Phillips Globalization Architect (Lab126) Chair (W3C I18N WG) Internationalization is not a feature. It is an architecture.

RE: Are there Unicode processors?

2013-01-07 Thread Phillips, Addison
://perldoc.perl.org/5.14.0/perlunicode.html#Unicode-Character-Properties Or: http://userguide.icu-project.org/strings/properties Addison Addison Phillips Globalization Architect (Lab126) Chair (W3C I18N WG) Internationalization is not a feature. It is an architecture. -Original Message- From

RE: latin1 decoder implementation

2012-11-16 Thread Phillips, Addison
The block is actually named for it. See: http://www.unicode.org/charts/PDF/U0080.pdf This FAQ talks about it: http://www.unicode.org/faq/blocks_ranges.html Finally, p217 of the standard actually says so explicitly: http://www.unicode.org/versions/Unicode6.2.0/ch07.pdf Addison

RE: Searching data: map countries to scripts

2012-08-21 Thread Phillips, Addison
. Addison

RE: Klingon on Unicode site?

2012-04-03 Thread Phillips, Addison
and provide for this (as well as other use cases). Cf. http://norbertlindenberg.com/2012/02/ecmascript-internationalization-api/index.html Addison Addison Phillips Globalization Architect (Lab126) Chair (W3C I18N WG) Internationalization is not a feature. It is an architecture.

Re: Japanese font on Non-Japanese Android phones

2011-10-08 Thread Phillips, Addison
That's for search analysis, not rendering. Sent from my iPhone On Oct 8, 2011, at 7:45 AM, Andreas Prilop prilop4...@trashmail.net wrote: On Fri, 7 Oct 2011, Gerrit wrote: So if somebody from Google reads this, [...] Additionally, if the standard Android web browser could then use the

RE: UNICODE version of _T(x) macro

2010-11-22 Thread Phillips, Addison
terminology such as character. If you want to use UTF-32, which uses 32-bit code units, one per code point, you can use a 32-bit data type instead. Those are always 4 bytes wide. Addison Addison Phillips Globalization Architect (Lab126) Chair (W3C I18N, IETF IRI WGs) Internationalization

RE: composite graphemes

2010-10-13 Thread Phillips, Addison
with cases such as this. See UAX#15 for more details. Regards, Addison Addison Phillips Globalization Architect (Lab126) Chair (W3C I18N, IETF IRI WGs) Internationalization is not a feature. It is an architecture. -Original Message- From: unicode-bou...@unicode.org [mailto:unicode

RE: CLDR locales: Filipino (fil/ph?) Pilipino/Tagalog (tl/tlg)

2004-12-27 Thread Addison Phillips [wM]
Following draft-langtags (and CLDR usage), it would be tl-Tglg-PH Addison Addison P. Phillips Director, Globalization Architecture http://www.webMethods.com Chair, W3C Internationalization Working Group http://www.w3.org/International Internationalization is an architecture

RE: (base as a combing char)

2004-11-27 Thread Addison Phillips [wM]
of logical characters to behave as a single character-- (which is also encoded as a precomposed character U+00FC). So: what application do you have for two base characters treated as a combining mark? Then list members might be able to comments with precision on your request. Best Regards, Addison

RE: Ligatures

2004-11-27 Thread Addison Phillips [wM]
). PS Flarn isn't the reference I think it is, is it? Best Regards, Addison Addison P. Phillips Director, Globalization Architecture http://www.webMethods.com Chair, W3C Internationalization Working Group http://www.w3.org/International Internationalization is an architecture. It is not a feature

RE: Shift-JIS conversion.

2004-11-25 Thread Addison Phillips [wM]
ains about 9,000 characters. Unicode (andthus its encoding UTF-8)contains about 93,000 characters. There will not be a mapping to SJIS for every Unicode character. Hope that helps. Best Regards, Addison Addison P. PhillipsDirector, Globalization Architecturehttp://www.webMethods.com

RE: My Querry

2004-11-23 Thread Addison Phillips [wM]
to 7F hex. That is, amoung other things UTF-8 was designed specifically to be compatible with C language strings. Addison Addison P. Phillips Director, Globalization Architecture http://www.webMethods.com Chair, W3C Internationalization Working Group http://www.w3.org/International

RE: My Querry

2004-11-23 Thread Addison Phillips [wM]
] [mailto:[EMAIL PROTECTED]] (B On Behalf Of Addison Phillips [wM] Sent: Tuesday, (B November 23, 2004 9:14 AM (B One of the nice things about UTF-8 is that the ASCII (B bytes from 0 to 7F hex (including the C0 control (B characters from \x00 through \x01f---including (B NULL)

RE: valid characters in user names- esp. compatibility characters

2004-08-11 Thread Addison Phillips [wM]
-locale.com. I can't remember, but I think this one was a substitute paper at IUC22, so it probably isn't in the program proceedings. Hope this helps, Addison Addison P. Phillips Director, Globalization Architecture webMethods | Delivering Global Business Visibility http://www.webMethods.com Chair, W3C

W3C I18N rechartering: input invited

2004-07-15 Thread Addison Phillips [wM]
. Feedback, suggestions and comments are welcome (please use the [EMAIL PROTECTED] list for public feedback). Member-only or team-only comments should be directed to Martin Drst or Richard Ishida. Best Regards, Addison Phillips, Chair W3C-I18N-WG Martin Drst, I18N Activity Lead Richard Ishida

RE: number of bytes for simplified chinese

2004-06-28 Thread Addison Phillips [wM]
verage byte count per character. If you are worried about storage (disk, database, etc.), then the specifics of your situation will determine what the "right answer" may be for you. Best Regards, Addison Addison P. PhillipsDirector, Globalization ArchitecturewebMethods | Delivering Global

RE: Latin Characters

2004-06-24 Thread Addison Phillips [wM]
ions in my little document "Learn to Type Japanese and Other Languages"here: http://www.inter-locale.com/whitepaper/LearnToType.pdf (Acrobat PDF format) Addison P. PhillipsDirector, Globalization ArchitecturewebMethods | Delivering Global Business Visibilityhttp://www.webM

RE: Latin long vowels

2004-06-22 Thread Addison Phillips [wM]
other fonts, you can use the Character Map utility to explore the character ranges supplied by various fonts on your system. Best Regards, Addison Addison P. PhillipsDirector, Globalization ArchitecturewebMethods | Delivering Global Business Visibilityhttp://www.webMethods.comChair, W3C

RE: ISO 15924 draft fixes

2004-05-20 Thread Addison Phillips [wM]
. Spelling or nomenclatural (non-substantive) changes in the descriptions are errata. But I view changes, additions, and deletions to/from the data tables as changes to the standard and they should, in my opinion, be treated as such even if they are only to correct errors. Best Regards, Addison

RE: ISO 15924 draft fixes

2004-05-20 Thread Addison Phillips [wM]
first, IMO. But if the file is in some other format, that's fine, so long as the format is stable. Best Regards, Addison Addison P. Phillips Director, Globalization Architecture webMethods | Delivering Global Business Visibility http://www.webMethods.com Chair, W3C Internationalization (I18N

RE: TR35

2004-05-13 Thread Addison Phillips [wM]
preferences. Check out our group's document (and the forthcoming requirements document) and see if you don't agree... but we should be wary of very broad global statements (both all language tags are also locale tags and language tags are never locale tags). Best Regards, Addison Addison P. Phillips

RE: multibyte char display

2004-03-15 Thread Addison Phillips [wM]
). Writing out files as UTF-8 (instead of System.out.println()) will let you use the native2ascii utility or other programs to investigate the actual codepoints you are retrieving. Best Regards, Addison Addison P. Phillips Director, Globalization Architecture webMethods | Delivering Global Business

RE: German characters not correct in output webform

2004-01-13 Thread Addison Phillips [wM]
documentation). You might want to be aware of the W3C's Internationalization mailing list (See http://lists.w3.org/Archives/Public/www-international/)and of the FAQs at http://www.w3.org/International/geo (alas, the FAQ on this topic hasn't been published yet!) Best Regards, Addison Addison P

RE: Fonts on Web Pages

2003-12-02 Thread Addison Phillips [wM]
to look in the right places :-). Regards, Addison Addison P. Phillips Director, Globalization Architecture webMethods | Delivering Global Business Visibility http://www.webMethods.com Chair, W3C Internationalization (I18N) Working Group Chair, W3C-I18N-WG, Web Services Task Force http://www.w3.org

RE: Proposed Successor to RFC 3066 (language tags)

2003-11-19 Thread Addison Phillips [wM]
Regards, Addison Addison P. PhillipsDirector, Globalization ArchitecturewebMethods | Delivering Global Business Visibilityhttp://www.webMethods.comChair, W3C Internationalization (I18N) Working GroupChair, W3C-I18N-WG, Web Services Task Forcehttp://www.w3.org/InternationalInternationalization

OFF-TOPIC: Proposed Successor to RFC 3066 (language tags)

2003-11-19 Thread Addison Phillips [wM]
thread over there from escaping back to the Unicode list. I HAVE posted a response to your message to you privately, copy that list. Thanks again for the comments. Addison Addison P. Phillips Director, Globalization Architecture webMethods | Delivering Global Business Visibility http

RE: Ewellic

2003-11-13 Thread Addison Phillips [wM]
, Addison Addison P. Phillips Director, Globalization Architecture webMethods | Delivering Global Business Visibility http://www.webMethods.com Chair, W3C Internationalization (I18N) Working Group Chair, W3C-I18N-WG, Web Services Task Force http://www.w3.org/International Internationalization

RE: Ill-formed sequences (was: Re: UTF-16 inside UTF-8)

2003-11-05 Thread Addison Phillips [wM]
with UTF-16. I am under the impression that they mean combining sequences. Addison P. Phillips Director, Globalization Architecture webMethods | Delivering Global Business Visibility http://www.webMethods.com Chair, W3C Internationalization (I18N) Working Group Chair, W3C-I18N-WG, Web Services

RE: UTF-9

2003-10-30 Thread Addison Phillips [wM]
Pub date = 1 April 2003. I think that's the salient part. Addison P. Phillips Director, Globalization Architecture webMethods | Delivering Global Business Visibility http://www.webMethods.com Chair, W3C Internationalization (I18N) Working Group Chair, W3C-I18N-WG, Web Services Task Force http

RE: Beyond 17 planes, was: Java char and Unicode 3.0+

2003-10-16 Thread Addison Phillips [wM]
implementations to support this stuff. Regards, Addison Addison P. Phillips Director, Globalization Architecture webMethods | Delivering Global Business Visibility 432 Lakeside Drive, Sunnyvale, CA, USA +1 408.962.5487 (office) +1 408.210.3569 (mobile) mailto:[EMAIL PROTECTED] Chair, W3C-I18N-WG

RE: Beyond 17 planes, was: Java char and Unicode 3.0+

2003-10-16 Thread Addison Phillips [wM]
Maybe I should have said it the other way first: yuck. I don't want any characters beyond the planes ever. Nor a change in encoding rules. Addison P. Phillips Director, Globalization Architecture webMethods | Delivering Global Business Visibility 432 Lakeside Drive, Sunnyvale, CA, USA +1

DEC-MCS mapping, anyone?

2003-10-10 Thread Addison Phillips [wM]
I'm looking for an authoritative mapping table of this old VMS encoding to Unicode and can't find one anywhere. Anyone got one handy? Thanks in advance! Addison Addison P. Phillips Director, Globalization Architecture webMethods | Delivering Global Business Visibility 432 Lakeside Drive

Re: Euro Currency for UK

2003-10-09 Thread Addison Phillips [wM]
correctly. Most programming environments provide a way to format a value using separate locale rules (for grouping and decimal separators) and currency. More information about what you're trying to do would help in recommending a solution. Best Regards, Addison -- Addison P. Phillips

RE: Web Form: Other Question: Java Tool

2003-09-19 Thread Addison Phillips [wM]
to substitute the plain fonts in order to see the characters, but they won't be bold or italic or what-have-you. Hope this helps. Best Regards, Addison Addison P. Phillips Director, Globalization Architecture webMethods | Delivering Global Business Visibility 432 Lakeside Drive, Sunnyvale, CA, USA +1

GB18030 mapping table....

2003-08-19 Thread Addison Phillips [wM]
don't have them bookmarked on this machine. Best Regards, Addison Addison P. Phillips Director, Globalization Architecture webMethods, Inc. -- Global Business Visibility 432 Lakeside Drive, Sunnyvale, CA, USA +1 408.962.5487 (office) +1 408.210.3569 (mobile) mailto:[EMAIL PROTECTED] Chair, W3C

Re: Display and Store Problem!

2003-08-08 Thread Addison Phillips [wM]
a paper that describes some of these issues on the webMethods website (it's Creating Multi-lingual and Multi-locale Databases) here: http://www.webmethods.com/cmb/solutions/solutions_white_paper_email_entry/ Best Regards, Addison -- Addison P. Phillips Director, Globalization Architecture

Re: ISO 639 duplicate codes (was: Re: Ligatures in Turkish andAzeri, was: Accented ij ligatures)

2003-07-15 Thread Addison Phillips [wM]
code... In any event, you really ought to try the newer versions of ICU4J out. They are a lot easier to work with. And a light version isn't that hard to create, if that's what you want. Best Regards, Addison -- Addison P. Phillips Director, Globalization Architecture webMethods, Inc. +1

Re: ISO 639 duplicate codes (was: Re: Ligatures in Turkish andAzeri, was: Accented ij ligatures)

2003-07-15 Thread Addison Phillips [wM]
source code... In any event, you really ought to try the newer versions of ICU4J out. They are a lot easier to work with. And a light version isn't that hard to create, if that's what you want. Best Regards, Addison -- Addison P. Phillips Director, Globalization Architecture webMethods, Inc

[W3C-I18N] Working Draft, Web Services Usage Scenarios, Published

2002-12-23 Thread Addison Phillips [wM]
public mailing list (public-i18n-ws @ w3.org). Instructions on joining the Task Force and mailing list are located on our home page, located here: http://www.w3.org/International/ws. Best Regards, Addison Addison P. Phillips Director, Globalization Architecture webMethods, Inc. +1

[W3C-I18N-WG] Call for Participation, Web Services TF

2002-10-14 Thread Addison Phillips [wM]
check our how public page here: http://www.w3.org/International/ws Best Regards, Addison [1] http://www.w3.org/Consortium/Member/List Addison P. Phillips Director, Globalization Architecture webMethods, Inc. +1 408.962.5487 (phone) +1 408.210.3569 (mobile

RE: Problems converting from UTF-8 to UCS-2 and vice-versa using JRun 3.1, SQL Server 2000, Windows 2000 and Java 3.1

2002-09-12 Thread Addison Phillips [wM]
Regards, Addison -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On Behalf Of Marco Cimarosti Sent: Thursday, September 12, 2002 5:12 AM To: '[EMAIL PROTECTED]'; '[EMAIL PROTECTED]' Subject: RE: Problems converting from UTF-8 to UCS-2 and vice-versa using JRun 3.1

RE: Problems converting from UTF-8 to UCS-2 and vice-versa using JRun 3.1, SQL Server 2000, Windows 2000 and Java 3.1

2002-09-12 Thread Addison Phillips [wM]
code is correct: byte[] utf8bytes = myString.getBytes(UTF-8); Note that I have omitted try/catch blocks for clarity, but the compiler will insist on them... Hope that helps. Best Regards, Addison Addison P. Phillips Director, Globalization Architecture webMethods, Inc. 432 Lakeside Drive

RE: Problems converting from UTF-8 to UCS-2 and vice-versa using JRun 3.1, SQL Server 2000, Windows 2000 and Java 3.1

2002-09-12 Thread Addison Phillips [wM]
Hope that helps. Best Regards, Addison Addison P. Phillips Director, Globalization Architecture webMethods, Inc. 432 Lakeside Drive Sunnyvale, California, USA +1 408.962.5487 (phone) +1 408.210.3569 (mobile) - Internationalization is an architecture

RE: UCS-2 and UTF-16

2002-09-12 Thread Addison Phillips [wM]
. In this context, UCS-2 vs. UTF-16 is unimportant. Addison Addison P. Phillips Director, Globalization Architecture webMethods, Inc. 432 Lakeside Drive Sunnyvale, California, USA +1 408.962.5487 (phone) +1 408.210.3569 (mobile) - Internationalization

RE: FW: New version of TR29:

2002-08-20 Thread Addison Phillips [wM]
How about I'll or it's. Regards, Addison -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On Behalf Of John Cowan Sent: Tuesday, August 20, 2002 4:40 AM To: Marco Cimarosti Cc: '[EMAIL PROTECTED]' Subject: Re: FW: New version of TR29: Marco Cimarosti

RE: NLS_LANG for russian characterset in UNIX

2002-08-14 Thread Addison Phillips [wM]
rs that you retrieve from the database, so you'll probably end up setting both (again, UTF-8 locales in UNIX work nicely with Unicode encoded databases). Hope that helps. Best Regards, Addison -Original Message-From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On Behalf Of

(long) Making orthographies computer-ready (was *not* Telephoning Tamil)

2002-07-29 Thread Addison Phillips [wM]
the days of the typewriter), for my money would probably be to get ISO-639 to issue the language a 2-letter code so you can have locale (and Unicode character database) data tagged with it ;-). Best Regards, Addison Addison P. Phillips Director, Globalization Architecture webMethods, Inc. 432

RE: REALLY *not* Tamil - changing scripts (long)

2002-07-29 Thread Addison Phillips [wM]
. If you've had an orthography since 1937 based on locally available typewriters, then you probably won't want to change. If you have NO writing tradition, you're better off avoiding unnecessary headaches, IMO. Addison

RE: (long) Making orthographies computer-ready (was *not* Telephoning Tamil)

2002-07-29 Thread Addison Phillips [wM]
preventing it. Addison -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On Behalf Of [EMAIL PROTECTED] Sent: Monday, July 29, 2002 3:02 PM To: [EMAIL PROTECTED] Subject: Re: (long) Making orthographies computer-ready (was *not* Telephoning Tamil) On 07/29/2002

RE: (long) Making orthographies computer-ready (was *not* Telephoning Tamil)

2002-07-29 Thread Addison Phillips [wM]
a bang or an octothorpe ;-). Best Regards, Addison

RE: REALLY *not* Tamil - changing scripts (long)

2002-07-27 Thread Addison Phillips [wM]
, so heck...) Addison -Original Message- From: David Starner [mailto:[EMAIL PROTECTED]] Sent: Saturday, July 27, 2002 12:07 AM To: Addison Phillips [wM]; Curtis Clark Cc: [EMAIL PROTECTED] Subject: Re: REALLY *not* Tamil - changing scripts (long) At 08:46 PM 7/26/02 -0700, Addison

REALLY *not* Tamil - changing scripts (long)

2002-07-26 Thread Addison Phillips [wM]
conveniently do this). Obviously I'm not an expert in these linguistic areas (and hence rarely comment on them), but it seems to me that the lack of other mechanisms makes Unicode an attractive target for criticism in this area. Best Regards, Addison Addison P. Phillips Director, Globalization

RE: User interface for keyboard input

2002-07-18 Thread Addison Phillips [wM]
Hi Martin, I install the Chinese Unicode keyboard myself... When I confronted this specific problem recently in our products, the main solution I adopted was to allow \u notation as input (of course, our products are for developers...) Hope that helps. Addison Addison P. Phillips

RE: Shift-JIS - Unicode?

2002-06-15 Thread Addison Phillips [wM]
because the conversions will be similar to other programs running in that environment, providing the behavior that users expect. Best Regards, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA +1 408.962.5487

RE: Shift-JIS - Unicode?

2002-06-15 Thread Addison Phillips [wM]
://ftp.unicode.org/Public/MAPPINGS/OBSOLETE/EASTASIA/JIS/ ~Addison -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On Behalf Of Stefan Persson Sent: 2002年6月15日 8:35 To: Addison Phillips [wM]; [EMAIL PROTECTED] Subject: Re: Shift-JIS - Unicode? - Original

RE: Running Japanese programs under Win2K

2002-06-09 Thread Addison Phillips [wM]
do this. However, you might regard programs that display ??? and mojibake text as broken and gently remind the manufacturer of the defects ;-). Good luck, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA

RE: Running Japanese programs under Win2K

2002-06-09 Thread Addison Phillips [wM]
Small correction here: MUI is not required in order to change the default system locale to Japanese and keep the user interface as English on a Win2K enu machine. Not in a Java program ::sigh::. Some parts of Swing and AWT use the user locale, especially older JREs... Addison

RE: To hell with Unicode ;)

2002-05-28 Thread Addison Phillips [wM]
shortcuts to support a single, specific language. Unicode *is* complex. But I suspect that my copy of TUS (sans cover) actually weighs less than my copy of Lunde/blowfish. *I* would rather face that putative horde of zombies than go back to multibyte enabling for a living. Best Regards, Addison

RE: Normalization forms

2002-05-13 Thread Addison Phillips [wM]
Hi Lars, Some information below... Addison Addison P. Phillips Globalization Architect webMethods, Inc. 432 Lakeside Drive Sunnyvale, California, USA +1 408.962.5487 (phone) +1 408.210.3569 (mobile) - Internationalization is an architecture

RE: regarding unicode support in Oracle8i

2002-05-12 Thread Addison Phillips [wM]
Whups. Typo. Try http://www.inter-locale.com/IUC19.pdf (IUC is International Unicode Conference. ICU is (IBM) International Classes for Unicode). Addison -Original Message- From: J M Sykes [mailto:[EMAIL PROTECTED]] Sent: 2002年5月12日 3:29 To: Addison Phillips [wM]; [EMAIL

RE: regarding unicode support in Oracle8i

2002-05-10 Thread Addison Phillips [wM]
that may help you a little. Regards, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA +1 408.962.5487 (phone) +1 408.210.3659 (mobile) - Internationalization

RE: accessing extended ranges

2002-03-27 Thread Addison Phillips [wM]
is actually the hard bit. Somehow I suspect that a BugParade issue would get rejected as Unicode 3.0++ not supported in JDK1.4, even if everyone on this list went and voted for it ;-). Regards, Addison Addison P. Phillips Globalization Architect webMethods, Inc. 432 Lakeside Drive Sunnyvale, California

RE: accessing extended ranges

2002-03-26 Thread Addison Phillips [wM]
problematic until Swing or AWT are fixed. Hope that helps. Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA +1 408.962.5487 (phone) +1 408.210.3659 (mobile

RE: 16 bit unicode

2002-02-18 Thread Addison Phillips [wM]
with your confusion about 16- or 8-bit flavors of Unicode in databases (http://www.inter-locale.com/IUC19.ppt) Best Regards, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA +1 408.962.5487 (phone) +1

RE: ICU website

2002-02-09 Thread Addison Phillips [wM]
The server changed to www-124.ibm.com Best Regards, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA +1 408.962.5487 (phone) +1 408.210.3659 (mobile

RE: Unicode and Security: Domain Names

2002-02-07 Thread Addison Phillips [wM]
creating I18n domain names. The early implementers have mostly ignored these issues and the interplay between the ultimate standard and existing registrars should be interesting. Regards, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc

RE: Arabic and Hindi digits, what to store ?

2002-01-26 Thread Addison Phillips [wM]
Regards, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. | The Business Integration Company 432 Lakeside Drive, Sunnyvale, California, USA +1 408.962.5487 (phone) +1 408.210.3569 (mobile

RE: Microsoft's Japanese IME has no Unicode option

2002-01-25 Thread Addison Phillips [wM]
. Note that adding a META tag to your page is a very good idea if you decide to use UTF-8 as the encoding. You can see that this works here: http://www.geocities.com/apphillips2000/index.html You will note that I included a META tag. Otherwise you have to manually select UTF-8 as the pag

RE: Allow to type URL in Unicode

2002-01-23 Thread Addison Phillips [wM]
can make things work in most cases with a little care. Best Regards, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. | The Business Integration Company 432 Lakeside Drive, Sunnyvale, California, USA +1 408.962.5487 (phone) +1

RE: Unicode Search Engines

2002-01-16 Thread Addison Phillips [wM]
hat I know well (I did some I18N work for them) is AltaVista. If you go to the search page and click on "Customize Settings", you can set the search engine to use UTF-8 as your search encoding (both input and output). The back end is entirely Unicode. Regards, Addison Addison P. P

RE: How to print the byte representation of a wchar_t string with non -ASCII ...

2001-11-02 Thread Addison Phillips [wM]
Hi William, The third case rests on how the compiler interpreted the string at compile time. What's the encoding of your source file? What was the encoding of the locale at compile time? Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc

RE: Right-to-left whitespace?

2001-10-08 Thread Addison Phillips [wM]
, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA +1 408.962.5487 (phone) +1 408.210.3569 (mobile) - Internationalization is an architecture. It is not a feature

RE: Unicode characters in applet.

2001-10-04 Thread Addison Phillips [wM]
bably font releated. If you check out http://www.unicode.org/help/display_problems.html you will see information on how to turn those infuriating hollow boxes into the characters you expected to see the first time. Addison Addison P. PhillipsGlobalization Architect / Manager, Glo

RE: surrogate at java's property file

2001-10-04 Thread Addison Phillips [wM]
has been adopted. It should be interesting. Best Regards, Addison -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On Behalf Of Carl W. Brown Sent: Thursday, October 04, 2001 5:37 PM Cc: [EMAIL PROTECTED] Subject: RE: surrogate at java's property file Addison

RE: surrogate at java's property file

2001-10-01 Thread Addison Phillips [wM]
to be a 32-bit value and switch Java to UTF-32 (yeah, sure.) Best Regards, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA +1 408.962.5487 (phone) +1 408.210.3569 (mobile

RE: surrogate at java's property file

2001-10-01 Thread Addison Phillips [wM]
But then, it's my day to be an idiot... Of course an int can store more than 16 bits. It's char that's defined at 0..65535 in Java. int's will work fine in the APIs. It's the chars that are a problem. Must be the heat. ;-) Addison Addison P. Phillips Globalization Architect / Manager

CESU-8: to document or not

2001-09-17 Thread Addison Phillips [wM]
, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA +1 408.962.5487 (phone) +1 408.210.3659 (mobile) - Internationalization is an architecture

RE: Adding fonts to JVM

2001-09-04 Thread Addison Phillips [wM]
Hi Sameer, The "font list" in your JVM comes from the operating system. The list you see is what the JVM thinks you have installed locally. To add a font to the list, you have to install the font. Best Regards, Addison Addison P. PhillipsGlobalization Architect / Manager, Glo

RE: japanese xml

2001-08-30 Thread Addison Phillips [wM]
outside EUC-JP represented as NCRs, and our parser handles that quite well... Addison -Original Message- From: Ayers, Mike [mailto:[EMAIL PROTECTED]] Sent: Thursday, August 30, 2001 10:00 AM To: 'Addison Phillips [wM]' Cc: [EMAIL PROTECTED] Subject: RE: japanese xml From: Addison Phillips

RE: Unicode in Asia Question

2001-08-01 Thread Addison Phillips [wM]
, but you should be aware of it as a potential objection that Asian users might have to a pure Unicode approach. Good luck with your implementation. Best Regards, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. 432 Lakeside Drive

RE: RTF language codes

2001-07-23 Thread Addison Phillips [wM]
Hi John, The IDs correspond exactly to Microsoft's (proprietary) LCID (locale ID) codes used in the Windows operating systems. So they are a Microsoft standard. Best Regards, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. 432

RE: Is there Unicode mail out there?

2001-07-11 Thread Addison Phillips [wM]
with getting the settings set to send than with receivers receiving! Best Regards, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA +1 408.962.5487 (phone) +1 408.210.3659 (mobile

RE: MIME standards(RFCs) (was..RE: Unicode, UTF-8.....)

2001-07-11 Thread Addison Phillips [wM]
You're right. I just have those numbers memorized from long usage ::sigh:: Addison -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On Behalf Of Jungshik Shin Sent: Wednesday, July 11, 2001 12:27 PM To: Unicode Mailing List Subject: MIME standards(RFCs

RE: Is there Unicode mail out there?

2001-07-11 Thread Addison Phillips [wM]
the encoding for the reply you can override that behavior. In Outlook 2000 this is Format | Encoding Best Regards, Addison -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On Behalf Of Ayers, Mike Sent: Wednesday, July 11, 2001 12:42 PM To: Unicode List

RE: Unicode, UTF-8 and Extended 8-Bit Ascii - Help Needed

2001-07-10 Thread Addison Phillips [wM]
be in UTF-8. See RFC 1341 and 1342 for the details on how such stuff is labeled. Best Regards, Addison === Addison P. Phillips Manager, Globalization Engineering webMethods, Inc.Globalization Architect

Normalization and the sample code

2001-06-13 Thread Addison Phillips [wM]
and FF3F should become 0060 and 005F, but nothing about 0060 and 005F ultimately. Neither does it handle 0020 + 03xx in any way. So, what's right? 1. I should get the sequence I get; or 2. There is a bug in the code; or 3. There is an omission in the tables. Best Regards, Addison Addison P

RE: Normalization and the sample code

2001-06-13 Thread Addison Phillips [wM]
. The tables and charts, of course, are correct, but it took a lot of spelunking to figure out that I wasn't crazy and #2 below was what it was. Best Regards, Addison Addison P. Phillips Globalization Architect / Manager, Globalization Engineering webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA +1

Re: UCN (Java) notation beyond the BMP

2001-05-22 Thread addison
meaningfully!) Best Regards, Addison === Addison P. Phillips Globalization Architect webMethods, Inc http://www.webmethods.com Sunnyvale, CA, USA mailto:[EMAIL PROTECTED] +1

  1   2   >