[EMAIL PROTECTED] wrote:
No. These are at most the building blocks for braille. A better parallel
would be to consider these "presentation glyphs" for braille. (But I think
that the main reason why these patterns are in Unicode is to encode runs of
braille-looking characters in didactic texts
Halldor G. Gestsson:
Can I find a list where all languages supported in the basic latin
(0x-0x00FF)?
[...]
Wich languages uses the latin extensions A,B and C?
Page http://www.eki.ee/letter/ contains the information to build your
lists.
_ Marco
As an aside:
Are there good (authorative) references on the so called
swiss numerical format with its peculiar thousand separator?
I only know about a manual shipped with some Aldus software product
as a reference. I own several books printed in Switzerland and they
show the typical swiss
Jörg Knappen wrote:
Are there good (authorative) references on the so called
swiss numerical format with its peculiar thousand separator?
Why not comparing the locale settings of main operating systems? I think
that at least WinNT, Apple, Linux, and other Unixes are widely represented
on this
Steven R. Loomis wrote:
[...] Presumably the unicode codepoints in braille
would make a great format for these translations on their way to a
printer. One would hope they would get such use and not simply for
braille-looking characters on paper or screen.
You are right, I didn't catch it:
How do you sort text with some in Roman and some
in non-Roman alphabets? Currently, I'm just romanizing
everything but I don't know if that is that good.
Should I just kanize Japanese?
I would love a system that just goes by characters,
and I would much prefer it if the Han digits collated
in
See UTR #10 (Unicode Collation Algorithm) at
http://www.unicode.org/unicode/reports/tr10/
for a very firm report on how you should indeed be handling collation.
At some level, as I mentioned to you earlier, there are many Latin sorts
that contradict other Latin sorts; in THOSE cases a decision
Halldor G. Gestsson wrote:
Can I find a list where all languages supported in the basic latin
(0x-0x00FF)?
E.g:
Basic Latin support
English
German
Spanish
French
Danish
Icelandic
... etc. etc.
What is your
If it comes to coining a new term, I'd like to propose "oligograph"...
There are really just a few, not many, characters to form such a
compound -- and we would avoid those puns on lying-detectors ;-)
Not a bad idea, methinks.
- Peter
Robert Lozyniak wrote:
How do you sort text with some in Roman and some
in non-Roman alphabets?
I never sort texts, only lists of items (words,
names, titles, whatever).
Depending of the ratios, I see two main solutions:
- if Latin is the most current, _and_ only other Greek-
derived
Doug Ewell wrote:
Can anyone comment on this? If RFC 1766 can realistically be read as
requiring outdated versions of ISO 639 and 3166, then it seems that UTR
#7 should be updated to bypass RFC 1766 entirely and refer directly to
ISO 639 and 3166.
With all due respect to all participants,
the claim that RFC 1766 freezes obsolete versions
Actually, the claim is that RFC 1766 could be interpreted that way, not that
it is actually trying to say so. The RFC author's recent statement of intent
clarifies that a more lenient interpretation is prudent.
The reason it is important to
Marco said:
These are at most the building blocks for braille. A better parallel
would be to consider these "presentation glyphs" for braille. (But I think
that the main reason why these patterns are in Unicode is to encode runs of
braille-looking characters in didactic texts for *sighted*
Roozbeh Pournader wrote:
That seems problematic to me, when used for Arabic. How should one use
ZWNJ between two Arabic letters to stop the ligature? The'll get
disconnected!
Good point.
ZWJ+ZWNJ+ZWJ comes to mind, but it is really not the maximum of elegance...
_ Marco
On Thu, 10 Aug 2000 [EMAIL PROTECTED] wrote:
ZWJ+ZWNJ+ZWJ comes to mind, but it is really not the maximum of elegance...
No! Please! I have a lot of difficulties forcing old staff to use Unicode,
add this and they will escape. ;)
This surely creates many many problems.
--roozbeh
You have a good point: does nu-alpha-tau-alpha-sigma-alpha
spell "Natasa" or "Natasha"? The Greek letters given
are obviously an attempt to write "Natasha" in Greek,
but they romanize to "Natasa".
And a, b, c, d, e, f, g, h, ... HATES a, i, u, e,
o, ka, ki, ku, ...
Maybe I should just
I bet few things would be rarer than,
say, a Georgian female rap CD in the US!!
Tobacco chewing killer whales in Picadilly Circus, surely.
Would somebody PLEASE tell me, IN THE DEFAULT UNICODE
COLLATION ALGORITHM, WHAT COMES AFTER WHAT?!
Read the technical report! (It's available
On Thu, 10 Aug 2000 [EMAIL PROTECTED] wrote:
Thanks! The - Iran System -: that's what I meant. It was on the tip of my
tongue, but I could not recall the name (BTW, is it also called ISIRI?).
ISIRI is the Iranian standards organization. ISIRI 2900 was the old
charset standard, and ISIRI 3342
Once again, if collation info is what you want, see
http://www.unicode.org/unicode/reports/tr10/
Beyond that, it is unclear what you are looking for, really. But if you were
to actually read and try to understand that document, I am fairly certain
that one of two things will happen:
1) You
At 09:36 AM 8/10/00 -0800, Roozbeh Pournader wrote:
That seems problematic to me, when used for Arabic. How should one use
ZWNJ between two Arabic letters to stop the ligature? The'll get
disconnected!
(in those rare cases...)
Use ZWJ ZWNJ ZWJ and you will get the intended effect.
A./
On Thu, 10 Aug 2000, Asmus Freytag wrote:
Use ZWJ ZWNJ ZWJ and you will get the intended effect.
Technical Vice President
The Unicode Consortium
Official answer?! too bad for us...
Anybody happen to know: Is there no Win32 API that allows you to determine
a codepage given a LANGID or a charset value (i.e. one of the two
parameters provided by WM_INPUTLANGCHANGE)?
- Peter
---
Peter Constable
Yes, there is a way to extract this info using GetLocaleInfo. Pass the LCID
or langid with one of the following params:
LOCALE_IDEFAULTANSICODEPAGE
LOCALE_IDEFAULTCODEPAGE
LOCALE_IDEFAULTEBCDICCODEPAGE (win2k only)
LOCALE_IDEFAULTMACCODEPAGE
michka
Michael Kaplan
Trigeminal Software, Inc.
Sorry about the duplicate message. Notes was hanging on me, and I thought
the first one was lost.
On 08/10/2000 04:36:32 PM Peter Constable wrote:
Anybody happen to know: Is there no Win32 API that takes a LANGID and
returns a codepage? Or, alternately, that takes a charset value and
returns
Windows 2000 actually has a Georgian keyboard layout, and a font that many
people in Georgia (and also I) find more visually pleasing than Arial
Unicode MS (Sylfaen). See
http://www.microsoft.com/globaldev/keyboards/keyboards.asp
for the layout. Note that the keyboard layout and font only
I should add that for that project, since they did not have Windows 2000 but
instead had NT4, the actual work ended up being to support a Georgian
keyboard layout on NT4 (the argument was that as long as it was mucking with
the uppercase, why not muck with the lowercase, too? g). Mainly
26 matches
Mail list logo