Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-10-03 Thread Tomohiro KUBOTA

Hi,

At Wed, 3 Oct 2001 16:58:36 +1000 (EST),
Jim Breen [EMAIL PROTECTED] wrote:

 [David Starner (Re: [I18n]ISO 10646 Fonts and XFontSet question) writes:]
   Would it be possible to give a quick rundown on what the
  Monbushou rules are?
 
 Well, a quick overview.
 
 - from the 1890s there were rumbles in educational circles in Japan about
 the difficulty of learning so many kanji, and the desirability to reduce
 the working set to a reasonable number. There were also concerns about the
 spelling af the kana representation of the readings of kanji
(snip)

Thank you for your report.  You know very well (well, bettern than I)
about written or official rules.

However, you don't know well on the real state of average Japanese
people, which cannot be studied from literatures...

 The limited set of kanji has survived pretty well. Only a small percentage 
 of Japanese know many kanji outside that list, and often they are just 
 for names.

Many (I think majority part of) Japanese people know Kanjis outside
the rules of Monbushou.  I think newspapers and so on limit the set
of usable Kanjis to these about 2000 Kanjis because they should
be read by everyone.  Of course it is unlikely that every Japanese
are perfect Kanji students.

---
Tomohiro KUBOTA [EMAIL PROTECTED]
http://www.debian.or.jp/~kubota/
Introduction to I18N  http://www.debian.org/doc/manuals/intro-i18n/
___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-10-02 Thread Pablo Saratxaga

Kaixo!

On Tue, Oct 02, 2001 at 02:15:36PM +0900, Tomohiro KUBOTA wrote:

   Thus, I think we need bi-width or variable-width fonts which covers
  
  You only need that for charcell fonts. I have already seen Japanese text
 
 My intention is mainly on variable-width fonts.  So far XFree86

I mean for variable width fonts you don't need to worry about single width
and double width variants of latin/greek/cyrillic/katakana scripts; that
concept simply doesn't apply for variable width fonts.

 In short, I want an iso10646 substitution of a font named variable.

Is that alias already in use ?
I would prefer sans as the default variable width font. 

-- 
Ki ça vos våye bén,
Pablo Saratxaga

http://www.srtxg.easynet.be/PGP Key available, key ID: 0x8F0E4975
___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-10-02 Thread Tomohiro KUBOTA

Hi,

At Wed, 3 Oct 2001 10:37:17 +1000 (EST),
Jim Breen [EMAIL PROTECTED] wrote:

 Of course we can all (well, 99.9% of us 8-} ) whip out lists of common words
 that contain non-Jouyou kanji, some of which didn't even make Level 1. 
 Donburi-no-don is another obvious one. I usually see kampeki written with the
 peki in kana for this reason. (Monbushou rules.)

I know the Monbushou rules.  For example, newspapers admit it.
However, Average Japanese people can read it and daily publications
such as novels usually use the Kanji.  Since I am a native Japanese
speaker who live in Japan, I know not only official rules but also
the real state of Japanese language.

Many people think the Monbushou rules is too strict not to use Kanji.
One of the role of the mixture of Kanji and Hiragana is like a whitespace
between words in English.  Writing too many words in Hiragana or
a part ofwor ds inHiraga na isconf usi ng like this.

Of course I don't insist that there are no Japanese people who
like the Monbushou rules.

---
Tomohiro KUBOTA [EMAIL PROTECTED]
http://www.debian.or.jp/~kubota/
Introduction to I18N  http://www.debian.org/doc/manuals/intro-i18n/
___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-10-01 Thread Jungshik Shin

On Mon, 1 Oct 2001, Jim Breen wrote:

 [Keith Packard (Re: [I18n]ISO 10646 Fonts and XFontSet question) writes:]
 
  Han unification arguably makes these four character sets the most
  problematic.  What this means is that font selection must be done by a

 Problematic, yes, but is must be addressed cleanly and effectively.
 Arguments over glyphs have bedevilled the acceptance of Unicode in the
 CJK countries, and you still hear people in Japan saying Unicode

  Please, drop 'K' here.  You may even drop 'C'. Most Koreans, if not
all, do NOT care. Unicode has been perceived as 'the  way' to go not
only for multilingual use but also for Korean alone in Korea.

  Jungshik Shin

___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-10-01 Thread Jim Breen

[David Starner (Re: [I18n]ISO 10646 Fonts and XFontSet question) writes:]
 Someone (Jim Breen?) pointed out the extreme level to which Microsoft
 embeds bitmaps in their scalable fonts for good display. 

I think it was Markus.

Jim

-- 
Jim Breen  [[EMAIL PROTECTED]  http://www.csse.monash.edu.au/~jwb/]
Computer Science  Software Engineering,Tel: +61 3 9905 3298
P.O Box 26, Monash University,  Fax: +61 3 9905 5146
Clayton VIC 3800, Australia  $B%8%`!%V%j!%s(B@$B%b%J%7%eBg3X(B
___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-09-30 Thread David Starner

On Sun, Sep 30, 2001 at 03:44:18PM +0100, Markus Kuhn wrote:
 Usually, users are prefectly able to pick the fonts manually that have the
 coverage and style they really need, and if a glyph is missing from that
 font, it will typically also not be readable for the user. Mixing fonts
 automatically can lead often to far more ugly results than had the
 application just stuck with a single font and used its default glyph for
 everything missing.

What about U+2000-U+27BE? There's a lot characters in there that your
average user would want to see correctly, but might not be in their
fonts. The number forms, the arrows, some of the math symbols, the
currency symbols, the superscripts and subscripts, the general
punctuation, the enclosed alphanumerics, the miscellaneous symbols, and
the dingbats could easily show up one by one in random English text, and
be stuff that the user would want displayed. And most of them wouldn't
suffer that much from coming from a different font.

-- 
David Starner - [EMAIL PROTECTED]
Pointless website: http://dvdeug.dhis.org
I saw a daemon stare into my face, and an angel touch my breast; each 
one softly calls my name . . . the daemon scares me less.
- Disciple, Stuart Davis
___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-09-30 Thread Jim Breen

[Pablo Saratxaga (Re: [I18n]ISO 10646 Fonts and XFontSet question) writes:]
 The problem is not the mixing of a lot of script the problem is that
 even with only two scripts you have big chances to lack a font; for example
 I know of *no* font having glyphs for both Japanese and French.

[Click, click]. OK, I'm now looking at a line of text which has 
Japanese, French and German words. All the acutes, graves, umlauts, etc.
are there. The line is in EUC-JP, and the Latin_with_diacritics
characters have come from JIS X 0212. The .bdf font for JIS X 0212 is
standard in RH7.1 and works out of the box in kterm. OK, I'm using
two font files,  but if I wrote entirely in discriticed letters and
obscure kanji, I could meet Pablo's criterion.  8-)}

Also, I'm seeing the bloated JIS ASCII, so my solution is rather
artificial..

 French studiants of Japanese are continuosly faced with that problem.

Most of the French people I am interacting with about Japanese matters
are using UTF-8. Sadly they mostly are using Windblows.

Jim

-- 
Jim Breen  [[EMAIL PROTECTED]  http://www.csse.monash.edu.au/~jwb/]
Computer Science  Software Engineering,Tel: +61 3 9905 3298
P.O Box 26, Monash University,  Fax: +61 3 9905 5146
Clayton VIC 3800, Australia  $B%8%`!%V%j!%s(B@$B%b%J%7%eBg3X(B
___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-09-30 Thread Jim Breen

G'day

[Tomohiro KUBOTA (Re: [I18n]ISO 10646 Fonts and XFontSet question) writes:]
 At Mon, 1 Oct 2001 12:59:22 +1000 (EST),
 Jim Breen [EMAIL PROTECTED] wrote:
  Arguments over glyphs have bedevilled the acceptance of Unicode in the
  CJK countries, and you still hear people in Japan saying Unicode
  characters look Chinese, despite JIS X 0221 carefully showing each
  ideograph in distinct national glyph styles.
 
 Such arguments is obsolete in Japan.  

Obsolete, yes. But one still hears it and reads it.

 I think nobody in Japan believes
 that usage of Unicode forces Japanese people to use Chinese glyph.

I heard comments to that effect several times in the last 12 months. Not
from up-to-date computer-literate people, but from linguistics people
who had been influenced by the nonsense that had been said and written.
There is at least one vituperative anti-Unicode book out that I glanced
through (the name and author escape me.)

 For example, Japanese version of MS Windows has TrueType Unicode
 fonts with Japanese glyph and it displays Japanese text properly.

Quite true, and although I twinge to say it, Bill's Road-to-Damascus
conversion to Unicode and its subsequent embedding in NT and derivatives
has guaranteed its success in what may have been a difficult battle.
There are still the die-hards, and the BTRON people, but they are
doomed, I hope.

 French people who learn Japanese using Unicode must be careful
 to choose Japanese glyph, especially in case when (s)he doesn't
 have a supervisor who can point out when wrong glyphs are used.

Fortunately the CJK arena seems to be the only one where national glyph
styles are so important. Imagine if we had to provide French-looking
alphabetics, Yiddish-looking Hebrew, Korean-looking hanja, Saudi-looking
Arabic, Uttar Pradesh-looking Devanagari,  etc. etc. all in the one font
file.

Yoroshiku

Jim

-- 
Jim Breen  [[EMAIL PROTECTED]  http://www.csse.monash.edu.au/~jwb/]
Computer Science  Software Engineering,Tel: +61 3 9905 3298
P.O Box 26, Monash University,  Fax: +61 3 9905 5146
Clayton VIC 3800, Australia  $B%8%`!%V%j!%s(B@$B%b%J%7%eBg3X(B
___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-09-30 Thread Tomohiro KUBOTA

Hi,

At Mon, 1 Oct 2001 15:00:23 +1000 (EST),
Jim Breen [EMAIL PROTECTED] wrote:

 I heard comments to that effect several times in the last 12 months. Not
 from up-to-date computer-literate people, but from linguistics people
 who had been influenced by the nonsense that had been said and written.

I imagine the discussion is not that they are forced to use Chinese glyph
but that they don't have way to distinguish Japanese and Chinese glyphs.
This discussion is right and wrong.  We have now language-tag but I 
don't know any products which implements language-tags.

Anyway, not only Unicode but also JIS are obviously insufficient for
serious literature purpose.  However, I think XFree86 doesn't need to
support such serious usage.  Average Japanese people will be satisfied
if Japanese text is written in Japanese glyphs.  Non-negligble number
of Japanese people can read or are studying Chinese and/or Korean and
they need to read Chinese and Korean in Chinese and Korean glyphs.
I think this level of support is adequate for XFree86.

 There are still the die-hards, and the BTRON people, but they are
 doomed, I hope.

Sorry, they are not doomed.  New versions of BTRON are released every
a few years and they are sold well in reality, though I don't have.


 Fortunately the CJK arena seems to be the only one where national glyph
 styles are so important. Imagine if we had to provide French-looking
 alphabetics, Yiddish-looking Hebrew, Korean-looking hanja, Saudi-looking
 Arabic, Uttar Pradesh-looking Devanagari,  etc. etc. all in the one font
 file.

Software developers just have to try to meet the real demands.
CJK glyph distinction is a reality.  (Even this is an opinion
from a specific viewpoint.  Some think that they are different
_character_ and Unicode unified different _characters_.  This
distinction of viewpoint comes from the distinction of viewpoint
about what is glyph and character, which is ambiguous for
CJK Han Ideogram.)  On the other hand, your imagination is
mere an imagination and against the reality.

---
Tomohiro KUBOTA [EMAIL PROTECTED]
http://www.debian.or.jp/~kubota/
Introduction to I18N  http://www.debian.org/doc/manuals/intro-i18n/

___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-09-29 Thread Brian Stell



Keith Packard wrote:
 ...
 Now that's a good idea -- allow the application to hand additional fonts to
 Xft to fill-in the holes.  Applications would then be in control of how
 those additional fonts were selected if they so chose.  Would you like a
 per-font callback invoked when an attempt to access non-existant glyphs was
 made?  Or should the application detect this case before presenting data to
 the library?

The app should be able to present a list but my gut level feeling 
is a callback is needed.

  Regardless of the order in which character are encountered
  the list must always have the same fonts in the same order.
  Moz does this by have a place holder for each font regardless
  of whether a X font was actually needed (eg: loaded). Thus
  the mLoadedFonts list holds the font placeholders and only
  when needed is a font is actually loaded into the placeholder.
 
 Yes, of course.  That would be automatically handled if Xft started
 caching FreeType faces.

This is not a caching issue. The concern is that, if for example 
there are 27 fonts needed to display unicode then regardless of 
whether the 27 font is needed 1st or not, the font placeholders 
in the list are in the same order. Number 27 is always #27 in
the list.

Brian Stell
___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-09-29 Thread Brian Stell



Brian Stell wrote:
 
 Keith Packard wrote:
  ...
  Now that's a good idea -- allow the application to hand additional fonts to
  Xft to fill-in the holes.  Applications would then be in control of how
  those additional fonts were selected if they so chose.  Would you like a
  per-font callback invoked when an attempt to access non-existant glyphs was
  made?  Or should the application detect this case before presenting data to
  the library?
 
 The app should be able to present a list but my gut level feeling
 is a callback is needed.

Additionally: if the app did not define a callback then it would
be nice if Xft could do something reasonable.

-- 
Brian Stell
___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-09-29 Thread Brian Stell



Brian Stell wrote:
 
 Keith Packard wrote:
  ...
  Now that's a good idea -- allow the application to hand additional 
  fonts to Xft to fill-in the holes.  

This is where/why the list of available glyphs in a font is 
needed.

-- 
Brian Stell
___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-09-29 Thread Keith Packard


Around 9 o'clock on Sep 29, Brian Stell wrote:

 Xft is clearly the 1st choice (and perhaps only choice) most 
 single language apps should look at to make this transition.

As Xft can also use core fonts, and will provide a Unicode API for those, 
Xft should be useful even when running on legacy X servers.  Of course, 
the fun TrueType features will all be missing, but at least apps won't 
have to deal with ISO2022...

 Once there is an efficient way to find out which glyphs are
 available, multilingual apps will need code to combine these
 fonts in to a multilingual font (Pango) and do complex text 
 layout (Pango?, FTLayout?, ICU?).

I'm considering whether font merging would be useful in Xft. Once I've
finished adding the Unicode coverage information for each font, adding the
ability to do font merging would be relatively easy. It might extend the
coverage for simple applications, or it just might encourage people to
ignore the harder layout issues which Xft will not address.

What do other people think?  Should Xft do simplistic merging?  Obviously,
there would be a way to disable this as well.

[EMAIL PROTECTED]XFree86 Core Team  SuSE, Inc.


___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-09-29 Thread Brian Stell



Markus Kuhn wrote:
 ...
 Is there already a way to use pixel fonts with render? For some
 applications (terminal emulators with small glyphs most notably, e.g.
 popular with programmers and sys-admins), bitmap fonts will always be
 preferable over TrueType fonts. Please do not neglect them in any future
 font architectures.

Using hand tuned bitmap fonts is a very important consideration 
as they almost always look better than outline rendered bitmaps.
Many people prefer them over anti-aliased fonts at the same
size.

This is why better TrueType fonts have hand tuned bitmaps *in* 
them.

For example, MSGothic (with ~7500 glyphs) has embedded bitmaps 
for sizes 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, and 22.
(I'll bet that took some time.)

-- 
Brian Stell
___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-09-29 Thread Brian Stell



Pablo Saratxaga wrote:
 ...
 If it would also be possible for users (or sys admin) to have the choice
 to merge those default pseudo fonts with real fonts (of the same style),
 that is, all fonts will always have the maximum coverage, the face name
 will only say which real font to get the glyphs to, each time it is  possible; then 
it would be the nearest to perfection for me.

Mozilla supports this now. Earlier this year we added code to
look for fonts with the same foundry/family (eg: adobe-helvetica)
when making the list of fonts. I already see several Japanese
system vendors making an alias-misc font where various JISX0201,
JISX0208, JISX0212, and JISX0213 fonts are aliased to this
common name. They are controlling this merged font.

When I get to my system at work I will send a URL.

Brian Stell
___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-09-29 Thread Keith Packard


Around 16 o'clock on Sep 29, Brian Stell wrote:

 My only word of caution here is that recently people have
 been generating bitmap fonts from outline fonts and installing
 them as bitmap fonts. These are at best only fair and often
 are mediorce. At present I do not know how to distinguish these
 automatically generated bitmaps from hand tuned bitmaps.

Right now, I'd like to get support for bitmaps embedded in TrueType fonts 
which should avoid this particular issue.  I don't think too many TrueType 
fonts embed untuned bitmaps.

 Outline generated bitmaps are not great. AA CJK fonts can be 
 quite usable if not as nice as the hand tuned bitmaps.

The outline CJK fonts I have seen are essentially worthless without AA; at
normal sizes, they are impossible for my eyes to decipher.

[EMAIL PROTECTED]XFree86 Core Team  SuSE, Inc.


___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-09-29 Thread Keith Packard


Around 11 o'clock on Sep 29, Brian Stell wrote:

 Using hand tuned bitmap fonts is a very important consideration 
 as they almost always look better than outline rendered bitmaps.
 Many people prefer them over anti-aliased fonts at the same
 size.

I'll fix Xft to use them; I believe it currently avoids them.  Certainly 
hand-tuned bitmaps are better for non-AA text.  And, hand-tuned bitmaps 
are certainly easier to find than high-quality TrueType fonts, which are 
needed for decent AA text.

For CJK, using bitmaps is even more important; the outlines generate 
illegible results at moderate sizes.

[EMAIL PROTECTED]XFree86 Core Team  SuSE, Inc.


___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n



Re: [I18n]ISO 10646 Fonts and XFontSet question

2001-09-28 Thread Keith Packard


Around 23 o'clock on Sep 28, Juliusz Chroboczek wrote:

 We're aware of that (Tifinagh is the example I like to give).  The
 point Keith was making (if I understood him correctly) was that core X
 fonts should only be used for glyphs covered by legacy encodings.  For
 new glyphs, client-side fonts (Xft or otherwise) should be used
 instead.

Juliusz is mostly correct; the problem is that any fonts encoded in
iso10646-1 must be opened and queried before an application can discover
which glyphs are available.  It is the only common X encoding in use today
that has this property.  For any other encoding, fonts are expected to
provide the complete repertoire of glyphs.  If we use iso10646-1 encodings,
we'll avoid this particular issue.

XLFD has no standard mechanism for specifying which glyphs are available.
As it was developed before Unicode, there was no concept that a font 
wouldn't encode every glyph.

I believe that iso10646-1 encoded fonts can only be effectively used from
the client side; a database can accompany the fonts themselves allowing
applications to perform font selection based on available glyphs.  I hope
to finish work on Xft in the near future which demonstrates how this works.

Any way of making iso10646-1 fonts work reasonably will require a change 
in the X server.  As any such change takes essentially the same amount of 
time to propogate to all X servers, we might as well use the one which 
solves a host of other issues at the same time (Render).

[EMAIL PROTECTED]XFree86 Core Team  SuSE, Inc.


___
I18n mailing list
[EMAIL PROTECTED]
http://XFree86.Org/mailman/listinfo/i18n