Re: New Unifont release

Markus Kuhn Tue, 10 Jul 2001 03:48:25 -0500

On Tue, 10 Jul 2001, David Starner wrote:
> Also, the hex2bdf now produces two bdf
> files, since the old bdf file was incorrectly labeled a charcell font
> while having multiple widths for characters. This produced problems in
> xterm, among others. (Some versions of Yudit that previously showed all
> the characters may only show the single-width characters now. Since
> xterm is correct and more important than Yudit, the change was made.)


I strongly recommend that you follow the practice we established in
XFree86 for the -misc-fixed-*-iso10646 fonts and split up GNU Unifont into
two separate charcell font files, one 8x16 and one 16x16. This way, the
width decision becomes factored out of the font and you can use the fonts
with all the different wcwidth conventions that are currently in use. Some
Japanese folks do insist on their double-width cyrillic and box drawings
for backwards compatibility. Even if they agree with Europeans in
principle that double-width Greek/Cyrillic is typographically nonsense and
even considering that JIS standards don't define any charcell width
convention, some don't want to change the kterm-convention layout of their
existing EUC plaintext files.

> Devanagari

We deliberately kept the Indic glyphs out of the -misc-fixed-* fonts,
because Unicode encodes characters, not glyphs, and for Indic, Syriac,
etc. the difference does matter significantly. I doubt whether the
Devanagari in GNU Unifont is of any practical use whatsoever at the
moment.

The hex source format and tools for Unifont should definitely be extended
to allow a glyph to be assigned to several sequences of UCS characters,
not just as currently to one single UCS character per glyph.

> Known Issues: We still have circles on combining characters

Those definitely should be removed. The circles are just technical
annotations in the Unicode book. Combining characters should also be
placed such that simple overstriking (as currently implemented in xterm)
places them in an acceptable position onto both lower and upper case
characters. This is what we did for the 9x18 font in the -misc-fixed-*
collection.

You can also implement combining characters are precomposed ligatures of a
base character and one or more combining characters. This is a good idea
for those base/accent combinations, where simple overstriking leads to
unpleasant results.

> Some of the ideographs use all four
> edges of the 16 by 16 square, meaning they may blur together when used
> in many programs.

To make them readable, programs have to add 1-2 empty pixel rows between
the glyphs. But this breaks up box drawing chartacters. Therefore, in the
-misc-fixed-* font project, we went for 9x18 and 18x18 as the format of
what we aim to be the most comprehensive Unicode pixel font on the planet,
instead of 8x16/16x16. (Another reason was that 9x18 has exactly the
additional head/foot space that the existing 9x15 needed for the use of
overstriking combining accents.) We also split up the 18x18 font into CJK
style variants.

Currently, 9x18/18x18 covers 31053 Unicode characters (including full
MES-3A/B coverage and all other major sets except for BIG5), or 51527
glyphs (including overlaps). Contributions are welcome ...

http://www.cl.cam.ac.uk/~mgk25/ucs-fonts.html

Markus

--
Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK
Email: mkuhn at acm.org,  WWW: <http://www.cl.cam.ac.uk/~mgk25/>

Re: New Unifont release

Reply via email to