Bob Stayton wrote:
Thanks, that clarifies the situation. This seems to be a two-byte
encoding, perhaps specific to Microsoft's Symbol font? I thought the
Symbol font was single byte, so I'm not understanding those numbers.
Anybody else recognize this?
There is something of an explanation here:
http://scripts.sil.org/fontfaq_unicodeword
Specifically:
------------------
To further complicate the picture, there are two different ways to
encode 8-bit fonts: as normal text fonts, called UGL, or as symbol
fonts. Most fonts containing alphabetic characters (e.g., Times New
Roman, Arial) are encoded as UGL fonts. Fonts containing symbols (e.g.
Wingdings) are typically encoded as symbol fonts. Word 97/2000 uses two
different translation schemes between Unicode values and 8-bit values,
depending on whether the font used for the text in question is a UGL
font or a symbol font. If the font is a UGL font, Word 97/2000 converts
the characters between the standard 8-bit and Unicode values defined by
the active codepage. No such standard conversion exists for symbols,
however, so if the font is a symbol font, Word 97/2000 converts the
characters to a different set of Unicode values in what is called the
“Private Use Area” (PUA) of Unicode.
---------------------
There's a PDF link at the bottom of the page which goes into more details.
--
Mike Maxwell
What good is a universe without somebody around to look at it?
--Robert Dicke, Princeton physicist
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]