On Tue, 23 Jan 2001 16:49:29 +0000, John Delacour wrote:

>All the character sets are mapped to Unicode at <ftp://unicode.org/> in the mappings 
>directory of the public directory.  I can't give you the full URL because it's 
>case-sensitive and I forget what their whim was when they created the directories.

The FTP no longer seems to work. (It didn't when I tried it.) But http
does:

        <http://www.unicode.org/Public/MAPPINGS/VENDORS/>

In detail:

For Windows, look at 
<http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1252.TXT>

For the Mac:
<http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/ROMAN.TXT>

And, to be pedant, you don't want the Window character set to turn into
HTML, but ISO-Latin-1. That character set fully agrees with the lower
256 codes of Unicode. So, just a conversion from Mac to Unicode, is
enough.

And, for those characters that don't exist in ISO-Latin-1: you can still
use numerical entities, with Unicode character number, and it'll work.
For example, the bullet is 0x2022, or 8226 in decimal. Well: both
"&#x2022;" and "&#8226;" work, even on a PC. Tested with Netscape 4.76
and MSIE5.0.

-- 
        Bart.

Reply via email to