On Tue, 23 Jan 2001 16:49:29 +0000, John Delacour wrote:
>All the character sets are mapped to Unicode at <ftp://unicode.org/> in the mappings
>directory of the public directory. I can't give you the full URL because it's
>case-sensitive and I forget what their whim was when they created the directories.
The FTP no longer seems to work. (It didn't when I tried it.) But http
does:
<http://www.unicode.org/Public/MAPPINGS/VENDORS/>
In detail:
For Windows, look at
<http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1252.TXT>
For the Mac:
<http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/ROMAN.TXT>
And, to be pedant, you don't want the Window character set to turn into
HTML, but ISO-Latin-1. That character set fully agrees with the lower
256 codes of Unicode. So, just a conversion from Mac to Unicode, is
enough.
And, for those characters that don't exist in ISO-Latin-1: you can still
use numerical entities, with Unicode character number, and it'll work.
For example, the bullet is 0x2022, or 8226 in decimal. Well: both
"•" and "•" work, even on a PC. Tested with Netscape 4.76
and MSIE5.0.
--
Bart.