On Sun, Jun 19, 2011 at 10:07:19AM -0400, erik quanstrom wrote:
> >
> > Why latin1? Not only because, as a French, I use it, but because it is
> > compatible with unicode.
>
> perhaps you mean the subset of unicode corresponding to the codepoints
> encoded by latin1 encoded in utf-8. the system character set is utf-8,
> and latin1 is not a compatable encoding. utf-8 is assumed everwhere except
> when the data is inbound, and explicitly tagged as having a different
> caracter set. programs like upas/fs and webfs do the conversion at the
> border.
>
> there's really no reason for latin1 in 2011.
There is a reason here: for now, TeX is 8 bits and that's all. So, if
allowing to use, at least, all of the 8 bits means something, it shall
be latin1. This does not prevent somebody to use whatever character set
one wants; but as a default, and _for now_, it's better than nothing;
and significantly better than some random character set that no tcs(1)
will know how to deal with.
To accept directly utf-8 as input will not be addressed for the 1.0
release of kerTeX.
And if people think that I'm too slow: be my guest. I claim it is easier
to tackle the task with kerTeX, than with TeXlive.
--
Thierry Laronde <tlaronde +AT+ polynum +dot+ com>
http://www.kergis.com/
Key fingerprint = 0FF7 E906 FBAF FE95 FD89 250D 52B1 AE95 6006 F40C