Karlsson Kent - keka wrote on 2000-10-10 16:31 UTC:
> > The ISO TR 14652 transliteration mechanism (which is already partly
> 
> There is no such TR!!!  There have been drafts for such a TR, and I guess
> the editor is still working on it.  But there is opposition to progressing
> 14652 any further.  It was demoted from standards track to report track
> to make it less of an issue.  However, it is not a TR yet, and it might
> never become a TR.

I myself have problems with parts of these drafts (it contains lots of
utterly useless stuff on postal address formats, etc.), but the general
idea of having a transliteration mechanism (automatic wide string
substitution applied before the actual multi-byte encoding in the
multi-byte encoding functions) seems to me highly useful and desirable
for a variety of purposes. There certainly are ways in which the precise
wording in the document on how it should be implemented can be made much
more precise.

A fundamental problem that I have with transliteration is that I might
want to restrict its application narrower than just the locale. For
example, I am happy with the idea that LC_CTYPE sets my character
encoding for everything (files, filenames, terminal I/O, environment
variables, etc.), because think/hope that all this will go to be UTF-8
soon anyway. However, I might not want to have transliteration applied
everywhere, because it is a lossy conversion process. Transliteration is
fine for texts at the end of the display pipeline, but it might be much
less desireable to file I/O (depends on the application context
however). A facility that allows the application to selectively activate
and deactivate transliteration would seem sensible here. This could be
done as a stream attribute, but it could also be done by separating
transliteration out of LC_CTYPE into LC_TRANSLIT as Keld suggested.

Keld, what is the current status of the draft, where does the latest
working copy live (URL?), etc.? Can we make this issue a bit more
transparent for the members of linux-utf8? What are the 14652 mailing
lists that we can join? Thanks!

Markus

-- 
Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK
Email: mkuhn at acm.org,  WWW: <http://www.cl.cam.ac.uk/~mgk25/>

-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/

Reply via email to