Henry Spencer wrote on 2003-04-01 16:31 UTC: > On Tue, 1 Apr 2003, Jungshik Shin wrote: > > >>If there is an urgent need for this in other scripts... > > >Not in Latin-alphabet text generally. Writing systems that have > > >such needs include Vietnamese, IPA, Math, Polytonic Greek, > > > > Does Vietnamese need diacritic marks ? Sure, it does, but > > I think all it needs are encoded as precomposed... > > As I understand it, the usual written forms of Vietnamese explicitly need > multiple marks per letter; there are no precomposed forms for that.
As I understand it, ISO 10646 contains all the precomposed characters necessary to write modern Vietnamese. The subset of ISO 10646 necessary (for both precomposed and decomposed encoding) is identified in Vietnamese standard TCVN 6909:2001 and comprises the 240 UCS characters # Plane 00 # Rows Positions (Cells) 00 20-7E A0 C0-C3 C8-CA CC-CD D2-D5 D9-DA DD E0-E3 E8-EA EC-ED F2-F5 00 F9-FA FD 01 02-03 10-11 28-29 68-69 A0-A1 AF-B0 03 00-03 06 09 1B 23 1E A0-F9 20 1C-1D You can drop the U+03xx characters from that set if you use only the precomposed encoding. Source: http://www.undp.org.vn/unicode/ Markus -- Markus Kuhn, Computer Lab, Univ of Cambridge, GB http://www.cl.cam.ac.uk/~mgk25/ | __oo_O..O_oo__ -- Linux-UTF8: i18n of Linux on all levels Archive: http://mail.nl.linux.org/linux-utf8/
