On Sun, 2004-01-11 at 18:27, Keld Jørn Simonsen wrote:
> Well, you can never tell. I know that Sc2/WG2 has said that they will
> never allocate something above the 21th bit,

They have mentioned that in the text of the ISO/IEC 10646 itself.

> but then again they said
> they would never reallocate characters, and then they did it anyway.

Recently?!!

> I would say: "be liberal in what you accept, and conservative in what
> you generate", and thus accept valid UTF-8 until the 31 bit.

That is not necessarily good advice in security issues.

> I also think there is code around to handle full UTF-8, so that is not
> an extra burden to do it.

UTF-8 is so easy it won't be a problem.

roozbeh

--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/
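
To make the two positions in this exchange concrete, here is a minimal Python sketch (not taken from the thread) of a UTF-8 decoder whose upper limit is a parameter. The names `decode_utf8` and `max_code_point` are hypothetical. With the default limit it accepts only the 21-bit range up to U+10FFFF; with `0x7FFFFFFF` it accepts the older 31-bit, up-to-six-byte form. It also rejects overlong encodings and surrogates, which are commonly cited security-related checks, although the message above does not say which concern was meant.

```python
def decode_utf8(data: bytes, max_code_point: int = 0x10FFFF) -> list:
    """Decode UTF-8 bytes into a list of code points, rejecting malformed input.

    max_code_point=0x10FFFF restricts decoding to the 21-bit range;
    max_code_point=0x7FFFFFFF also accepts 5- and 6-byte sequences.
    """
    # (lead-byte mask, lead-byte pattern, continuation bytes, smallest legal value)
    forms = [
        (0x80, 0x00, 0, 0x0),
        (0xE0, 0xC0, 1, 0x80),
        (0xF0, 0xE0, 2, 0x800),
        (0xF8, 0xF0, 3, 0x10000),
        (0xFC, 0xF8, 4, 0x200000),
        (0xFE, 0xFC, 5, 0x4000000),
    ]
    out = []
    i = 0
    while i < len(data):
        lead = data[i]
        for mask, pattern, extra, minimum in forms:
            if (lead & mask) == pattern:
                break
        else:
            raise ValueError(f"invalid lead byte 0x{lead:02X} at offset {i}")

        # Payload bits of the lead byte are the ones outside the mask.
        value = lead & ~mask & 0xFF
        tail = data[i + 1 : i + 1 + extra]
        if len(tail) != extra:
            raise ValueError(f"truncated sequence at offset {i}")
        for b in tail:
            if (b & 0xC0) != 0x80:
                raise ValueError(f"bad continuation byte at offset {i}")
            value = (value << 6) | (b & 0x3F)

        if value < minimum:
            raise ValueError(f"overlong encoding at offset {i}")
        if 0xD800 <= value <= 0xDFFF:
            raise ValueError(f"surrogate code point at offset {i}")
        if value > max_code_point:
            raise ValueError(f"code point above limit at offset {i}")

        out.append(value)
        i += 1 + extra
    return out
```

For example, `decode_utf8(b"\xf4\x90\x80\x80")` raises `ValueError` because the sequence encodes 0x110000, while `decode_utf8(b"\xf4\x90\x80\x80", max_code_point=0x7FFFFFFF)` returns it. Keeping the limit as a parameter is only an illustration device; a real decoder would normally fix one policy.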
