Hello,

For those of you not already on the Unicode mailing list, I thought you would like to be aware of www.bytext.org. Bytext has a much better design than Unicode and is a better long-term solution. One of its main features is that it is designed to be searchable with fast 8-bit regular expression algorithms. You may want to build some flexibility into your implementation of UTF-8 to deal with Bytext, or perhaps even give up on UTF-8 altogether if it's possible for you to focus on the long term.

Sincerely,
Bernard Miller

--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
