Hi, this is my paper for the Unicode conference.
Please check it over for accuracy.
The Nov. changes muddied it a bit, but I commented on the changes during the
presentation.
http://www.i18nguy.com/unicode/Unicode-Enabling%20PHP-Mar%202006.pdf (5MB)
It displays slowly for some reason.
Tex
-
Suggestion for improving the performance of indexing strings:
Associate with the string the index of the first code unit that is a
surrogate.
Since most strings will have no surrogates, these strings will have a value
greater than the length of the string, and this tells you that you can index
dir