http://dk.php.net/manual/en/ref.mbstring.php claims that "a multi-byte character string may be destroyed when it is divided and/or counted unless multi-byte character encoding safe method is used". I've just run some tests with Unicode and Japanese characters (copied from http://unicode.org/unicode/standard/translations/japanese.html). I used functions like preg_match(), strlen(), and substr(), and no matter what I can't seem to break the Japanese strings. Which leads to my question: Is it really necessary to use functions like mb_substr() instead of substr(), mb_strlen() instead of strlen(), etc.? Does anyone have any examples of strings that would actually break if you use preg_match(), substr(), strlen() or similar functions on them?
-- PHP Internationalization Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php