[PHP-I18N] Multi-byte strings and Japanese characters

Jonas Koch Bentzen Thu, 26 Sep 2002 04:28:19 -0700

http://dk.php.net/manual/en/ref.mbstring.php claims that "a multi-byte 
character string may be destroyed when it is divided and/or counted 
unless multi-byte character encoding safe method is used". I've just run 
some tests with Unicode and Japanese characters (copied from 
http://unicode.org/unicode/standard/translations/japanese.html). I used 
functions like preg_match(), strlen(), and substr(), and no matter what 
I can't seem to break the Japanese strings. Which leads to my question: 
Is it really necessary to use functions like mb_substr() instead of 
substr(), mb_strlen() instead of strlen(), etc.? Does anyone have any 
examples of strings that would actually break if you use preg_match(), 
substr(), strlen() or similar functions on them?



-- 
PHP Internationalization Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php

[PHP-I18N] Multi-byte strings and Japanese characters

Reply via email to