This is by design. If you prefer to work with actual bytes, use binary strings or literals. In Unicode strings, \xC4 is not a byte but a codepoint (a UTF-16 codepoint) specifying the character U+00C4.
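
To illustrate the distinction, here is a rough sketch in Python rather than PHP, on the assumption that Python's str/bytes split behaves analogously to Unicode vs. binary strings here. The variable names are made up for the example.

```python
# Python analogue of the Unicode-string vs. binary-string distinction.
# This is an illustration under the assumption that Python's str/bytes
# behave like the Unicode/binary strings discussed here; it is not PHP code.

# In a text (Unicode) string, each \xNN escape names a code point.
text = "\xC4\x85"            # two code points: U+00C4 and U+0085

# In a bytes literal, each \xNN escape names a single raw byte.
raw = b"\xC4\x85"            # two bytes: 0xC4 0x85

text_len = len(text)                         # counts code points
text_utf8 = text.encode("utf-8")             # each code point becomes 2 bytes
text_utf8_len = len(text_utf8)               # counts bytes after encoding

raw_len = len(raw)                           # counts bytes
decoded = raw.decode("utf-8")                # bytes C4 85 form one character
decoded_len = len(decoded)                   # counts code points

print(text_len, text_utf8_len, raw_len, decoded_len)
```

The text string has two codepoints and becomes four bytes when converted to UTF-8, while the two raw bytes C4 85 decode to the single character U+0105.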

-Andrei


On May 19, 2007, at 8:48 AM, Tomas Kuliavas wrote:

strlen("\xC4\x85") = 2. strlen((binary)"\xC4\x85") = 4. Not good. It is
one character in UTF-8.

--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: http://www.php.net/unsub.php
