Below is what I got (php5.4):
1)
Script is in utf-8 with BOM encoding. zend.multibyte is enabled 
(to use this feature, mbstring extension must be enabled).
<?php
echo strlen('ы'); //russian letter
$str = 'aa';
echo "\n" . strlen($str);
?>

Output:
1
2

2)
Script is in utf-8 encoding. zend.multibyte is disabled.
<?php
echo strlen('ы');
$str = 'aa';
echo "\n" . strlen($str);
?>

Output:
2
4

  "Sherif Ramadan" <theanomaly...@gmail.com> сообщил/сообщила в новостях 
следующее: 
news:cal+t6cgpn+vnzak+g5i-c4cjllsorweal7rbhco38-t_7ow...@mail.gmail.com...





  On Sun, Jun 16, 2013 at 3:51 AM, Andrey Pashentsev <avp200...@mail.ru> wrote:


    "Adam Kazimierczak" <kazym...@gmail.com> сообщил/сообщила в новостях 
следующее: news:51ba2cc5.3060...@gmail.com...


      Hello Mailing List,

      I found that strlen is often considered as a function that counts
      characters in strings, rather than bytes. Such mistake is even often
      seen on blogs or tutorials, so many PHP beginners may get confused when
      they try to strlen a multi-byte characters string.

      I am attaching a patch proposition where the description notes that
      length is measured in bytes and adds an example showing results of
      strlen used on multi-byte character strings.

      Please let me know what do you think about this.

      Cheers,
      Adam




    Hi Adam



      I found that strlen is often considered as a function that counts
      characters in strings, rather than bytes.



    Don't forget about zend.multibyte feature...

    If it is on and you have a script in 'utf-8 with BOM' encoding
    php will detect it and strlen will take this into account.




  zend.multibyte enables the PHP engine to parse source files that are in 
multibyte encodings. It has nothing to do with the behavior of strlen, nor does 
it change the behavior of the function.


    Cheers,
    Andy



Reply via email to