ID:               41588
 Updated by:       [EMAIL PROTECTED]
 Reported By:      spam02 at pornel dot net
-Status:           Open
+Status:           Closed
 Bug Type:         Documentation problem
 Operating System: *
 PHP Version:      6.0.0-dev (20070509)
 New Comment:

This bug has been fixed in the documentation's XML sources. Since the
online and downloadable versions of the documentation need some time
to get updated, we would like to ask you to be a bit patient.

Thank you for the report, and for helping us make our documentation
better.

$offset: "(in bytes)"


Previous Comments:
------------------------------------------------------------------------

[2007-06-04 13:18:38] [EMAIL PROTECTED]

>preg_match() with 'u' modifier is supposed to use UTF-8, but this
>switch doesn't affect offset parameter, which is always in bytes.

Right, PHP is not supposed to parse the regexp to detect which
modifiers were used.
The byte/codepoint behaviour changes only in Unicode mode.

------------------------------------------------------------------------

[2007-06-04 13:08:02] spam02 at pornel dot net

(fixed php version)

------------------------------------------------------------------------

[2007-06-04 13:04:43] spam02 at pornel dot net

Description:
------------
preg_match() with 'u' modifier is supposed to use UTF-8, but this
switch doesn't affect offset parameter, which is always in bytes.

This gotcha at least deserves to be documented, although consistent
unicode support would be even better.


Reproduce code:
---------------
<?php
preg_match('/./u',urldecode('%C2%AE').'NY',$m,NULL,2);
echo $m[0];


Expected result:
----------------
Y

Actual result:
--------------
N


------------------------------------------------------------------------


-- 
Edit this bug report at http://bugs.php.net/?id=41588&edit=1

Reply via email to