https://bugs.exim.org/show_bug.cgi?id=2670

            Bug ID: 2670
           Summary: \R erroneous match against particular UTF-8 character
                    0xC485
           Product: PCRE
           Version: N/A
          Hardware: x86
                OS: Windows
            Status: NEW
          Severity: bug
          Priority: medium
         Component: Code
          Assignee: [email protected]
          Reporter: [email protected]
                CC: [email protected]

preg_match("/\R/", 'ąbb'); // ą – 0xC485 (UTF-8)
preg_split("/\R/", 'ąbb');
//(and other preg_* functions using \R)

LANG=C echo -e "\xC4\x85" | grep -nP '\R'

Windows (Cygwin), Linux, various, different versions of PCRE. 
'\R' won't match other UTF-8 chars (at least dozens i've tried), just this one.

Yes, i'm aware of /u modifier (for preg_*) and it works normally with UTF-8
locales, but anyway seems like bug.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
-- 
## List details at https://lists.exim.org/mailman/listinfo/pcre-dev 

Reply via email to