[pcre-dev] [Bug 2670] New: \R erroneous match against particular UTF-8 character 0xC485

admin Wed, 04 Nov 2020 14:27:26 -0800

https://bugs.exim.org/show_bug.cgi?id=2670


            Bug ID: 2670
           Summary: \R erroneous match against particular UTF-8 character
                    0xC485
           Product: PCRE
           Version: N/A
          Hardware: x86
                OS: Windows
            Status: NEW
          Severity: bug
          Priority: medium
         Component: Code
          Assignee: philip.ha...@gmail.com
          Reporter: laimonas.ve...@gmail.com
                CC: pcre-dev@exim.org

preg_match("/\R/", 'Äbb'); // Ä â 0xC485 (UTF-8)
preg_split("/\R/", 'Äbb');
//(and other preg_* functions using \R)

LANG=C echo -e "\xC4\x85" | grep -nP '\R'

Windows (Cygwin), Linux, various, different versions of PCRE. 
'\R' won't match other UTF-8 chars (at least dozens i've tried), just this one.

Yes, i'm aware of /u modifier (for preg_*) and it works normally with UTF-8
locales, but anyway seems like bug.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

-- 
## List details at https://lists.exim.org/mailman/listinfo/pcre-dev

[pcre-dev] [Bug 2670] New: \R erroneous match against particular UTF-8 character 0xC485

Reply via email to