http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5696

           Summary: cut regexp base strings at Unicode high codepoints
           Product: Spamassassin
           Version: SVN Trunk (Latest Devel Version)
          Platform: Other
        OS/Version: other
            Status: NEW
          Severity: minor
          Priority: P5
         Component: sa-compile
        AssignedTo: [email protected]
        ReportedBy: [EMAIL PROTECTED]


a pattern like /foo bar baz \x{e2}\x{a2}\x{ac}/ winds up with the UTF-8
codepoints corrupted as it passes through the base-extraction code.  to avoid
this, we should cut the base string at the first high codepoint found.



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

Reply via email to