http://issues.apache.org/SpamAssassin/show_bug.cgi?id=5696
Summary: cut regexp base strings at Unicode high codepoints
Product: Spamassassin
Version: SVN Trunk (Latest Devel Version)
Platform: Other
OS/Version: other
Status: NEW
Severity: minor
Priority: P5
Component: sa-compile
AssignedTo: [email protected]
ReportedBy: [EMAIL PROTECTED]
a pattern like /foo bar baz \x{e2}\x{a2}\x{ac}/ winds up with the UTF-8
codepoints corrupted as it passes through the base-extraction code. to avoid
this, we should cut the base string at the first high codepoint found.
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.