http://bugzilla.spamassassin.org/show_bug.cgi?id=3979

           Summary: addresses not parsed properly for AWL if contain 2-byte
                    language characters
           Product: Spamassassin
           Version: 3.0.1
          Platform: PC
        OS/Version: Linux
            Status: NEW
          Severity: normal
          Priority: P5
         Component: Libraries
        AssignedTo: [email protected]
        ReportedBy: [EMAIL PROTECTED]


From: headers that contain 2-byte (Japanese) characters in the description (i.e.
"2-byte chars here" <[EMAIL PROTECTED]>) were causing email addresses to be
improperly parsed before being added to the AWL database.

This caused entries in the database such as: "_z _ <[EMAIL PROTECTED]" (no
quotes) 

The From: header looks like this:

From: =?ISO-2022-JP?B?IhskQiF6OD02YiQ9JE4+bCRHRXYkPyRqISohekV2GyhC?=
        =?ISO-2022-JP?B?GyRCJD8kayEqISkkLyQ4IXo3fD5eJVwlcyVQITwhehsoQg==?=
        =?ISO-2022-JP?B?Ig==?=
        <[EMAIL PROTECTED]>

the problem is reproduceable.

I've collected debug information if necessary and am including a patch that will
extract the "[EMAIL PROTECTED]" from what's passed to AutoWhitelist's _pack 
function.

I haven't tested the patch extensively so it may be buggy or missing something
in the regex.



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

Reply via email to