https://bz.apache.org/SpamAssassin/show_bug.cgi?id=8272

Joe Wein <joew...@surbl.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |REOPENED
         Resolution|FIXED                       |---

--- Comment #16 from Joe Wein <joew...@surbl.org> ---
We are still seeing the problem with the latest batch of emails from this gang.
The emails have text/html parts declared as charset=UTF-8 in the MIME
Content-Type header, but an HTML META tag inside the body also declares
charset=GB2312, the "Simplified Chinese" character set.

Example:
====
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<html>=0A<head>=0A    <META content=3D"text/html; charset=3DGB2312" http-eq=
uiv=3DContent-Type>=0A    <STYLE>body{margin-top:5px; margin-right:2px ; ma=
====
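
The conflicting declarations can be checked mechanically. The following is a
minimal Python sketch, not SpamAssassin code: it parses the sample part above
with the standard email module and compares the MIME charset with the one
declared in the META tag. The helper name declared_charsets and the loose
regular expression are illustrative assumptions.

```python
import email
import re

# The sample part from this comment, cut off after the META tag.
SAMPLE = (
    "Content-Type: text/html; charset=UTF-8\n"
    "Content-Transfer-Encoding: quoted-printable\n"
    "\n"
    '<html>=0A<head>=0A    <META content=3D"text/html; charset=3DGB2312" http-eq=\n'
    "uiv=3DContent-Type>=0A\n"
)

# Hypothetical, deliberately loose pattern for a charset declared in the HTML.
META_CHARSET = re.compile(rb"charset\s*=\s*([A-Za-z0-9_-]+)", re.IGNORECASE)


def declared_charsets(raw_part):
    """Return (MIME charset, META charset) for one MIME part, lowercased."""
    part = email.message_from_string(raw_part)
    mime_charset = part.get_content_charset()  # lowercased by the email module
    body = part.get_payload(decode=True)       # undoes quoted-printable
    match = META_CHARSET.search(body)
    meta_charset = match.group(1).decode("ascii").lower() if match else None
    return mime_charset, meta_charset


mime_charset, meta_charset = declared_charsets(SAMPLE)
if meta_charset and meta_charset != mime_charset:
    print(f"charset mismatch: MIME says {mime_charset}, META says {meta_charset}")
```

Which of the two declarations a URI extractor should trust is a separate
question; the sketch only shows that the disagreement is easy to detect.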

Link that fails to match against a URI BL:

====
 3px"=0A                             href=3D'https://yahoo.co.jp=E2=88=
=956ybzql46h8z9pwl4n=E2=88=956ybzql46h8z9pwl4n=E2=88=95odcsdggjfjjg@odcsdgg=
jfjjg.=F0=9D=96=99=F0=9D=96=90=F0=9D=96=98=F0=9D=96=8D=F0=9D=96=94=
=F0=9D=96=95=F0=9D=96=9B=F0=9D=96=9B=F0=9D=96=8C.%F0%9D%92%B8%F0%9D%91%9C%F=
0%9D%93%82/gongyi?token=3D66xdBOq2ifwAfOfq_lANfwTnpWi4ooiqhpQjPLfKuMm63twSa=
Gsb3HFcOcOaM-5cG_CX_HZQ2EPEPQGkm29ZtXSpThdlX21K3gBYaflXAdeg'=0A            =
====

As you can see, some characters are quoted-printable encoded UTF-8 bytes while
others are percent-encoded (URL-encoded), but either form should convert from
UTF-8 to plain ASCII after the decoy Basic authentication (userinfo) part
"yahoo.co.jp=(...)fjjg@" is stripped off.

The SA score header makes no mention of URI matches even though the basic
domain "tkshopvvg[.]com" (from the URI after UTF-8 to ASCII conversion) is
listed.
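
The decoding steps described above can be sketched in Python as follows. This
is an illustration under stated assumptions, not how SpamAssassin processes
URIs: the "UTF-8 to ASCII" step is modelled here as Unicode NFKC normalization,
the helper name normalized_host is hypothetical, and taking the last two labels
as the registrable domain is a naive shortcut. The href is copied from the
sample with the soft line breaks joined and the query string left out.

```python
import quopri
import unicodedata
from urllib.parse import unquote

# href from the sample: soft line breaks joined, query string omitted.
RAW_HREF = (
    b"https://yahoo.co.jp=E2=88=956ybzql46h8z9pwl4n=E2=88=956ybzql46h8z9pwl4n"
    b"=E2=88=95odcsdggjfjjg@odcsdggjfjjg."
    b"=F0=9D=96=99=F0=9D=96=90=F0=9D=96=98=F0=9D=96=8D=F0=9D=96=94"
    b"=F0=9D=96=95=F0=9D=96=9B=F0=9D=96=9B=F0=9D=96=8C"
    b".%F0%9D%92%B8%F0%9D%91%9C%F0%9D%93%82/gongyi"
)


def normalized_host(raw_href):
    """Hypothetical helper: decode an href and return its host in ASCII form."""
    # 1. Undo quoted-printable, then interpret the bytes as UTF-8.
    url = quopri.decodestring(raw_href).decode("utf-8")
    # 2. Isolate the authority. The "slashes" inside the decoy are U+2215
    #    DIVISION SLASH, so the first real "/" is the one before the path.
    authority = url.split("://", 1)[1].split("/", 1)[0]
    # 3. Drop the decoy userinfo, i.e. everything up to the last "@".
    host = authority.rpartition("@")[2]
    # 4. Undo percent-encoding (the escaped bytes are UTF-8 as well).
    host = unquote(host)
    # 5. Assumption: NFKC folds the styled mathematical letters to ASCII.
    return unicodedata.normalize("NFKC", host).lower()


host = normalized_host(RAW_HREF)
# Naive assumption: registrable domain = last two labels. Real code would
# consult a public suffix list instead.
domain = ".".join(host.split(".")[-2:])
print(host, domain)
```

For this sample the second printed value is the listed domain mentioned above,
which is the name a URI BL lookup would need to query.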

