https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6659
Bug #: 6659
Summary: Empty Content-Type causes learning of binary file
Product: Spamassassin
Version: 3.3.1
Platform: PC
OS/Version: Linux
Status: NEW
Severity: normal
Priority: P2
Component: Learner
AssignedTo: [email protected]
ReportedBy: [email protected]
Classification: Unclassified
I'm receiving a few hundred mails a day with small attachments that are (afaik)
correctly parsed and nothing happens with the attachments when checking the
message to be spam. When autolearning the E-mails as spam, the attachments are
being decoded and parsed by the bayes algorithm. The only strange thing I can
find in the message (appart from the text content obviously being a phishing
mail) is the header of the attachment part:
------=_NextPart_000_0006_01CC51AC.63F30F00
Content-Type: ;
name="report_1609.pdf.zip"
Content-Transfer-Encoding: base64
Content-Disposition: attachment;
filename="report_1609.pdf.zip"
I suspect the empty content-type causes the attachment to be decoded.
Running spamassassin in debug mode causes it to hang a the following lines:
Sep 16 15:07:12.279 [8264] dbg: bayes: Using userid: 1
Sep 16 15:08:48.746 [8264] dbg: bayes: seen
(bf76e190b8121487c91051758a402dd20b18eaa6@sa_generated) put
Manually calling sa-learn hangs for a while at the "decoding base64" part:
Sep 16 15:34:12.786 [18308] dbg: message: decoding base64
Forgot tokens from 1 message(s) (1 message(s) examined)
Sep 16 15:35:49.764 [18308] dbg: plugin:
Mail::SpamAssassin::Plugin::Bayes=HASH(0x3891ba0) implements 'learner_close',
priority 0
--
Configure bugmail:
https://issues.apache.org/SpamAssassin/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.