http://bugzilla.spamassassin.org/show_bug.cgi?id=4469

           Summary: Add a process/option to efficiently deal with very long
                    mail messages
           Product: Spamassassin
           Version: SVN Trunk (Latest Devel Version)
          Platform: Other
        OS/Version: other
            Status: NEW
          Severity: enhancement
          Priority: P4
         Component: spamc/spamd
        AssignedTo: [email protected]
        ReportedBy: [EMAIL PROTECTED]


There are starting to be occasional reports of very large spams that make it 
past SA by virtue of the length cutoff limit.

Passing the entire message to SA would of course not be a Good Thing to do.  
However, armchair reasoning suggests that the spaminess of the message can 
probably be determined reasonably accurately from the headers and the first 
2..10K or so of the message body in virtually all cases.  In fact, this is 
probably virtually always true, even with messages in the 20K..250K range.

Suggest two things here: an option to SA (perhaps a special line on the front 
of the message stream itself) that tells it that this will be a partial 
message, and secondly a change to spamd to pass partial messages, along with 
this flag, when some size limit is exceeded.  

Since only a partial message is being passed, obviously spamd can't just pipe 
the entire message thru SA and out the other end.  Instead, it will have to get 
a declaration from SA of spaminess, and then do something itself with the 
original message.

The purpose of the flag to SA for a partial message would be twofold: it would 
disable some of the rules that expect correct mime-part terminations, and it 
might change the output from SA to perhaps only be headers for the message, 
plus a return value that somehow indicates spam.  This return value might be in 
the form of a real return value, or a first header line with special 
formatting, or perhaps something else.

If SA operating in this mode returned modified headers only, it would be 
trivial for the spamd child to remove the original message headers and replace 
them with the SA-supplied headers, and pipe the rest of the message straight 
through, thus avoiding the SA large-message overhead.

However this sort of option is implemented (if it is), it should be done in a 
way that tools calling SA or the SA API directly can fairly easily implement 
spam detection using this option.



------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

Reply via email to