Re: [Bug 6854] Optimizations, profiling

Axb Tue, 23 Oct 2012 13:26:35 -0700

On 10/23/2012 10:15 PM, Kevin A. McGrail wrote:> On 10/23/2012 4:10 PM,Axb wrote:

>> On 10/23/2012 09:59 PM, Kevin A. McGrail wrote:
>>> On 10/23/2012 3:48 PM, [email protected] wrote:
>>>> A message larger than a certain configured size is truncated
>>>> at the configured size and that is what SpamAssassin sees.
>>>> No other contents processing in this data path, just
>>>> blunt truncation of the raw mail message. Works quite well,
>>>> certainly much better than not scanning large messages at all.

>>> Makes sense to me. Something we should consider for SA to do bydefault

>>> with spamc/spamd?
>>
>> why? wasn't spamassassin designed to ignore attachements instead of
>> what MailScanner and Amavisd are doing using the API?
>>
>> Why should SA spend time scanning "binary" content it cannot decode or
>> extract anything useful to apply rules?
>>
>> I consider the "chunk" method the worse way to do it as it may skip
>> txt/html content which could show up after the configured chunk size
>> while spending lots of cycles  scanning a two liner with an attached
>> 400kb PDF/workd/etc attachement.
>> or did it get it all wrong?
> It's very synergistic because I wrote a note about this 2 days ago.
>
> With SpamC/SpamD, if the email (encoded) is larger than X size, the
> email is not scanned at all.


make it switchable?

scan_chunck  yes
scan_chunk_size 400kb

but as a global method change, no thanks.

> My thoughts were to ignore any binary attachments.  I even considered
> writing a glue that would rewrite a temporary copy of the email to
> remove binary attachments and see if THAT met the threshold.  But if
> real-world experience with simply chopping works well, who am I to
> complain?

I stopped using MailScanner exactly due to that reason: lots of pluginsand rules go nuts over encoded attachements.

Spamc/Spamd's "skip size" method has made a huge *positive* differenceon FPs, and scan times.

The FNs wouldn't *ever* have been caught by a chunk method due to thekind of content included "above" threshold.

Re: [Bug 6854] Optimizations, profiling

Reply via email to