Hi,

Bug 6108 says that the Pyzor plugin ignores whitelisting entirely, and a comment on that bug suggests updating the plugin to use the Wilson score formula described here: http://www.evanmiller.org/how-not-to-sort-by-average-rating.html
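For reference, the lower bound of the Wilson score interval described in the linked article can be sketched as below. This is a minimal Python illustration, not the code from the attached plugin. Treating Pyzor's report count as the "positive" votes and its whitelist count as the "negative" votes is an assumption made for the example, as are the names `wilson_lower_bound` and `pyzor_spam_confidence` and the default `z` of 1.96 (roughly 95% confidence).

```python
import math


def wilson_lower_bound(positive, total, z=1.96):
    """Lower bound of the Wilson score interval for a binomial proportion.

    positive: number of positive observations
    total:    total number of observations
    z:        normal quantile for the desired confidence (1.96 ~ 95%)
    """
    if total <= 0:
        # No observations at all: no evidence either way.
        return 0.0
    p_hat = positive / total
    z2 = z * z
    centre = p_hat + z2 / (2 * total)
    margin = z * math.sqrt(p_hat * (1 - p_hat) / total + z2 / (4 * total * total))
    return (centre - margin) / (1 + z2 / total)


def pyzor_spam_confidence(reports, whitelists, z=1.96):
    """Hypothetical mapping: spam reports are positives, whitelists negatives."""
    return wilson_lower_bound(reports, reports + whitelists, z)
```

With this mapping, a digest that has many reports and few whitelists scores close to 1, while a digest with only a handful of reports, or with a comparable number of whitelists, scores much lower. Different rules could then compare the same value against different thresholds.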
I have tested this using a spreadsheet and our own corpus of spam and ham email and got good results, so I went ahead and implemented it in the Pyzor plugin. I'm new to both SpamAssassin and Perl, so sorry if it's a bit rough around the edges.

I wanted to make it possible to have multiple SpamAssassin rules with different Wilson score parameters giving different SpamAssassin scores, but obviously we don't want to query Pyzor multiple times for the same mail. So I've changed the actual Pyzor lookup to be done once during extract_metadata() instead of when the eval rule is run.

It works, but it looks to me like the query to Pyzor will block while waiting for a response, and I don't know SpamAssassin well enough to know whether this will cause problems. Please could someone who knows the internals better take a look and give me some feedback?

Thanks very much,

PS: this is based on the Pyzor plugin from version 3.4.0 because that's what we're using at the moment (I know, it's old, sorry...). I've not looked at more recent versions, but if there is interest in including this work upstream I'd be happy to rebase my changes onto SVN.

-- 
James Birkett
Attachment: nbPyzor.pm (binary data)
Attachment: sample_rules.cf (binary data)
