On 13-04-11 10:44 AM, Stephen J. Turnbull wrote:
1. Mailman is the wrong place to do filtering. It's equally
effective, normally covers more messages, and is somewhat more
efficient in resource usage to do it at the MTA.
2. Any new algorithms*should* be made available at the MTA level
where they can be best put to use by more people. This implies
something that either plugs into existing filters (such as
spamassassin) or MTAs (ie, milters) rather than a Handler.
3. Adapting existing filters is generally pretty trivial: you write a
10-line custom Handler that pipes it to an external process. This
isn't big enough for a GSoC project.
4. To the extent that new algorithms are involved, I have doubts that
Mailman mentors have the kind of expertise needed to really help
with such a project (I could be wrong, but I certainly don't know
much about that kind of text processing, and I don't know that
anybody else in Mailman has expertise in it).
Writing individual pipelines may be trivial, but making a user interface
for managing said pipelines is non-trivial. Right now, our pipeline
management interface is "there's a text box in postorius that lets you
choose a pipeline. It's not even a dropdown, and you may be screwed if
you make a typo" which is obviously not how I want it when we release. ;)
I see a potential project timeline going something like this:
A. make a set of custom Mailman 3 Handlers for some well-known existing
anti-spam/anti-malware software. (Maybe 2-3 weeks of work here, finding
2-4 reasonable pieces of software, setting them up, writing the
handlers, and testing them)
B. make an interface in Postorius so list admins can
enable/disable/reorder these and any whitelisting happening within
mailman. This should involve making an interface in Postorius that
gives admins the ability to change the Pipeline being used, and will
likely involve a small amount of user testing to make sure said
interface doesn't have risk of disastrous results if the administrator
does the wrong thing. (Another 3-4 weeks of work including user
testing, unit tests, and documentation)
C. Figure out how to set up some sort of packager that can install
handlers + antispam software so that the site admin has an easy way to
set these up if requested. (Another 3-4 weeks of work, including testing
any scripts on a few different OSes and extensive documentation)
D. If there's any time leftover, implement some clever new filter (and
appropriate Handler) that makes use of the list information itself (e.g.
subscriber list, archives, etc.) to make better spam decisions. (at this
point, you've got maybe 2 weeks left in the GSoC timeline)
I think that constitutes enough useful-to-mailman work to justify the
google funds, gets us some customizable spam filtering (which as you
say, is a frequently requested feature), but doesn't turn us into
something we're not. That's why anti-spam made this year's gsoc list
even though we've always said "do it in the MTA" and I'm not about to
change that policy in general.
Do feel free to disagree with me, of course, Stephen. Or complain that
I'm using the lure of antispam to get someone solve my user interface
for pipelines problem, which I totally am. ;)
Terri
_______________________________________________
Mailman-Developers mailing list
Mailman-Developers@python.org
http://mail.python.org/mailman/listinfo/mailman-developers
Mailman FAQ: http://wiki.list.org/x/AgA3
Searchable Archives:
http://www.mail-archive.com/mailman-developers%40python.org/
Unsubscribe:
http://mail.python.org/mailman/options/mailman-developers/archive%40jab.org
Security Policy: http://wiki.list.org/x/QIA9