> > On Fri, 2010-05-28 at 12:54 -0700, John Hardin wrote:
> > It's becoming much more prevalent. I'm seeing lots of 419 and phishing
> > spams as .txt and .html attachments, more rarely .doc or .rtf.
> >
> > Writing an extractor for .txt attachments is trivial, and it shouldn't
> > be too hard to do HTML. RTF and PDF are included, but do require
> > external tools be present.
>
> Err, unless I am terribly mistaken and didn't interpret the docs
> correctly (too lazy to test) -- SA always uses any textual part.
>
> That includes plain text and HTML "attached" to the message. It's just
> another MIME part anyway.
>
> It's a whole different cattle of fish with RTF...

I found a 3rd party plug-in called "ExtractText" at:
http://wiki.apache.org/spamassassin/CustomPlugins
However, that doesn't seem to advertise that it feeds the attachment to bayes.

Kaleb
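
To illustrate the "it's just another MIME part" point from the quoted reply, here is a rough sketch using Python's standard email package. It is not SpamAssassin code (SA is written in Perl) and says nothing about what SA or ExtractText actually feed to bayes. The function name textual_parts, the sample message and the file names are made up for the example. It only shows that .txt and .html attachments turn up as ordinary text/* parts when you walk the message, while an RTF attachment does not.

```python
from email.message import EmailMessage


def textual_parts(msg):
    """Return (content_type, filename, text) for every text/* part of msg."""
    found = []
    for part in msg.walk():
        # Skip multipart containers and non-text parts such as application/rtf.
        if part.get_content_maintype() != "text":
            continue
        payload = part.get_payload(decode=True)  # bytes, transfer encoding undone
        charset = part.get_content_charset() or "utf-8"
        text = payload.decode(charset, errors="replace")
        found.append((part.get_content_type(), part.get_filename(), text))
    return found


# Hypothetical sample: a short body plus three attachments.
msg = EmailMessage()
msg["Subject"] = "sample"
msg.set_content("See the attached file.")
msg.add_attachment("Dear friend, I am a prince ...",
                   subtype="plain", filename="letter.txt")
msg.add_attachment("<p>Please verify your account</p>",
                   subtype="html", filename="form.html")
msg.add_attachment(b"{\\rtf1 hello}",
                   maintype="application", subtype="rtf", filename="doc.rtf")

for ctype, name, text in textual_parts(msg):
    print(ctype, name, len(text))
```

The body and the two text attachments are listed; doc.rtf is skipped because its content type is application/rtf, which is the case where an extractor with external tools would be needed.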