> From: felic...@kluge.net On Behalf Of Theo Van Dinter
>
> On Fri, Jun 19, 2009 at 3:04 AM, Jason Haar <jason.h...@trimble.co.nz> wrote:
> > Speaking of image/rtf/word attachment spam; is there any work going on
> > to standardize this so that the textual output of such attachments could
> > be fed back into SA?
>
> That functionality already exists (has for almost 3 years, actually),
> but as in the past (list archives) the documentation hasn't improved
> for it. :(
>
> Here's my last(?) post about it which has some sample code and
> everything:
>
> http://www.nabble.com/Re:-PDFText-Plugin-for-PDF-file-scoring---not-for-PDF-images-p11595641.html

Thanks for the sample code. Once you get the $p object from $msg->find_parts(), how do you extract the contents of the message part to run it through antiword or whatever?