Re: Mailbox for auto learning

2009-08-12 Thread Luis Daniel Lucio Quiroz
Le mardi 11 août 2009 05:12:05, Cedric Knight a écrit :
 Luis Daniel Lucio Quiroz wrote:
  Le lundi 10 août 2009 19:15:15, Cedric Knight a écrit :
  Stefan wrote:

 [...]

  You have to forward the message as an attachment un unpack it after
  receiving. Have a look at:
  https://po2.uni-stuttgart.de/~rusjako/sal-wrapper
 
  Yes, I find this approach works well.  It's the simplest way for me to
  train Bayes, and most users can cope with it, providing they're not
  using Outlook 2003/XP which can't forward as an attachment.  But
  Thunderbird, Outlook Express, Squirrelmail and Pine all can easily.
  It's not as simple as a 'This Is Spam' button perhaps, and that's a
  *good* thing.  Requiring a little bit of thought stops people using it
  as an alternative to the delete key for 'OK, perhaps I did subscribe to
  this but I don't want it now'.

 [...]

  Yes but problem is that 99% of users are about using some kind of outlook

 Well then, tell them not to :)  Outlook Express and Windows Mail are
 fine.  Outlook 2003 supposedly needs a special program like
 http://www.olspamcop.org/ to forward properly, although if you select
 multiple messages to forward, then it will forward them in some kind of
 possibly useful digest format.  Outlook 2007 introduces an explicit menu
 item called forward as an attachment (Ctrl+Alt+F) but still mangles
 the headers:
 http://forum.spamcop.net/forums/index.php?showtopic=10241st=0p=70453#ent
ry70453

 Outlook 2007 also mangles the headers (kind of reconstructing a
 misleading semblance of what the original was) when moving between IMAP
 folders.  Therefore, I wouldn't use spamassassin -r on spam from Outlook
 users, but sa-learn to get tokens from the body text may be OK.

 Actually, some users of Outlook 2003 do seem to be able to forward as
 intact message/rfc822 attachment.  Not exactly sure how.

 Anyway, the 1% using a better e-mail program may be all that's needed to
 train Bayes.

 CK

Tha nkx

I did resolve it by using altermime+postfix solution. I look my X-quarantine 
heather to get the mail_id and then i add that file.

Rustique, mais il marche

LD


Re: Mailbox for auto learning

2009-08-11 Thread Cedric Knight
Luis Daniel Lucio Quiroz wrote:
 Le lundi 10 août 2009 19:15:15, Cedric Knight a écrit :
 Stefan wrote:
[...]
 You have to forward the message as an attachment un unpack it after
 receiving. Have a look at:
 https://po2.uni-stuttgart.de/~rusjako/sal-wrapper
 Yes, I find this approach works well.  It's the simplest way for me to
 train Bayes, and most users can cope with it, providing they're not
 using Outlook 2003/XP which can't forward as an attachment.  But
 Thunderbird, Outlook Express, Squirrelmail and Pine all can easily.
 It's not as simple as a 'This Is Spam' button perhaps, and that's a
 *good* thing.  Requiring a little bit of thought stops people using it
 as an alternative to the delete key for 'OK, perhaps I did subscribe to
 this but I don't want it now'.
[...]

 Yes but problem is that 99% of users are about using some kind of outlook

Well then, tell them not to :)  Outlook Express and Windows Mail are
fine.  Outlook 2003 supposedly needs a special program like
http://www.olspamcop.org/ to forward properly, although if you select
multiple messages to forward, then it will forward them in some kind of
possibly useful digest format.  Outlook 2007 introduces an explicit menu
item called forward as an attachment (Ctrl+Alt+F) but still mangles
the headers:
http://forum.spamcop.net/forums/index.php?showtopic=10241st=0p=70453#entry70453

Outlook 2007 also mangles the headers (kind of reconstructing a
misleading semblance of what the original was) when moving between IMAP
folders.  Therefore, I wouldn't use spamassassin -r on spam from Outlook
users, but sa-learn to get tokens from the body text may be OK.

Actually, some users of Outlook 2003 do seem to be able to forward as
intact message/rfc822 attachment.  Not exactly sure how.

Anyway, the 1% using a better e-mail program may be all that's needed to
train Bayes.

CK



Re: Mailbox for auto learning

2009-08-10 Thread Stefan
Am Sonntag, 9. August 2009 07:36:54 schrieb Luis Daniel Lucio Quiroz:
 Hi SAs,

 Well, after reading this link
 http://spamassassin.apache.org/full/3.2.x/doc/sa-learn.html I'm still
 looking for an easy-way to let my mortal users to train our antispam.  I
 was thinking a mailbox such as  h...@antispamserver and s...@antispamserver
 to let users to forward their false positivos or their false netgatives. 
 In isde each box (ham or spam), of course a procmail with sa-learn input
 will be forwarded.

 My doubts are nexts:
 1. Will forwarded mails be usefull for training, I mean if spam was: From:
 spa...@example.netTo: u...@mydomain,   when forwarding it will be From:
 mu...@mydomain To: s...@antispamserver.   Change of this and forwarding
 (getting rid of headers because mail-clients) wont change learning?

You have to forward the message as an attachment un unpack it after receiving. 
Have a look at: 
https://po2.uni-stuttgart.de/~rusjako/sal-wrapper

 2. If technique in question 1 is usless, what other way would be nice to
 let user to report a false positive/negative for training.

 TIA
 LD

Greetings
Stefan


Re: Mailbox for auto learning

2009-08-10 Thread Dan Schaefer

Stefan wrote:

Am Sonntag, 9. August 2009 07:36:54 schrieb Luis Daniel Lucio Quiroz:
  

Hi SAs,

Well, after reading this link
http://spamassassin.apache.org/full/3.2.x/doc/sa-learn.html I'm still
looking for an easy-way to let my mortal users to train our antispam.  I
was thinking a mailbox such as  h...@antispamserver and s...@antispamserver
to let users to forward their false positivos or their false netgatives. 
In isde each box (ham or spam), of course a procmail with sa-learn input

will be forwarded.

My doubts are nexts:
1. Will forwarded mails be usefull for training, I mean if spam was: From:
spa...@example.netTo: u...@mydomain,   when forwarding it will be From:
mu...@mydomain To: s...@antispamserver.   Change of this and forwarding
(getting rid of headers because mail-clients) wont change learning?

You have to forward the message as an attachment un unpack it after receiving. 
Have a look at: 
https://po2.uni-stuttgart.de/~rusjako/sal-wrappe

2. If technique in question 1 is usless, what other way would be nice to
let user to report a false positive/negative for training.


This may not be ideal, but in Thunderbird, you can drag messages between 
mailboxes. You could setup each user to have access to their own account 
and the two learning mailboxes. You can then have your users drag the 
false positives/negatives to the appropriate box. I have not testing 
this 100%, so I don't know if any headers get re-written or not.


--
Dan Schaefer
Web Developer/Systems Analyst
Performance Administration Corp.



Re: Mailbox for auto learning

2009-08-10 Thread Jari Fredriksson
 Stefan wrote:

 This may not be ideal, but in Thunderbird, you can drag
 messages between mailboxes. You could setup each user to
 have access to their own account and the two learning
 mailboxes. You can then have your users drag the false
 positives/negatives to the appropriate box. I have not
 testing this 100%, so I don't know if any headers get
 re-written or not.  

This is possible only when using IMAP. Not POP.

When using IMAP, it is also possible to use folders, no need for separate 
mailboxes. But there will be no difference in using mailboxes or folders, it 
just works.

No header modifications take place on a message when dragging it from folder 
into another, or from mailbox to another.

But as the OP thinks about separate mailboxes, I am afraid that is because he 
has no folders available. That must be because his users are tied to POP3.




Re: Mailbox for auto learning

2009-08-10 Thread Cedric Knight
Stefan wrote:
 Am Sonntag, 9. August 2009 07:36:54 schrieb Luis Daniel Lucio Quiroz:
 Hi SAs,

 Well, after reading this link
 http://spamassassin.apache.org/full/3.2.x/doc/sa-learn.html I'm still
 looking for an easy-way to let my mortal users to train our antispam.  I
 was thinking a mailbox such as  h...@antispamserver and s...@antispamserver
 to let users to forward their false positivos or their false netgatives. 
 In isde each box (ham or spam), of course a procmail with sa-learn input
 will be forwarded.

 My doubts are nexts:
 1. Will forwarded mails be usefull for training, I mean if spam was: From:
 spa...@example.netTo: u...@mydomain,   when forwarding it will be From:
 mu...@mydomain To: s...@antispamserver.   Change of this and forwarding
 (getting rid of headers because mail-clients) wont change learning?
 
 You have to forward the message as an attachment un unpack it after 
 receiving. 
 Have a look at: 
 https://po2.uni-stuttgart.de/~rusjako/sal-wrapper

Yes, I find this approach works well.  It's the simplest way for me to
train Bayes, and most users can cope with it, providing they're not
using Outlook 2003/XP which can't forward as an attachment.  But
Thunderbird, Outlook Express, Squirrelmail and Pine all can easily.
It's not as simple as a 'This Is Spam' button perhaps, and that's a
*good* thing.  Requiring a little bit of thought stops people using it
as an alternative to the delete key for 'OK, perhaps I did subscribe to
this but I don't want it now'.

My script is very similar to sal-wrapper, using Postfix
check_recipient_access to ensure only authenticated users can send to
the reporting address; triggered from procmail; using MIME::Parser to
extract (possibly multiple) message/rf822 attachments; feed through
sa-learn --ham or spamassassin -r as appropriate and send an
acknowledgement back to the user, to remind them to also send
spam/non-spam to the corresponding address and correct any mistakes.

One thing I notice from sal-wrapper however is that it pipes the header
and body to sa-learn without passing a file as parameter.  I found that
although sa-learn didn't complain, this didn't work at all well, and
quite short ham messages were scoring BAYES_99.  You can pipe to
spamassassin -r just like you can to spamassassin in any other mode, but
I think if you pipe to sa-learn, you need to do it as
   sa-learn --ham -

with the '-' as parameter, so it reads the standard input.
Alternatively feed it a temporary message file.  Or am I misreading
something?

CK



Re: Mailbox for auto learning

2009-08-10 Thread Luis Daniel Lucio Quiroz
Le lundi 10 août 2009 19:15:15, Cedric Knight a écrit :
 Stefan wrote:
  Am Sonntag, 9. August 2009 07:36:54 schrieb Luis Daniel Lucio Quiroz:
  Hi SAs,
 
  Well, after reading this link
  http://spamassassin.apache.org/full/3.2.x/doc/sa-learn.html I'm still
  looking for an easy-way to let my mortal users to train our antispam.  I
  was thinking a mailbox such as  h...@antispamserver and
  s...@antispamserver to let users to forward their false positivos or
  their false netgatives. In isde each box (ham or spam), of course a
  procmail with sa-learn input will be forwarded.
 
  My doubts are nexts:
  1. Will forwarded mails be usefull for training, I mean if spam was:
  From: spa...@example.netTo: u...@mydomain,   when forwarding it will
  be From: mu...@mydomain To: s...@antispamserver.   Change of this and
  forwarding (getting rid of headers because mail-clients) wont change
  learning?
 
  You have to forward the message as an attachment un unpack it after
  receiving. Have a look at:
  https://po2.uni-stuttgart.de/~rusjako/sal-wrapper

 Yes, I find this approach works well.  It's the simplest way for me to
 train Bayes, and most users can cope with it, providing they're not
 using Outlook 2003/XP which can't forward as an attachment.  But
 Thunderbird, Outlook Express, Squirrelmail and Pine all can easily.
 It's not as simple as a 'This Is Spam' button perhaps, and that's a
 *good* thing.  Requiring a little bit of thought stops people using it
 as an alternative to the delete key for 'OK, perhaps I did subscribe to
 this but I don't want it now'.

 My script is very similar to sal-wrapper, using Postfix
 check_recipient_access to ensure only authenticated users can send to
 the reporting address; triggered from procmail; using MIME::Parser to
 extract (possibly multiple) message/rf822 attachments; feed through
 sa-learn --ham or spamassassin -r as appropriate and send an
 acknowledgement back to the user, to remind them to also send
 spam/non-spam to the corresponding address and correct any mistakes.

 One thing I notice from sal-wrapper however is that it pipes the header
 and body to sa-learn without passing a file as parameter.  I found that
 although sa-learn didn't complain, this didn't work at all well, and
 quite short ham messages were scoring BAYES_99.  You can pipe to
 spamassassin -r just like you can to spamassassin in any other mode, but
 I think if you pipe to sa-learn, you need to do it as
sa-learn --ham -

 with the '-' as parameter, so it reads the standard input.
 Alternatively feed it a temporary message file.  Or am I misreading
 something?

 CK
Yes but problem is that 99% of users are about using some kind of outlook


Re: Mailbox for auto learning

2009-08-09 Thread Matt Kettler
Luis Daniel Lucio Quiroz wrote:
 Hi SAs,

 Well, after reading this link 
 http://spamassassin.apache.org/full/3.2.x/doc/sa-learn.html I'm still looking 
 for an easy-way to let my mortal users to train our antispam.  I was thinking 
 a mailbox such as  h...@antispamserver and s...@antispamserver to let users 
 to 
 forward their false positivos or their false netgatives.  In isde each box 
 (ham or spam), of course a procmail with sa-learn input will be forwarded.  

 My doubts are nexts:
 1. Will forwarded mails be usefull for training, I mean if spam was: From: 
 spa...@example.netTo: u...@mydomain,   when forwarding it will be From: 
 mu...@mydomain To: s...@antispamserver.   Change of this and forwarding 
 (getting rid of headers because mail-clients) wont change learning?
   
Forwarded mails are NOT useful.

You also neglected to mention the change of Received headers, and pretty
much every header in the message, the re-encoding of the body by your
mail client, etc.

Since SA's bayes tokenizes headers, that's disastrous.
 2. If technique in question 1 is usless, what other way would be nice to let 
 user to report a false positive/negative for training.
In some cases you can have the client forward as attachment, and use a
mailbox that strips attachments and feeds them to sa-learn. As long as
the client being used forwards the entire original message, with
complete headers, this should work fine.

   

 TIA

 LD


   



Re: Mailbox for auto learning

2009-08-09 Thread Luis Daniel Lucio Quiroz
Le dimanche 9 août 2009 06:52:49, vous avez écrit :
 Luis Daniel Lucio Quiroz wrote:
  Hi SAs,
 
  Well, after reading this link
  http://spamassassin.apache.org/full/3.2.x/doc/sa-learn.html I'm still
  looking for an easy-way to let my mortal users to train our antispam.  I
  was thinking a mailbox such as  h...@antispamserver and
  s...@antispamserver to let users to forward their false positivos or
  their false netgatives.  In isde each box (ham or spam), of course a
  procmail with sa-learn input will be forwarded.
 
  My doubts are nexts:
  1. Will forwarded mails be usefull for training, I mean if spam was:
  From: spa...@example.netTo: u...@mydomain,   when forwarding it will
  be From: mu...@mydomain To: s...@antispamserver.   Change of this and
  forwarding (getting rid of headers because mail-clients) wont change
  learning?

 Forwarded mails are NOT useful.

 You also neglected to mention the change of Received headers, and pretty
 much every header in the message, the re-encoding of the body by your
 mail client, etc.

 Since SA's bayes tokenizes headers, that's disastrous.

  2. If technique in question 1 is usless, what other way would be nice to
  let user to report a false positive/negative for training.

 In some cases you can have the client forward as attachment, and use a
 mailbox that strips attachments and feeds them to sa-learn. As long as
 the client being used forwards the entire original message, with
 complete headers, this should work fine.

  TIA
 
  LD

I understand

and if I use altemime to add a link, to identify email in a quarantine?  will 
tex in altermime change learning?


Re: Mailbox for auto learning

2009-08-09 Thread RW
On Sun, 9 Aug 2009 00:36:54 -0500
Luis Daniel Lucio Quiroz luis.daniel.lu...@gmail.com wrote:

 Hi SAs,
 
 Well, after reading this link 
 http://spamassassin.apache.org/full/3.2.x/doc/sa-learn.html I'm still
 looking for an easy-way to let my mortal users to train our
 antispam.

If your users use webmail, imap etc , the most convenient approach is to
have folders for learning spam and ham.


Re: Mailbox for auto learning

2009-08-09 Thread Benny Pedersen
On Sun, 9 Aug 2009 00:36:54 -0500, Luis Daniel Lucio Quiroz

 1. Will forwarded mails be usefull for training, I mean if spam was:
From: 
 spa...@example.netTo: u...@mydomain,   when forwarding it will be
 From: 
 mu...@mydomain To: s...@antispamserver.   Change of this and forwarding 
 (getting rid of headers because mail-clients) wont change learning?
 
 2. If technique in question 1 is usless, what other way would be nice to
 let 
 user to report a false positive/negative for training. 

dovecot-antispam solves it with dovecot

all users need to do is move mail in imap to junk folder, in that task
dovecot-antispam call sa-learn

this means no junk plugins to windows clients

and last but not least no header changes

mail that is moved out of the junk folder is learned as ham, intuitive
like an amiga :)


-- 
Benny Pedersen


Re: Mailbox for auto learning

2009-08-09 Thread Luis Daniel Lucio Quiroz
Le dimanche 9 août 2009 10:56:59, Benny Pedersen a écrit :
 On Sun, 9 Aug 2009 00:36:54 -0500, Luis Daniel Lucio Quiroz

  1. Will forwarded mails be usefull for training, I mean if spam was:

 From:
  spa...@example.netTo: u...@mydomain,   when forwarding it will be
  From:
  mu...@mydomain To: s...@antispamserver.   Change of this and forwarding
  (getting rid of headers because mail-clients) wont change learning?
 
  2. If technique in question 1 is usless, what other way would be nice to
  let
  user to report a false positive/negative for training.

 dovecot-antispam solves it with dovecot

 all users need to do is move mail in imap to junk folder, in that task
 dovecot-antispam call sa-learn

 this means no junk plugins to windows clients

 and last but not least no header changes

 mail that is moved out of the junk folder is learned as ham, intuitive
 like an amiga :)

Yes but worst scenario is best for me.  POP users with  MS outlook.

Then I was wondering to add with altermime somethin like this at footer:

if you think this mail is spam please click here (also for ham), and here 
is a link with message-id (i have a CC of all mails).

So, other doutbt, altering mail by adding a footer will alter SA learning?


Mailbox for auto learning

2009-08-08 Thread Luis Daniel Lucio Quiroz
Hi SAs,

Well, after reading this link 
http://spamassassin.apache.org/full/3.2.x/doc/sa-learn.html I'm still looking 
for an easy-way to let my mortal users to train our antispam.  I was thinking 
a mailbox such as  h...@antispamserver and s...@antispamserver to let users to 
forward their false positivos or their false netgatives.  In isde each box 
(ham or spam), of course a procmail with sa-learn input will be forwarded.  

My doubts are nexts:
1. Will forwarded mails be usefull for training, I mean if spam was: From: 
spa...@example.netTo: u...@mydomain,   when forwarding it will be From: 
mu...@mydomain To: s...@antispamserver.   Change of this and forwarding 
(getting rid of headers because mail-clients) wont change learning?

2. If technique in question 1 is usless, what other way would be nice to let 
user to report a false positive/negative for training.  

TIA

LD