Spam and mailing list filtering?

2011-02-10 Thread Daniel Barlow

I've been running notmuch for only a couple of days now and am still
getting to grips with it, but I am very very happy to be back in Emacs
again for mail after two years in the Thunderbird wilderness.  So, thank
you.  

I like my mail tagged on reception so that my mailing lists and spam
only show up when (or if) I actually want to read them.  To this end
I've done as I suspect most people do: wrote a small shell script that
runs 'notmuch add' and then a succession of 'notmuch tag' to identify
the mails I'll deal with later.  For example

notmuch new
notmuch tag +inbox --  to:dan and is:unread
notmuch tag +office-work  --  folder:Work/Shared.office and is:unread
notmuch tag +tech-work --  folder:Work/Shared.technotes and is:unread
notmuch tag +work -- folder:Work and is:unread
notmuch tag +work -- from:office at work.com and is:unread
notmuch tag +ruby-talk -inbox -- to:ruby-talk and is:unread
notmuch tag +lrug -inbox -- to:ruby-talk and is:unread
notmuch tag +work-figures -inbox -- '(from:figures at work.com or to:figures at 
work.com) and is:unread


My questions 

1) (How) can I filter on the X-Spam-Bar header to chop out spam and
suspected spam?

2) Is there any way (actual or planned) to filter on X-List-Id or
similar so that I can filter mailing list stuff more accurately

3) Ditto for filtering on Envelope-To, which I know is not a real
SMTP header but appears in my mail archives anyway


-dan


Spam and mailing list filtering?

2011-02-10 Thread Austin Clements
On Feb 10, 2011 8:33 AM, "Ben Gamari"  wrote:
>
> On Thu, 10 Feb 2011 12:20:55 +, Daniel Barlow  wrote:
> > My questions
> >
> > 1) (How) can I filter on the X-Spam-Bar header to chop out spam and
> > suspected spam?
> >
> I simply run new mail through bogofilter in my sorting script and tag
> junk with a junk tag. I then add "not tag:junk" to each of my saved
searches.

One of my plans for after the custom query parser patches get reviewed is to
make it possible to exclude certain tags from queries by default, unless the
query specifically mentions the tag. This should make it easier to handle
spam tags like this.
-- next part --
An HTML attachment was scrubbed...
URL: 
<http://notmuchmail.org/pipermail/notmuch/attachments/20110210/ff3276f8/attachment.html>


Spam and mailing list filtering?

2011-02-10 Thread Rob Browning
Ben Gamari  writes:

> I simply run new mail through bogofilter in my sorting script and tag
> junk with a junk tag. I then add "not tag:junk" to each of my saved searches.
>  
>> 2) Is there any way (actual or planned) to filter on X-List-Id or
>> similar so that I can filter mailing list stuff more accurately

Also note that if you're already filtering via procmail or similar, I
believe you should be able to use notmuch-deliver to add tags based on
the headers it finds.

Alternately, if you already deliver all spam to a particular folder, the
new "folder:" support might help.

-- 
Rob Browning
rlb @defaultvalue.org and @debian.org
GPG as of 2002-11-03 14DD 432F AE39 534D B592 F9A0 25C8 D377 8C7E 73A4


Spam and mailing list filtering?

2011-02-10 Thread Ben Gamari
On Thu, 10 Feb 2011 12:20:55 +, Daniel Barlow  wrote:
> My questions 
> 
> 1) (How) can I filter on the X-Spam-Bar header to chop out spam and
> suspected spam?
>
I simply run new mail through bogofilter in my sorting script and tag
junk with a junk tag. I then add "not tag:junk" to each of my saved searches.

> 2) Is there any way (actual or planned) to filter on X-List-Id or
> similar so that I can filter mailing list stuff more accurately
> 
There have been talks of adding support for indexing arbitrary headers
although no one has produced any code as far as I know. Patches welcome.

Cheers,

- Ben


Re: Spam and mailing list filtering?

2011-02-10 Thread Austin Clements
On Feb 10, 2011 8:33 AM, Ben Gamari bgamari.f...@gmail.com wrote:

 On Thu, 10 Feb 2011 12:20:55 +, Daniel Barlow d...@telent.net wrote:
  My questions
 
  1) (How) can I filter on the X-Spam-Bar header to chop out spam and
  suspected spam?
 
 I simply run new mail through bogofilter in my sorting script and tag
 junk with a junk tag. I then add not tag:junk to each of my saved
searches.

One of my plans for after the custom query parser patches get reviewed is to
make it possible to exclude certain tags from queries by default, unless the
query specifically mentions the tag. This should make it easier to handle
spam tags like this.
___
notmuch mailing list
notmuch@notmuchmail.org
http://notmuchmail.org/mailman/listinfo/notmuch


Re: Spam and mailing list filtering?

2011-02-10 Thread Rob Browning
Ben Gamari bgamari.f...@gmail.com writes:

 I simply run new mail through bogofilter in my sorting script and tag
 junk with a junk tag. I then add not tag:junk to each of my saved searches.
  
 2) Is there any way (actual or planned) to filter on X-List-Id or
 similar so that I can filter mailing list stuff more accurately

Also note that if you're already filtering via procmail or similar, I
believe you should be able to use notmuch-deliver to add tags based on
the headers it finds.

Alternately, if you already deliver all spam to a particular folder, the
new folder: support might help.

-- 
Rob Browning
rlb @defaultvalue.org and @debian.org
GPG as of 2002-11-03 14DD 432F AE39 534D B592 F9A0 25C8 D377 8C7E 73A4
___
notmuch mailing list
notmuch@notmuchmail.org
http://notmuchmail.org/mailman/listinfo/notmuch


Re: Remote usage script updated

2011-02-10 Thread Michal Sojka
On Wed, 09 Feb 2011, Jesse Rosenthal wrote:
 By the way, I've also realized that most attachment downloading is done
 not by --format=raw but by notmuch part. It's possible to do caching
 there as well. There are a few options there:
 
   * One option would be to just cache by the attachment number -- but
 this is very fragile if you delete an attachment through mutt or
 some other client that allows it.

This is obviously the simplest possibility. I do not know how often do
you (and others) delete attachments, but I do it rarely and usually I
delete all the attachments of the messages older than something. So this
would work for me.

Additionally, I have added the following command to the attachment
fetching code: 

find ${CACHE} -ctime +$CACHE_MAX_DAYS -and -mtime +$CACHE_MAX_DAYS -print0 
| xargs -r -0 rm -f

It deletes attachments older than one week (in my case) so it there is
some inconsistency, it would not last more than one week.

   * cache by the hash of the attachment. The idea is that asking the
 server to fetch it, hash it, and send the hash would still save
 time over sending the whole attachment. Probably -- though most
 attachments are small enough and most connections are fast enough
 that this might not actually matter.

If people do more weird things with their attachments, this is probably
the best solution.

   * Actually stick the attachment hash in the json output in the first
 place. But this would be a lot of trouble for a small gain for
 very few.

Agreed.

-Michal
___
notmuch mailing list
notmuch@notmuchmail.org
http://notmuchmail.org/mailman/listinfo/notmuch