Re: Rentention-rules and pilerpurge

2018-11-05 Thread sj




Hello Marina,

On 2018-11-05 13:35, Frau Marina Diezler wrote:


I turned on periodic purge in the backend so that calling pilerpurge
is allowed to delete anything at all. We created some retention rules,
for example all mails sent from our gitlab-instance on another host
should be deleted after 30 days. We added some archiving rules saying
that everything mared as spam or containing \*SPAM\* in the subject
should  be ignored and that the existing spam should be deleted via
retention rule following the same identification rules and giving a
retention time of 1 day. But I can still see "fresh" Spam coming into
the archive and the old spam-messages still being held where they are.


note that the rules (both archiving and retention) affects only new
incoming emails. Already stored emails are not affected.




When I started pilerpurge it deleted only about 100 messages which is
way to few. The retention rules  should meet thousands of messages and
in addition all emails older than 3561 days should be deleted too but
the amount of pilerpurge'd mails so far wasn't so big.

If I try to pilertest an arbitrary message then I never get readable
infos  but always something like this:

root@piler:/var/piler/store/00/5a1/3d/38# pilertest
40005a105109082cef0400aae4733d38.m
locale: de_DE.UTF-8
build: 955
parsing...
post parsing...
message-id: null /
74234e98afe7498fb5daf1f36ac2d78acc339464f950703b8c019892f982b90b
from: * ()*
to: * ()*
reference: **
subject: **
body: **
sent: 1541417598, delivered-date: 0
hdr len: 0
body digest:
d10a0fcad8f1fe6dee72f70a4d9888604ddfaf3e98b0309ebe23ecabdcd6c542
rules check: (null)
folder: 0
retention period: 1856863998
attachments:
direction: 0
spam: 0
NOT IN mydomains

Is this supposed to be like this?


no. Anything in /var/piler/store is an encrypted and compressed file,
pilertest can't handle it. Instead use pilerget to rertieve it, eg.
pilerget 40005a105109082cef0400aae4733d38 > /tmp/1.eml

and then analyze it with pilertest:

pilertest /tmp/1.eml





Do you have any idea what I should
do  so that more messages are being purged? Especially the retention
period seems suspect I think...


For already archived spam, you should identify them in the metadata 
table,
eg. check the subject field. Then set the retained column for such 
messages
only to the past, eg. a yesterday timestamp, and pilerpurge will take 
care

of them.

Note that such a late job will be slow and pretty ineffective. As you 
stated
above: it's best to specify both archiving and retention rules before 
the 1st

message arrives.



And another question is: Do I understand this correctly: purged mails
still show up forever in the browser backend and only disappear if I
rebuild the index via command line? Is there a way to find out in


No. Once pilerpurge has done, it sets the 'deleted' column to 1 which
would participate for the sphinx k-list (kill list) to suppress the
already purged emails from the gui.

Of course, rebuilding the index is another option since the reindex tool
honors the deleted column.


Janos



Rentention-rules and pilerpurge

2018-11-05 Thread Frau Marina Diezler

Hello,

about one year ago I set up a piler instance for our emails - I found  
out a lot by myself, but now I do need some help please.


As I don't dare at the moment to update the system we still use piler
1.3.0. After installation on a debian jessie server I tried to follow  
all the steps and archiving everything worked fine. I imported old  
emails back down to the year 2001 from pst-files via pilerimport, all  
was good so far and archiving worked/works.


As you can imagine our archive is filling up with mails, spam messages  
are archived as well as notifications from gitlab for example, there  
is a lot of overhead from other resources too.


I know that we should have had these rule before but there was no time  
to create them (the lack of manpower stood against the urge to create  
an archive at all). Now we slowly come to the point where we want to  
create archiving and retention rules but things aren't working as I  
expected.


I turned on periodic purge in the backend so that calling pilerpurge  
is allowed to delete anything at all. We created some retention rules,  
for example all mails sent from our gitlab-instance on another host  
should be deleted after 30 days. We added some archiving rules saying  
that everything mared as spam or containing \*SPAM\* in the subject  
should be ignored and that the existing spam should be deleted via  
retention rule following the same identification rules and giving a  
retention time of 1 day. But I can still see "fresh" Spam coming into  
the archive and the old spam-messages still being held where they are.


When I started pilerpurge it deleted only about 100 messages which is  
way to few. The retention rules should meet thousands of messages and  
in addition all emails older than 3561 days should be deleted too but  
the amount of pilerpurge'd mails so far wasn't so big.


If I try to pilertest an arbitrary message then I never get readable  
infos but always something like this:


root@piler :/var/piler/store/00/5a1/3d/38# pilertest  
40005a105109082cef0400aae4733d38.m

locale: de_DE.UTF-8
build: 955
parsing...
post parsing...
message-id: null /  
74234e98afe7498fb5daf1f36ac2d78acc339464f950703b8c019892f982b90b

from: * ()*
to: * ()*
reference: **
subject: **
body: **
sent: 1541417598, delivered-date: 0
hdr len: 0
body digest: d10a0fcad8f1fe6dee72f70a4d9888604ddfaf3e98b0309ebe23ecabdcd6c542
rules check: (null)
folder: 0
retention period: 1856863998
attachments:
direction: 0
spam: 0
NOT IN mydomains

Is this supposed to be like this? Do you have any idea what I should  
do so that more messages are being purged? Especially the retention  
period seems suspect I think...


And another question is: Do I understand this correctly: purged mails  
still show up forever in the browser backend and only disappear if I  
rebuild the index via command line? Is there a way to find out in  
which folder in the pilerstore at /var/piler/store/00/ the message is  
stored so I can check there against what the browser backend says?


Thanks for your help, any hints are appreciated.
Regards,
Marina