On Tue, Apr 05, 2011 at 04:17:56PM +0200, Elias Oltmanns wrote: > Kenneth Marshall <k...@rice.edu> wrote: > > On Mon, Apr 04, 2011 at 11:46:51PM +0200, Stevan Baji?? wrote: > >> On Sun, 03 Apr 2011 17:31:44 +0200 > > > >> Elias Oltmanns <e...@nebensachen.de> wrote: > >> > >> > Hi there, > >> > > >> Hallo Elias, > >> > >> > >> > switching from CHAIN to OSB tokenizer, I understand, makes the old > >> > tokens mostly useless or even harmful since OSB might achieve better > >> > accuracy when starting from scratch. I wouldn't mind losing most of the > >> > tokens if it wasn't for the automatic whitelisting information. So, My > >> > question is whether there might be any practical way to keep the > >> > whitelist information during a transition from CHAIN to OSB (or any > >> > other combination of tokenizers for that matter). > >> > > >> > Thanks in advance for any advice you can give me, > >> > > [...] > > As far as keeping the old whitelisting tokens, if > > you have archives of good mail, it should be possible to calculate > > the whitelist token hash manually and make a list of the tokens > > to save in the DB. > > Yes, I've started thinking along those lines too. However, I don't seem > to be able to *guess* how these tokens are assembled. In the > documentation it explicitly states that the whole From: header is used > for the whitelist feature. Yet > > $ dspam_dump userid "From*Elias+Oltmanns+<e...@nebensachen.de>" > > produces no hits. Does anyone of you know off the top of your head what > the correct query should look like? I can look in the sources myself > once I've got some more spare time on my hands. Then again, I'm not too > sure anymore whether it is really worth bothering with those whitelist > tokens. > > Regards, > > Elias >
Hi Elias, Stevan already sent you the correct query to look at the whitelist tokens. The tokens are valuable for performance on correspondance from "known" senders. Personally, I would not bother with migrating them and just have them be reset as they get processed in the new DB. Cheers, Ken ------------------------------------------------------------------------------ Xperia(TM) PLAY It's a major breakthrough. An authentic gaming smartphone on the nation's most reliable network. And it wants your games. http://p.sf.net/sfu/verizon-sfdev _______________________________________________ Dspam-user mailing list Dspam-user@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspam-user