https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6037

           Summary: Bayes-SQL improvements
           Product: Spamassassin
           Version: 3.2.4
          Platform: Other
        OS/Version: All
            Status: NEW
          Severity: enhancement
          Priority: P5
         Component: Learner
        AssignedTo: [email protected]
        ReportedBy: [email protected]


Created an attachment (id=4410)
 --> (https://issues.apache.org/SpamAssassin/attachment.cgi?id=4410)
Patch for splitting the bayes_token table

All bayes tokens for all user are currently stored inside one huge table (if
Bayes is stored inside an SQL database). For several thousand users this
becomes a bottleneck, especially for bayes_expire. The patch below adds the
possibility to split the token table into several tables. Which user is
contained in which table is looked up from bayes_vars which has an additional
column "token_table". New user are automatically assigned to one table by using
their name's CRC32 checksum (could have been any other but this one was easiest
as it gives an int which can be used to derive a simple number for the token
table). This patch leads
to considerably lower loads on our machine and bayes_expire now only takes 
about 5 hours instead of 20 before when using 10 instead of 1 table.


-- 
Configure bugmail: 
https://issues.apache.org/SpamAssassin/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

Reply via email to