Mark wrote:
-----Original Message-----
From: Federico Giannici [mailto:[EMAIL PROTECTED] Sent: woensdag 15 november 2006 10:31
To: users@spamassassin.apache.org
Subject: Bayes column 'token'


Last week we migrated our bayes DB from DBM to MySQL.
Now we have upgraded our MySQL server from version 4.0 to 4.1.

Today I found a couple of duplicate index values in the "token" column of "bayes_token" table.

This field is defined as char(5) with default collation
(that is "latin1_swedish_ci"). Is it the correct one?

Well, bayes_mysql.sql does not specify collation; so, like you said, the
collation will be your MySQL server-set default. And searches in MySQL
are case-insensitive by default. Might indeed perhaps be a good idea
to convert to "latin1_bin" or some such.

There will be any problem if I convert the current data to the new collation?


There is, btw, now that I look at it, a small bug in:

CREATE TABLE bayes_token (
  id int(11) NOT NULL default '0',
  token char(5) NOT NULL default '',
  spam_count int(11) NOT NULL default '0',
  ham_count int(11) NOT NULL default '0',
  atime int(11) NOT NULL default '0',
  PRIMARY KEY (id, token),
  INDEX bayes_token_idx1 (token),
  INDEX bayes_token_idx2 (id, atime)
) TYPE=MyISAM;

PRIMARY for `id` and `token` should not have INDEX for `id` and `token`
added, too.

I don't understand what you mean.
The couple (id, token) is PRIMARY, not INDEX...
Where exactly is the problem?


Thanks.

--
___________________________________________________
    __
   |-                      [EMAIL PROTECTED]
   |ederico Giannici      http://www.neomedia.it
___________________________________________________

Reply via email to