https://bugzilla.wikimedia.org/show_bug.cgi?id=53751

--- Comment #3 from Matt Walker <mwal...@wikimedia.org> ---
(In reply to comment #2)
> Does this deserve a more thorough investigation?

So, in doing a more thorough look through I noticed that in fact:
* All table rows in CentralNotice are binary types
* The default character set and collation for all the wikis that I checked is
binary
** As CN does not specify a table charset/collation we take the databases which
is binary -- which is why all our columns are binary.

The answer to the historical question of "Why are all our tables by default
binary charset/collation instead of utf8?" is apparently:

(05:00:55 PM) ori-l: mysql's 'utf8' wasn't real utf8 until a fairly recent
version (5.1 iirc)
(05:00:55 PM) ori-l: it was restricted to the basic multilingual plane
(05:01:24 PM) ori-l: which is a problem if you want to support certain
languages
(05:01:31 PM) ori-l: and we do

-- so --
I'm going to send an email to the wikitech list about how to resolve this
problem.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to