https://bugzilla.wikimedia.org/show_bug.cgi?id=53751
--- Comment #3 from Matt Walker <mwal...@wikimedia.org> --- (In reply to comment #2) > Does this deserve a more thorough investigation? So, in doing a more thorough look through I noticed that in fact: * All table rows in CentralNotice are binary types * The default character set and collation for all the wikis that I checked is binary ** As CN does not specify a table charset/collation we take the databases which is binary -- which is why all our columns are binary. The answer to the historical question of "Why are all our tables by default binary charset/collation instead of utf8?" is apparently: (05:00:55 PM) ori-l: mysql's 'utf8' wasn't real utf8 until a fairly recent version (5.1 iirc) (05:00:55 PM) ori-l: it was restricted to the basic multilingual plane (05:01:24 PM) ori-l: which is a problem if you want to support certain languages (05:01:31 PM) ori-l: and we do -- so -- I'm going to send an email to the wikitech list about how to resolve this problem. -- You are receiving this mail because: You are on the CC list for the bug. _______________________________________________ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l