https://bugzilla.wikimedia.org/show_bug.cgi?id=47368

--- Comment #8 from Ori Livneh <[email protected]> ---
The reason we don't use MySQL's 'utf8' character encoding setting is, it turns
out, that it can only encode BMP (Basic Multilingual Plane) characters, which
are at most three bytes wide in UTF-8. Supplementary characters, which need
four bytes, cannot be encoded at all.
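
To make the three-byte limit concrete, here is a minimal Python sketch. The
helper names (utf8_width, fits_mysql_utf8) are made up for illustration and are
not part of EventLogging or any MySQL client; the check simply assumes that
MySQL's 'utf8' accepts exactly the code points up to U+FFFF.

```python
def utf8_width(ch: str) -> int:
    """Number of bytes a single character occupies when encoded as UTF-8."""
    return len(ch.encode("utf-8"))


def fits_mysql_utf8(text: str) -> bool:
    """Hypothetical check: True if every character lies in the BMP
    (code point <= U+FFFF), i.e. needs at most three bytes in UTF-8."""
    return all(ord(ch) <= 0xFFFF for ch in text)


if __name__ == "__main__":
    # U+00E9 and U+20AC are BMP characters; U+1F600 is supplementary.
    for ch in ("a", "\u00e9", "\u20ac", "\U0001F600"):
        print(f"U+{ord(ch):04X}: {utf8_width(ch)} byte(s), "
              f"fits in 'utf8': {fits_mysql_utf8(ch)}")
```

Any string for which fits_mysql_utf8 returns False contains at least one
character that the three-byte 'utf8' setting cannot store.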

So I *do* think we need to change the character encoding, but to what?

Our databases run MariaDB 5.5, which supports a newer character encoding called
'utf8mb4', which offers true UTF-8 support. I'd strongly prefer that to
'binary', in the interest of being explicit and reducing the chance that a
client will default to some other value based on the system locale or what have
you. But this would then leave Yuvi's issue unresolved: the EventLogging
database would still be slightly different from the rest of the cluster.
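
As a rough sketch of what "being explicit" could look like, the Python snippet
below only builds the SQL statements; it does not connect to anything. The
function name, the database name 'eventlogging_example' and the choice of the
utf8mb4_unicode_ci collation are assumptions for illustration, not our actual
configuration.

```python
def utf8mb4_statements(database: str) -> list[str]:
    """Build SQL that pins both the connection and the database default
    to utf8mb4. Hypothetical helper; names and collation are assumptions."""
    # Reject anything but simple identifiers so the name can be quoted safely.
    if not database.replace("_", "").isalnum():
        raise ValueError(f"unexpected database name: {database!r}")
    return [
        # Sets the character set for this client connection explicitly,
        # instead of relying on a locale-derived default.
        "SET NAMES utf8mb4;",
        # Changes the database default, which applies to tables created
        # afterwards; existing tables keep their current character set.
        f"ALTER DATABASE `{database}` "
        "CHARACTER SET utf8mb4 COLLATE utf8mb4_unicode_ci;",
    ]


if __name__ == "__main__":
    for statement in utf8mb4_statements("eventlogging_example"):
        print(statement)
```

Existing tables would need their own conversion, which this sketch does not
attempt.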

Thoughts?
