Hi everyone, bug 18336 (implementing utf8mb4 encoding) [1] is now marked
Failed QA (it should probably be In Discussion).

It failed because the encoding change from utf8 to utf8mb4 also alters all
*TEXT fields (which I didn't notice), so there's a discrepancy between an
updated schema and a fresh one. Fixing it seems easy: I just need to change
all fields like this in kohastructure.sql:


As I understand it, *TEXT columns only store in the row itself the length,
in bytes, of the object (which is kept in the external object storage) [2].
Given this, the changes described above mean this size change per value:

2 bytes => 3 bytes
3 bytes => 4 bytes
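
To put a number on that, here is a rough sketch of the arithmetic. It
assumes the 2 => 3 and 3 => 4 figures above are the length prefixes of
TEXT, MEDIUMTEXT and LONGTEXT as listed in the MySQL storage requirements,
and that the conversion promotes TEXT to MEDIUMTEXT and MEDIUMTEXT to
LONGTEXT. The PROMOTION table and the extra_bytes helper are illustrative
names, not anything from Koha.

```python
# Length-prefix sizes in bytes for the MySQL *TEXT types (L + n storage rule).
LENGTH_PREFIX_BYTES = {"TINYTEXT": 1, "TEXT": 2, "MEDIUMTEXT": 3, "LONGTEXT": 4}

# Assumption: the type promotion caused by the utf8 -> utf8mb4 conversion.
PROMOTION = {"TEXT": "MEDIUMTEXT", "MEDIUMTEXT": "LONGTEXT"}


def extra_bytes(column_type: str, rows: int) -> int:
    """Extra length-prefix bytes for `rows` stored values after promotion."""
    new_type = PROMOTION.get(column_type, column_type)
    delta = LENGTH_PREFIX_BYTES[new_type] - LENGTH_PREFIX_BYTES[column_type]
    return delta * rows


if __name__ == "__main__":
    for col in ("TEXT", "MEDIUMTEXT"):
        print(col, "->", PROMOTION[col], ":",
              extra_bytes(col, 1_000_000), "extra bytes per million values")
```

Under those assumptions the cost is one extra byte per stored value, so
about 1 MB for every million values in an affected column.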

The move to utf8mb4 is needed, and is important for supporting
supplementary characters (Japanese, Chinese, emojis, etc.). Do you have a
different understanding of how this could impact (say) space? Do you think
we should move forward with it?
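
For anyone who wants to check which characters are affected, a minimal
sketch: MySQL's utf8 stores at most 3 bytes per character, so any character
whose UTF-8 encoding takes 4 bytes needs utf8mb4. The needs_utf8mb4 helper
and the sample strings are only illustrative.

```python
def needs_utf8mb4(text: str) -> bool:
    """True if any character takes 4 bytes in UTF-8 (outside the BMP)."""
    return any(len(ch.encode("utf-8")) == 4 for ch in text)


SAMPLES = {
    "ascii": "Koha",
    "common CJK (BMP)": "\u65e5\u672c\u8a9e",     # three common kanji
    "CJK Extension B": "\U0002000B",              # a rarer ideograph
    "emoji": "\U0001F600",                        # grinning face
}

if __name__ == "__main__":
    for label, sample in SAMPLES.items():
        sizes = [len(ch.encode("utf-8")) for ch in sample]
        print(label, sizes, needs_utf8mb4(sample))
```

Common CJK characters fit in 3 bytes; it is the rarer ideographs and the
emojis that get rejected or truncated without utf8mb4.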

I think we should move forward and will provide a follow-up patch right now,
but more opinions are needed here.

Thanks in advance

[1] https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=18336
[2] https://dev.mysql.com/doc/refman/5.7/en/string-type-overview.html

Tomás Cohen Arazi
Theke Solutions (https://theke.io)
✆ +54 9351 3513384
Koha-devel mailing list
website : http://www.koha-community.org/
git : http://git.koha-community.org/
bugs : http://bugs.koha-community.org/