On 9 June 2024 at 08:33, Sutou Kouhei wrote:
Fields:
| Name   | Type          | Comments |
|--------|---------------|----------|
| column | utf8          | (2)      |
| key    | utf8 not null | (3)      |
1. Should the key be something like `dictionary(int32, utf8)` to make
the representation more efficient where there are many columns?
Dictionary is more efficient. But then we would need to standardize
not only the keys but also the ID -> key mapping.
I don't get why we would need to standardize ID -> key mapping. The key
names would be significant, the dictionary mapping is just for efficiency.
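
To illustrate the point, here is a minimal sketch in plain Python (not the Arrow API) of how a dictionary-encoded key column could be decoded. The key names `null_count` and `max` and the helper names are hypothetical, chosen only for this example. Two producers assign different IDs to the same keys, yet a consumer that looks at the decoded key names sees the same column:

```python
def dictionary_encode(values):
    """Encode strings as (dictionary, indices), assigning IDs in first-seen order."""
    dictionary = []
    positions = {}
    indices = []
    for value in values:
        if value not in positions:
            positions[value] = len(dictionary)
            dictionary.append(value)
        indices.append(positions[value])
    return dictionary, indices


def dictionary_decode(dictionary, indices):
    """Map each integer ID back to the string it stands for."""
    return [dictionary[i] for i in indices]


# Hypothetical key names; the thread does not fix them in this message.
# Producer A happens to assign null_count -> 0, max -> 1.
dict_a, ids_a = ["null_count", "max"], [0, 1, 0]
# Producer B happens to assign max -> 0, null_count -> 1.
dict_b, ids_b = ["max", "null_count"], [1, 0, 1]

keys_a = dictionary_decode(dict_a, ids_a)
keys_b = dictionary_decode(dict_b, ids_b)

# The IDs differ, but the decoded keys (the significant part) are identical.
same_keys = keys_a == keys_b
different_ids = ids_a != ids_b
```

Under this reading, only the set of key names would need to be standardized; each producer would be free to pick its own ID assignment, since the dictionary travels with the data.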
It may be complex to support full multi-column statistics
use cases. How about standardizing this without
multi-column statistics support in the first version?
That's OK with me.
Regards
Antoine.