Le 09/06/2024 à 08:33, Sutou Kouhei a écrit :

Fields:
| Name           | Type                  | Comments |
|----------------|-----------------------| -------- |
| column         | utf8                  | (2)      |
| key            | utf8 not null         | (3)      |

1. Should the key be something like `dictionary(int32, utf8)` to make
the representation more efficient where there are many columns?

Dictionary is more efficient. But we need to standardize not
only key but also ID -> key mapping.

I don't get why we would need to standardize ID -> key mapping. The key names would be significant, the dictionary mapping is just for efficiency.

It may be complex to support full multi-column statistics
use cases. How about standardizing this without
multi-columns statistics support for the first version?

That's ok with me.

Regards

Antoine.

Reply via email to