Can someone help me answer this question? I am still waiting for an answer.

On Mon, Aug 15, 2016 at 11:28 AM, 张天生 <[email protected]> wrote:

> I have a dimension, user_tags, that is a multi-value column. For example, a
> value looks like "1_3_12_15_27_35_...", with the values separated by "_". As
> far as I know, Kylin cannot process a multi-value column directly; it must
> first be preprocessed into a single-value column. But that increases the
> record count by 50 to 100 times, and the data becomes too big. So is there a
> way to handle a multi-value dimension without splitting each value into many
> records? When calculating the dimension cardinality, Kylin could read the
> original data, split each value automatically, and process the parts, which
> would save disk I/O and CPU time.
>
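
To make the request concrete, here is a minimal Python sketch of the idea. It is not Kylin code and does not use any Kylin API; the function names and the sample rows are made up for illustration. It counts distinct tags by splitting each value in memory, and compares that with the number of rows the preprocessing step would produce.

```python
from typing import Iterable, Set


def tag_cardinality(rows: Iterable[str], sep: str = "_") -> int:
    """Count distinct tags without writing one record per tag (hypothetical helper)."""
    seen: Set[str] = set()
    for value in rows:
        # Split the multi-value string in memory; skip empty pieces
        # produced by leading, trailing, or doubled separators.
        seen.update(tag for tag in value.split(sep) if tag)
    return len(seen)


def exploded_row_count(rows: Iterable[str], sep: str = "_") -> int:
    """Number of records created if every tag became its own row (hypothetical helper)."""
    return sum(len([tag for tag in value.split(sep) if tag]) for value in rows)


# Illustrative sample data only, not taken from a real table.
sample = ["1_3_12_15_27_35", "3_12_40", "1_27"]
print(tag_cardinality(sample), exploded_row_count(sample))
```

With 50 to 100 tags per record, the second number grows to 50 to 100 times the original record count, while the first approach still reads each original record only once.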
