If expressions are supported, the storage engine will changed when the
function column is built automatically.
Also the mv selector need a expression calculation tool to judge whether mv
could be selected for query.

After Doris support the count_distinct and count, I will consider this
feature.

陈明雨 <morning...@163.com> 于2020年4月16日周四 下午10:50写道:

> That could be good.
>
>
> However, if expressions are supported, the function of automatically
> selecting rollup may fail.
> Because the reason why Doris can automatically select the rollup is based
> on keeping the column values consistent.
> When the column value generates a new value through the expression, the
> column will no longer be consistent.
>
>
> Therefore, we may need a completely new design, such as the "relevant
> table" conception to create views,
> and maintain the data consistency of views through the function of atomic
> load operation.
>
>
>
>
> --
>
> 此致!Best Regards
> 陈明雨 Mingyu Chen
>
> Email:
> chenmin...@apache.org
>
>
>
>
>
> At 2020-04-16 17:40:34, "Zhao Chun" <zh...@apache.org> wrote:
> >Hi Ling
> >
> >Very glad to see this proposal.
> >
> >We can discuss whether to support expression operations.
> >
> >For the case that expressions are applied on the KEY column. For example
> we
> >can easily get a monthly table
> >through a rollup which is defined by "group by to_month(day_column)".
> >
> >For the case that expression are applied on the value column. For example
> >we can get a PV aggregate table
> >through a rollup which is defined by "sum(case event_column when
> >"page_view" then 1 else 0 end)".
> >
> >Thanks,
> >Zhao Chun
> >
> >
> >ling miao <emmymia...@gmail.com> 于2020年4月16日周四 下午5:13写道:
> >
> >> Hi everyone,
> >>
> >> *The status of materialized view 1.0*
> >> In the present, we have supported the materialized views in Doris 0.12
> >> version. The materialized view selector supports to select the most
> >> efficient mv and rewrite the SQL to query against the selected mv
> instead
> >> of the base table.
> >> For query results contain a small number of rows where the original
> table
> >> has a large amount of data, the performance can reach the 5X to 100X
> times
> >> depends on the cardinality of the data.
> >> The aggregate functions supported by the materialized view in 0.12
> include:
> >> sum, min, max.
> >>
> >> However, the aggregate functions supported by the current materialized
> view
> >> are not rich enough to fully cover the user's scene.
> >> For example, in the `Order` scenario, user needs to analyze the number
> of
> >> orders in different dimensions.
> >> Another example is the count_distinct function is used for analyzing PV
> and
> >> UV data in website traffic.
> >>
> >> *The goal of materialized view 2.0*
> >> In order to support more scenarios, the materialized view 2.0 will
> support
> >> the following functions:
> >>
> >> 1. Materialized view supports aggregate functions: count, count_distinct
> >> (bitmap and hll)
> >> 2. Support to create materialized views of the same column with
> different
> >> aggregate functions. For example: select k1, sum (v1), min (v1) from
> table
> >> group by k1
> >>
> >>
> >> What features do you want the materialized view 2.0 to support?
> >>
> >> Looking forward to your idea~
> >>
> >> LingMiao
> >>
>

Reply via email to