Hi Hari,

I see now; it is very similar to this integration with spark:
https://kylin.apache.org/docs/tutorial/spark.html

Spark can register a SQL as a temp table, and then use that temp table for
subsequent analysis. But other engines may not have this function, so a
common view in Kylin may help to fill that gap, am I correct?



hosur narahari <[email protected]> 于2018年10月2日周二 下午12:48写道:

> Hi ShaoFeng,
>
> It could be either like lambda architecture or just like merging cube data
> with latest data for which cube has not yet been generated(in case of
> periodic cube generation).
>
> Now coming to aggregate pushdown. I'll give a very simple scenario.
> Consider below query.
>
> *Select sum(price) from kylin_sales*
>
> when we execute it in any query engine like presto, spark, drill etc., it
> doesn't execute above query but execute *Select price from kylin_sales.* After
> getting all price values, it does *map-reduce* to calculate sum. It's
> good for normal data sources but not for cube, since this is already
> pre-calculated. Even if in future aggregate pushdown is provided, it'll be
> very limited because we can't pushdown everything. For example, what do we
> do when we've joins and then aggregate. And AFAIK, unlike indexes(used for
> predicate pushdown), JDBC driver doesn't provide any information on pre
> aggregated data.
>
> How does views, solve this problem?
>
> Let's say we create view  on above query like *Create view sum_view as
> Select sum(price) from kylin_sales;*
>
> In query engine we execute, *Select * from sum_view.* So kylin is queried
> for sum_view, which is basically *Select sum(price) from kylin_sales.* In
> this way, we can solve this problem using views.
>
> Please let me know, if I'm not being clear.
>
> Best Regards,
> Hari
>
> On Tue, Oct 2, 2018 at 8:52 AM ShaoFeng Shi <[email protected]>
> wrote:
>
>> Hi Hari,
>>
>> Sorry for the late response. Does it like a lambda architecture? How can
>> add a view to supporting aggregation pushdown? I'm not clear on this,
>> please elaborate. Thank you!
>>
>> hosur narahari <[email protected]> 于2018年9月30日周日 下午1:50写道:
>>
>>> Hi ShaoFeng,
>>>
>>> Is it possible to provide that support. Because, in many cases cubes are
>>> used with latest transaction data to get up to date analysis and some kind
>>> of query engine will be used for merging cube with non-cube data. And most
>>> of the query engines don't have aggregate pushdown, which makes it unable
>>> to query from cube. If we provide view, we can solve this problem for all
>>> query engines irrespective of whether they provide aggregate pushdown or
>>> not, making kylin more adaptable.
>>>
>>> Also it's just a conceptual view, not adding any overhead.
>>>
>>> Best Regards,
>>> Hari
>>>
>>> On Fri, Sep 28, 2018 at 7:40 PM ShaoFeng Shi <[email protected]>
>>> wrote:
>>>
>>>> Hi Hari,
>>>>
>>>> Creating view on top of kylin tables is not supported in Kylin. Kylin
>>>> is read-only.
>>>>
>>>> hosur narahari <[email protected]> 于2018年9月28日周五 下午1:09写道:
>>>>
>>>>> Hi,
>>>>>
>>>>> Is it possible to create views on the lines of rdbms or hive on top of
>>>>> kylin tables.
>>>>>
>>>>> Best Regards,
>>>>> Hari
>>>>>
>>>>
>>>>
>>>> --
>>>> Best regards,
>>>>
>>>> Shaofeng Shi 史少锋
>>>>
>>>>
>>
>> --
>> Best regards,
>>
>> Shaofeng Shi 史少锋
>>
>>

-- 
Best regards,

Shaofeng Shi 史少锋

Reply via email to