I'm wrapping up designs on streaming cubing and inverted-index. Will
publish here soon.

On Thu, Feb 5, 2015 at 2:50 PM, Luke Han <[email protected]> wrote:

> Forward to mailing list for further discussion.
>
> 在 2014年12月30日星期二 UTC+8下午6:30:17,Li Yang写道:
>>
>> What Xu described is the ultimate goal in this direction -- realtime!
>>
>> For this feature, it is less in scope by focusing on the micro segment of
>> cube building. The goal is to reduce cube data delay to be within one hour.
>>
>> Realtime is on the long term roadmap of course!  We will come to it when
>> inverted index is mature.
>>
>> Cheers
>> Yang
>>
>>
>> 在 2014年12月25日星期四UTC+8下午1时14分58秒,Luke Han写道:
>>>
>>> Github Issues for tracking this new feature: https://github.com/
>>> KylinOLAP/Kylin/issues/262
>>>
>>>
>>>
>>> 在 2014年12月25日星期四UTC+8上午7时11分49秒,Branky Shao写道:
>>>>
>>>> Xu, thanks for your comment. For our internal use cases like Nous, the
>>>> query pattern is determinate. Defining "popular path" can be treated as an
>>>> additional step in cube modeling and only queries on "popular path" are
>>>> most optimized.
>>>> On Tuesday, December 23, 2014 6:37:23 PM UTC-8, Jiang Xu wrote:
>>>>>
>>>>> "popular path" is a kind of partial cube materialization that is
>>>>> highly depended on the query pattern. Since we can't predict the user's
>>>>> query pattern, I have some concerns about the query performance.
>>>>>
>>>>> I think that the more important thing is to bring real-time capability
>>>>> into Kylin. It includes 3 things:
>>>>>
>>>>> 1. real time build in-memory storage.
>>>>> I suggest to build bitmap index instead of cube in memory. Kafka &
>>>>> Storm is a good option.
>>>>>
>>>>> 2. real-time query in-memory storage.
>>>>> We need a distributed execution engine for real-time query. I have
>>>>> some concerns about the RPC mechanism of Storm.
>>>>>
>>>>> 3. dump in-memory storage into hadoop.
>>>>> Storm is a good opinion for it.
>>>>>
>>>>>
>>>>>

Reply via email to