[DISCUSS]Apache Kylin 2.0 Release Features & Criteria

Adunuthula, Seshu Sun, 31 Jan 2016 09:49:31 -0800

Hello Folks,

We are actively working towards Apache Kylin 2.0 Release and would like a 
discussion with the community on what they would like to see in 2.0 release of 
the product. We have three big rock items we are working towards in 2.0 and lot 
of additional minor feature enhancements.


Streaming Data Source support.
This feature is semi baked in where the source of Kylin Cubes is Kafka Topics. 
Cube Segment are built on micro batches of messages arriving on Kafka topics. 
Currently a lot of work is going on to productize this feature. Primary areas 
of work are Stream Processing Engines/Frameworks to process the micro batches 
and UI to support out of the box integration of Kafka topics with Kylin Cubes.

Spark based Cube building Engine.
The initial performance numbers for a Spark based cubing engine did not show 
substantial improvement over MR based engine, but would like this feature to be 
baked in for the 2.0 Release. Lot of work underway to stabilize this feature.

Amazon EMR Integration
We had initial conversations with Amazon EMR to support Apache Kylin on Amazon 
EMR which was received well. With Kylin 2.0 Apache Kylin will be enabled 
feature on Amazon EMR. Limited work has gone into this area, but this will be 
an important milestone for 2.0

We are also working towards creating an area for community driven improvements 
page similar to Apache Kafka’s KIP 
https://cwiki.apache.org/confluence/display/KAFKA/Kafka+Improvement+Proposals. 
Stay tuned.

Regards
Seshu Adunuthula

[DISCUSS]Apache Kylin 2.0 Release Features & Criteria

Reply via email to