We are in the process of releasing TPC-DS bench marks for Kylin to compare against Hive.
Also I do not see Kylin completing with SQL on Hadoop Solutions like Impala but complementing them. There is a subset of SQL Workload that can be represented in a classic star schema format and allow for pre-aggregation where Kylin will do better. Regards Seshu On 5/27/15, 7:30 PM, "Luke Han" <[email protected]> wrote: >Hi Sun, > There's no benchmark from our side yet,especially in prod env. I'm >also >very curious to know if someone did such comparison. > > The direct advantage for Kylin over Impala (include other MPP >solution): > 1. Non-Invasive Design: you do not need to install any agent, library >or others in your existing Hadoop Cluster (Neither on Namenode or >DataNode) > 2. Pre-Calculation result avoid runtime scan/aggregation, that mean >you >could get result more faster in seconds latency over billions data. > > > Thanks. > >Luke > > >Best Regards! >--------------------- > >Luke Han > >2015-05-28 10:20 GMT+08:00 [email protected] <[email protected]>: > >> Hi, team >> >> Really interested in the performance comparison and also the native >>design >> advantage over Apache Kylin >> >> and Cloudera Impala. As the official saying, Cloudera Impala is a >> "Lightning-fast, distributed SQL queries >> >> for petabytes of data stored in Apache Hadoop clusters". Kylin can goes >>to >> 10-1000x query efficiency over >> >> hive in the usage of MOLAP, while Cloudera Impala can also achieve much >> more performance upgrade over >> >> hive. >> >> Question is : Does Kylin do some benchmark test or performance >>comparison >> with Cloudera Impala in production >> >> environment? What can be the direct advantage for Apache Kylin over >> Cloudera Impala? >> >> If anyone had deployed and used both products in your usage, please >>kindly >> share any available suggestions. >> >> Best regards, >> >> Sun. >> >> >> >> [email protected] >>
