We are in the process of releasing TPC-DS benchmarks for Kylin to compare
against Hive.

Also, I do not see Kylin competing with SQL-on-Hadoop solutions like
Impala, but complementing them. There is a subset of SQL workloads that can
be represented in a classic star schema and allow for pre-aggregation;
that is where Kylin will do better.
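
As a rough illustration of the pre-aggregation idea, here is a toy Python
sketch. It is not Kylin's actual cube engine; the fact rows, dimension names,
and function names are all made up for the example. It precomputes
SUM(sales_amount) for every combination of dimensions, so that a query becomes
a lookup instead of a scan over the fact rows.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical fact rows from a star schema: (region, product, sales_amount).
FACTS = [
    ("US", "phone", 100),
    ("US", "laptop", 250),
    ("EU", "phone", 80),
    ("EU", "phone", 20),
]
DIMENSIONS = ("region", "product")


def build_cube(facts):
    """Pre-aggregate SUM(sales_amount) for every subset of dimensions."""
    cube = {}
    for size in range(len(DIMENSIONS) + 1):
        for dims in combinations(range(len(DIMENSIONS)), size):
            totals = defaultdict(int)
            for row in facts:
                key = tuple(row[i] for i in dims)
                totals[key] += row[-1]
            cube[tuple(DIMENSIONS[i] for i in dims)] = dict(totals)
    return cube


def scan_total(facts, region):
    """Runtime aggregation: scan every fact row at query time."""
    return sum(amount for r, _, amount in facts if r == region)


# Built once, ahead of query time (illustrative only).
CUBE = build_cube(FACTS)


def cube_total(region):
    """Pre-aggregated lookup: no scan over the fact rows at query time."""
    return CUBE[("region",)][(region,)]
```

Both `scan_total(FACTS, "EU")` and `cube_total("EU")` give the same total, but
the second does not touch the fact rows at query time. The trade-off is the
up-front build cost and the storage needed for the precomputed combinations.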

Regards
Seshu


On 5/27/15, 7:30 PM, "Luke Han" <[email protected]> wrote:

>Hi Sun,
>    There's no benchmark from our side yet, especially in a prod env. I'm
>also very curious to know if anyone has done such a comparison.
>
>    The direct advantages of Kylin over Impala (and other MPP
>solutions):
>    1. Non-invasive design: you do not need to install any agent, library,
>or anything else in your existing Hadoop cluster (neither on the NameNode
>nor on the DataNodes).
>    2. Pre-calculated results avoid runtime scans/aggregation, which means
>you can get results faster, with latency in seconds over billions of
>records.
>
>
>     Thanks.
>
>Luke
>
>
>Best Regards!
>---------------------
>
>Luke Han
>
>2015-05-28 10:20 GMT+08:00 [email protected] <[email protected]>:
>
>> Hi, team
>>
>> I'm really interested in a performance comparison between Apache Kylin
>> and Cloudera Impala, and in their respective native design advantages.
>> According to the official description, Cloudera Impala offers
>> "Lightning-fast, distributed SQL queries for petabytes of data stored in
>> Apache Hadoop clusters". Kylin can reach 10-1000x the query efficiency of
>> Hive for MOLAP use cases, while Cloudera Impala can also achieve a large
>> performance improvement over Hive.
>>
>> The question is: has Kylin done any benchmark test or performance
>> comparison with Cloudera Impala in a production environment? What would
>> be the direct advantages of Apache Kylin over Cloudera Impala?
>>
>> If anyone has deployed and used both products, please kindly share any
>> suggestions you have.
>>
>> Best regards,
>>
>> Sun.
>>
>>
>>
>> [email protected]
>>
