Re: Support for Hive on Tez or Hive on Spark, cube build automation and best practices

2016-01-21 Thread Li Yang
In principle, Kylin does not do any scheduling, because only the upstream ETL knows when the data lands in Hive. That's why Kylin provides a REST API for upstream to call when the data is ready. On Sat, Jan 16, 2016 at 8:18 PM, hongbin ma wrote: > kylin invokes shell to
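As a rough illustration of the point above, an upstream ETL job could trigger a build once its data has landed. The sketch below only constructs the HTTP request and does not send it. The endpoint path (`/kylin/api/cubes/{cube}/rebuild`), the payload fields (`startTime`, `endTime`, `buildType`) and the use of Basic authentication are assumptions based on the Kylin REST documentation and should be checked against the installed version. The host, cube name and credentials are hypothetical.

```python
import base64
import json
import urllib.request


def build_rebuild_request(base_url, cube_name, start_ms, end_ms, user, password):
    """Build (but do not send) a request asking Kylin to build one cube segment.

    Assumed, not confirmed by this thread: the endpoint path, the payload
    field names, and Basic authentication.
    """
    url = f"{base_url}/kylin/api/cubes/{cube_name}/rebuild"
    payload = {"startTime": start_ms, "endTime": end_ms, "buildType": "BUILD"}
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        method="PUT",
        headers={
            "Authorization": f"Basic {token}",
            "Content-Type": "application/json",
        },
    )


if __name__ == "__main__":
    # Hypothetical host, cube name and credentials; timestamps are epoch millis.
    req = build_rebuild_request(
        "http://localhost:7070", "my_cube", 1453248000000, 1453334400000,
        "ADMIN", "secret",
    )
    print(req.get_method(), req.full_url)
    # The ETL job would call urllib.request.urlopen(req) as its final step.
```

Calling this as the last step of the ETL job keeps the scheduling decision upstream, which is the design the reply describes.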

Re: I got error first on kylin

2016-01-21 Thread Dong Li
Hi Kevin, The FileNotFoundException on startup is not a problem. See https://issues.apache.org/jira/browse/KYLIN-1298 Thanks, Dong Li 2016-01-21 18:26 GMT+08:00 kevin : > I will try hbase0.98 later ,but the error I got when start up is why? why > kylin need the

[jira] [Created] (KYLIN-1355) Add script to build a java-hotfix package, instead of a full package

2016-01-21 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-1355: Summary: Add script to build a java-hotfix package, instead of a full package Key: KYLIN-1355 URL: https://issues.apache.org/jira/browse/KYLIN-1355 Project: Kylin

[jira] [Created] (KYLIN-1356) use exec-maven-plugin for IT environment provision

2016-01-21 Thread hongbin ma (JIRA)
hongbin ma created KYLIN-1356: Summary: use exec-maven-plugin for IT environment provision Key: KYLIN-1356 URL: https://issues.apache.org/jira/browse/KYLIN-1356 Project: Kylin Issue Type: Bug

Re: I got error first on kylin

2016-01-21 Thread kevin
I will try HBase 0.98 later, but why do I get this error at startup? Why does Kylin need the jars below? java.io.FileNotFoundException: /{the url of my hadoop_home}/contrib/capacity-scheduler/*.jar 2016-01-21 17:48 GMT+08:00 hongbin ma : > we don't have stable release for

Re: I got error first on kylin

2016-01-21 Thread hongbin ma
We don't have a stable release for HBase 1.1.2 or higher. "Also, please aware this is not a formal release, and it is not fully tested:" The latest stable Kylin version is now 1.2, but it does not work with HBase 1.0+. On Thu, Jan 21, 2016 at 5:39 PM, kevin wrote: > hi, >I

Re: From the Build Base Cuboid Data step to Build N-Dimension steps, Too much time is taken.

2016-01-21 Thread 杨海乐
I found that the reason is the precision of the count distinct measure. The precision is 1.2%, so the steps are too slow even though the data is small (millions). Can I solve the problem by reducing the value of kylin.job.mapreduce.default.reduce.input.mb?

how does Kylin decide which cube to use for the SQL query?

2016-01-21 Thread zhong zhang
Hi All, After several cubes are built, we run a query in the UI. How does Kylin decide which cube to use for this query? My guess is that it is based on the join conditions in the data model? If we create two cubes with exactly the same data model (same join conditions) but with different

TableExistsException: kylin_metadata_acl

2016-01-21 Thread kevin
Hi all, when I start up Kylin, I get the error below: org.apache.hadoop.hbase.TableExistsException: kylin_metadata_acl but when I run desc 'kylin_metadata_acl' in the hbase shell, I get nothing. I have reformatted HDFS and rebuilt Hive's metastore database, but it still does not work.

Re: which hadoop version is matched?

2016-01-21 Thread Luke Han
Hi Kevin, Please refer to the latest documentation here: http://kylin.apache.org/docs/install/index.html Thanks. Luke Best Regards! - Luke Han On Fri, Jan 22, 2016 at 2:36 PM, kevin wrote: > *hi,all :* > *I installed hadoop2.7.1 \

Re: Re: how does Kylin decide which cube to use for the SQL query?

2016-01-21 Thread 13802880...@139.com
We have a case like this: CubeA: date_id, hour_id, service_type, user, count1, count2. The rowkey sequence is date_id+hour_id+service_type+user. This is fine when I select all the users who use serviceA, but finding all the services that userA used becomes very slow, so we create
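The effect described above can be sketched with a toy model of rows kept sorted by a composite rowkey. This is a simplified illustration, not Kylin's or HBase's actual scan logic, and all dimension values are made up. A filter on the leading rowkey columns narrows the read to one contiguous range, while a filter on the trailing `user` column alone has to examine every row.

```python
from bisect import bisect_left
from itertools import product


def build_rows():
    """Rows sorted by the rowkey order date_id + hour_id + service_type + user."""
    dates = ["20160120", "20160121"]
    hours = ["00", "01"]
    services = ["serviceA", "serviceB"]
    users = ["userA", "userB", "userC"]
    return sorted(product(dates, hours, services, users))


def prefix_scan(rows, prefix):
    """Return (matches, rows_read) for rows whose leading columns equal prefix.

    Because the rows are sorted, the matches form one contiguous range.
    """
    n = len(prefix)
    lo = bisect_left(rows, prefix)  # first row at or after the prefix
    hi = lo
    while hi < len(rows) and rows[hi][:n] == prefix:
        hi += 1
    return rows[lo:hi], hi - lo


def full_scan(rows, column_index, value):
    """Return (matches, rows_read) when filtering on a non-leading column."""
    matches = [r for r in rows if r[column_index] == value]
    return matches, len(rows)


if __name__ == "__main__":
    rows = build_rows()
    by_service, read_a = prefix_scan(rows, ("20160121", "00", "serviceA"))
    by_user, read_b = full_scan(rows, 3, "userA")
    print(f"leading-column filter: {len(by_service)} matches, {read_a} rows read")
    print(f"user-only filter: {len(by_user)} matches, {read_b} rows read")
```

In this toy data the leading-column filter reads only the rows it returns, whereas the user-only filter reads the whole table. That is one reason a second cube with a different rowkey order can help such queries.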

Re: Parameters supported for Kylin job compression!

2016-01-21 Thread Li Yang
http://kylin.apache.org/docs/install/advance_settings.html Here you can find the settings for compression. On Tue, Jan 19, 2016 at 8:33 PM, hongbin ma wrote: > you can comment out all entries containing snappy in kylin_hive_conf.xml, > kylin_job_conf.xml and kylin.properties

Re: TableExistsException: kylin_metadata_acl

2016-01-21 Thread kevin
It's a problem with HBase's ZooKeeper. 2016-01-22 13:25 GMT+08:00 kevin : > HI,all >when I start up kylin , I got the error below: > > org.apache.hadoop.hbase.TableExistsException: kylin_metadata_acl > > but when I use hbase shell : desc 'kylin_metadata_acl'

Re: Kylin service crash easily while building cube in HDP sandbox

2016-01-21 Thread Li Yang
What's the HDP version? Kylin 1.2 only works with HDP 2.2.4. Make sure you are NOT running on the latest HDP 2.3. On Tue, Jan 19, 2016 at 1:34 PM, 宋轶 wrote: > I remember we can config the service to be a job engine or a query engine. > > > From: mahong...@apache.org > > Date:

[jira] [Created] (KYLIN-1358) revisit on cube selection within same project

2016-01-21 Thread hongbin ma (JIRA)
hongbin ma created KYLIN-1358: Summary: revisit on cube selection within same project Key: KYLIN-1358 URL: https://issues.apache.org/jira/browse/KYLIN-1358 Project: Kylin Issue Type:

[Suggestion] Kylin Dev Document would explicitly tell dependent hive job jars

2016-01-21 Thread Edward Zhang
Hi, When starting cubing jobs in a Kylin dev environment, for example from IntelliJ, we should explicitly set the Hive-dependent jars, which include FactDistinctColumnsMapper etc. This is missing from http://kylin.apache.org/development/dev_env.html. We can tell developers to do the following: 1) In IDE, set

which hadoop version is matched?

2016-01-21 Thread kevin
Hi all: I installed hadoop2.7.1 \ hive1.2.1 \ zookeeper3.4.6 \ hbase-0.98.16.1-hadoop2 \ apache-kylin-1.2. When I build the sample cube, I get an error: [pool-5-thread-2]:[2016-01-22

Re: [Suggestion] Kylin Dev Document would explicitly tell dependent hive job jars

2016-01-21 Thread Luke Han
Hi Edward, Would you mind opening a JIRA and submitting a patch for this improvement? Thanks. Luke Best Regards! - Luke Han On Fri, Jan 22, 2016 at 2:07 PM, Edward Zhang wrote: > Hi, > When start cubing jobs in Kylin dev environment for example

Re: Re: how does Kylin decide which cube to use for the SQL query?

2016-01-21 Thread hongbin ma
I see, so cube selection should prefer cubes whose rowkey order better suits the current query. Any other scenarios? -- Regards, *Bin Mahone | 马洪宾* Apache Kylin: http://kylin.io Github: https://github.com/binmahone