Fact tables with complex data types.

2016-06-06 Thread Joel Victor
Hi, I am using Kylin 1.5.2 with HDP 2.2 Currently my fact table contains multiple columns with type array. Kylin won't allow me to sync this table since it has complex datatypes. I don't need these complex data types in my cube builds but I do require them for other jobs. The table is partitioned

Per project Kylin hive/job/inmen config

2016-06-08 Thread Joel Victor
Hi, Does Kylin support per project hive_job_conf / kylin_job_conf / kylin_job_conf_inmem ? My use cases: 1. Specifying different queues for different projects 2. Different inmem/job/hive configuration per project Thanks -Joel

HBase region split policy.

2016-06-08 Thread Joel Victor
Hi, Can some one please explain why region splitting for HBase has been disabled? Is it the case that the Kylin cube job determines the region size using the kylin.hbase.region.cut property in kylin.properties? If so, what are the other configuration parameters that determine region split in HBase

Re: HBase region split policy.

2016-06-09 Thread Joel Victor
ed to rebalance; > > From 1.5.2, the "kylin.hbase.region.cut" determins the size of a region; > But as Kylin calculates the split keys by estimation; the real size of a > region may vary a little; > > The link in the code is out of date and mis-leading, will remove that > so

Re: Per project Kylin hive/job/inmen config

2016-06-11 Thread Joel Victor
> At least, you can do cube level configuration at moment. see > KylinConfigBase.getMRConfigOverride() > > On Wed, Jun 8, 2016 at 7:45 PM, Joel Victor wrote: > >> Hi, >> >> Does Kylin support per project hive_job_conf / kylin_job_conf / >> kylin_job_conf_inmem ?

HBase Region Replication.

2016-06-22 Thread Joel Victor
Hi, HDP 2.2 offers "HBase Read HA" functionality. More details about this are here[1]. Currently in Kylin when the HTable is created in HBase the region replication is not set explicitly for the Htable's[2][3]. The default value for region replication is 1. As far as I can tell this region replica

Support for weighted mean & median.

2016-06-30 Thread Joel Victor
Hi, I have a fact table with two measure columns one is cnt and other is val with count & sum measures defined over both columns. Val has either median or average for that day. Now if I want to calculate mean & median for a month I would be required to do the following, sum(cnt * val) / sum(cnt).

Re: Support for weighted mean & median.

2016-06-30 Thread Joel Victor
Thanks!!! -Joel On Fri, Jul 1, 2016 at 11:18 AM, hongbin ma wrote: > we don't have such plan yet. As a workaround you can create a hive view > with an additional column for cnt * val, and use the view as fact table > in your cube > > On Fri, Jul 1, 2016 at 1:06 PM, Joel Vi

KYLIN-1656

2016-08-02 Thread Joel Victor
Is there any way to disable this new step that has been added to the build process. https://issues.apache.org/jira/browse/KYLIN-1656 This adds a new step which counts the number of records at the beginning of each build. For my cube builds it does not benefit me much since my build latencies have

Status of Kylin support on HDP 2.3

2016-08-22 Thread Joel Victor
I wanted to know the status of support for Kylin on HDP 2.3. Is Kylin ready to be used with HDP 2.3. Also the documentation for the Hadoop environment says that Kylin can be used with Hive 1.2.1, HBase 1.x and Hadoop 2.7. Is it safe to assume that it can be run on HDP 2.3 ? Has anyone faced any iss

Re: Status of Kylin support on HDP 2.3

2016-08-23 Thread Joel Victor
Kylin is also stable on 2.3 ? -Joel On Tue, Aug 23, 2016 at 11:54 AM, Yiming Liu wrote: > We have tested on HDP 2.2 and 2.4, no issues found. > > 2016-08-23 13:56 GMT+08:00 Joel Victor : > >> I wanted to know the status of support for Kylin on HDP 2.3. >> Is Kylin ready

Re: Status of Kylin support on HDP 2.3

2016-08-23 Thread Joel Victor
> > 2016-08-23 17:48 GMT+08:00 Joel Victor : > >> Hey Yiming, >> >> Thanks for the info. >> >> Currently we are facing issues with Storm + Hive in HDP 2.2. We are using >> HDP 2.2 since Kylin supports 2.2 >> We have evaluated HDP 2.3 and streaming i

Re: KYLIN-1656

2016-09-09 Thread Joel Victor
ubscribe and > comment there. > > > 2016-09-03 17:26 GMT+08:00 hongbin ma : > >> ​@shaofeng any insights?​ >> >> On Wed, Aug 3, 2016 at 9:56 AM, Joel Victor >> wrote: >> >>> ld think about making this feature optional (A per cube config unti

Incorrect query result

2016-10-10 Thread Joel Victor
There are two columns in the table which are event date and event hour. I have attached the result and the plans for all three queries that I ran. The first query will *not* show the value for date 2016-09-28 and hour 0 in the result The second query will show the value for date 2016-09-28 and hou

Trie dictionary Not a valid value

2016-10-10 Thread Joel Victor
I have come across this error where I get a exception in the cube build step (step #17) which says that a particular key does not exist in the trie dictionary. The build dictionary step says that the value is present in the dictionary. I have deduced this by looking at the Kylin debug logs. Follow

Re: Trie dictionary Not a valid value

2016-10-10 Thread Joel Victor
; > check if your actual version is 1.5.2 there is a new version 1.5.4.2 > > good luck, Alb > > 2016-10-10 18:38 GMT+02:00 Joel Victor : > >> I have come across this error where I get a exception in the cube build >> step (step #17) which says that a particular key does

Kylin metadata and storage cleanup.

2016-10-17 Thread Joel Victor
In order to cleanup kylin metadata, HBase tables and HDFS. Can I do this while kylin cube jobs are running in the background. Is there a chance that the metadata might get messed up? In this document http://kylin.apache.org/docs15/howto/howto_cleanup_storage.html it says you can do offline cleanup.

Re: Kylin metadata and storage cleanup.

2016-10-18 Thread Joel Victor
dataCleanup, would be better to run when there is no running job; and > take a backup before clean is always recommended. > > 2016-10-17 22:29 GMT+08:00 Joel Victor : > >> In order to cleanup kylin metadata, HBase tables and HDFS. Can I do this >> while kylin cube jobs are r

Build times gone up!

2016-10-27 Thread Joel Victor
We are seeing a problem in our cube builds that our build times have gone up. In between two steps we see that it is spending 30 seconds. Our cube build is a in memory cube build and while going through step #6 to step #17 it takes ~ 5.5 minutes. I would like to know what happens in between these s

Re: Build times gone up!

2016-10-27 Thread Joel Victor
> skipped, such like "Build N-Dimension". All build actually happens at "Build > Cube". > > 2016-10-28 2:09 GMT+08:00 Joel Victor : > >> We are seeing a problem in our cube builds that our build times have gone >> up. In between two steps we see that it