[ 
https://issues.apache.org/jira/browse/KYLIN-1403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16309196#comment-16309196
 ] 

Wang, Gang commented on KYLIN-1403:
-----------------------------------

Tested in Hive 1.2 Kylin 2.1, HCatlog works good in format TXT, Parquet and ORC.
This may be not a issue anymore.

set hive.enforce.bucketing = true;
set hive.exec.dynamic.partition=true;
set hive.exec.dynamic.partition.mode=nostrick;
create table testBucket_parquet (x int,y int) partitioned by(z int) clustered 
by(x) into 10 buckets STORED AS PARQUET;
insert into table testBucket_parquet partition(z) values (1, 1, 1);
insert into table testBucket_parquet partition(z) values (2, 1, 1);
insert into table testBucket_parquet partition(z) values (2, 1, 2);
insert into table testBucket_parquet partition(z) values (1, 1, 2);


> Kylin Hive Column Cardinality Job unable to read bucketed table
> ---------------------------------------------------------------
>
>                 Key: KYLIN-1403
>                 URL: https://issues.apache.org/jira/browse/KYLIN-1403
>             Project: Kylin
>          Issue Type: Bug
>    Affects Versions: v1.2, v1.3.0
>         Environment: - Tested against 
> apache-kylin-1.2-HBase1.1-incubating-SNAPSHOT-bin and 
> apache-kylin-1.3-HBase-1.1-SNAPSHOT-bin
> - Environment is HDP 2.3.4 
> - Hive version: hive-1.2.1.2.3.4.0
> - HBase version: HBase 1.1.2.2.3.4.0-3485
>            Reporter: Sebastian Zimmermann
>            Assignee: Wang, Gang
>              Labels: newbie
>
> This issue is connected with https://issues.apache.org/jira/browse/KYLIN-1402 
> and states the findings while investigating on the 
> StringIndexOutOfBoundsException.
> While trying to find out why the outputfile created in the cardinality job is 
> empty, we discovered that the only difference between this non-working job 
> and all our other jobs (which work without problems), is that the underlying 
> table is bucketed. 
> The data folder is dbfolder/db/table/partition/bucketfolder/file
> Kylin checks for data in dbfolder/db/table/partition and so is unable to find 
> the data.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to