Kylin uses HCatalog to read the Hive table. Ideally, HCatalog would
understand the different storage formats and partitions. I searched for
whether HCatalog supports bucketed tables but found no related discussion.
Could you please open a JIRA with your findings? First we can fix the string
index out of bounds error, and then look into the Hive source issue.
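
For the first part, here is a minimal sketch of the kind of guard I have in
mind. It assumes the "out of range -1" comes from trimming a trailing comma
off an empty string when the output file has no rows. The class and method
names (CardinalityJoin, joinCardinalities) and the tab-separated
"column<TAB>cardinality" line layout are assumptions for illustration, not
Kylin's actual code.

```java
import java.util.List;

public class CardinalityJoin {

    /**
     * Hypothetical helper: takes lines of the assumed form
     * "columnName<TAB>cardinality" and returns the cardinalities
     * joined by commas. Returns an empty string for empty input
     * instead of calling substring(0, length - 1) on "".
     */
    static String joinCardinalities(List<String> lines) {
        StringBuilder joined = new StringBuilder();
        for (String line : lines) {
            String[] parts = line.split("\t");
            if (parts.length != 2) {
                // Skip blank or malformed lines rather than failing later.
                continue;
            }
            // Add the separator before every value except the first, so
            // there is never a trailing comma that needs to be trimmed.
            if (joined.length() > 0) {
                joined.append(',');
            }
            joined.append(parts[1]);
        }
        return joined.toString();
    }
}
```

With this shape, an empty output file yields an empty result that the caller
can detect and report, rather than a StringIndexOutOfBoundsException.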

2016-02-03 22:09 GMT+08:00 <h...@uni.de>:

> Hi,
>
> we found the reason for the empty output files: the Hive tables are
> bucketed. It looks like Kylin does not support bucketed tables and is
> looking in the wrong folder for the necessary files.
>
> Can anyone confirm this?
>
>
> 2016-01-29 7:34 GMT+01:00  <h...@uni.de>:
> > Hi,
> >
> > the output file is actually empty (that's probably the cause for "out
> > of range -1" -> length (0)-1 = -1). There is no output logging which
> > could be used to investigate why the file is actually empty. Any hints
> > on how we can debug why it is empty?
> >
> >
> > 2016-01-29 2:52 GMT+01:00 hongbin ma <mahong...@apache.org>:
> >> HiveColumnCardinalityUpdateJob
> >> desc in source code:
> >>
> >> /**
> >>  * This job will update save the cardinality result into Kylin table metadata store.
> >>  * @author shaoshi
> >>  */
> >>
> >>
> >>
> >> It does not belong to a cubing job; it's a separate task to help modeling.
> >> Can you check the output in /tmp/kylin/cardinality/KYLIN_DK.DIM_DTM? It
> >> seems the content format is not as expected:
> >>
> >> https://github.com/apache/kylin/blob/kylin-1.2/job/src/main/java/org/apache/kylin/job/hadoop/cardinality/HiveColumnCardinalityUpdateJob.java#L113
> >>
> >>
> >>
> >> --
> >> Regards,
> >>
> >> *Bin Mahone | 马洪宾*
> >> Apache Kylin: http://kylin.io
> >> Github: https://github.com/binmahone
>



-- 
Best regards,

Shaofeng Shi
