Kylin uses HCatalog to read the hive table, ideally HCatalog will understand the different formats and partitions; I tried to search whether HCatalog supports bucket tables, but there is no related discussion. Could you please report a JIRA with your findings? Firstly we can fix the string index out of bounds error, and then look into the hive source issue.
2016-02-03 22:09 GMT+08:00 <h...@uni.de>: > Hi, > > we found the reason for the empty output files: the Hive table are > bucketed. It looks like Kylin does not support bucketed tables and is > looking in the wrong folder for the necessary files. > > Can anyone confirm this? > > > 2016-01-29 7:34 GMT+01:00 <h...@uni.de>: > > Hi, > > > > the output file is actually empty (that's probably the cause for "out > > of range -1" -> length (0)-1 = -1). There is no output logging which > > could be used to investigate why the file is actually empty. Any hints > > on how we can debug why it is empty? > > > > > > 2016-01-29 2:52 GMT+01:00 hongbin ma <mahong...@apache.org>: > >> HiveColumnCardinalityUpdateJob > >> desc in source code: > >> > >> /** > >> * This job will update save the cardinality result into Kylin table > >> metadata store. > >> * @author shaoshi > >> */ > >> > >> > >> > >> it does not belong to a cubing job, it's a separate task to help > modeling. > >> can you checkout the output in /tmp/kylin/cardinality/KYLIN_DK.DIM_DTM, > it > >> seems the content format is not as expected: > >> > https://github.com/apache/kylin/blob/kylin-1.2/job/src/main/java/org/apache/kylin/job/hadoop/cardinality/HiveColumnCardinalityUpdateJob.java#L113 > >> > >> > >> > >> -- > >> Regards, > >> > >> *Bin Mahone | 马洪宾* > >> Apache Kylin: http://kylin.io > >> Github: https://github.com/binmahone > -- Best regards, Shaofeng Shi