Hi, should I file one JIRA or two separate ones (IndexOutOfBounds + bucketed tables)?

2016-02-03 15:34 GMT+01:00 ShaoFeng Shi <[email protected]>:

> Kylin uses HCatalog to read the hive table, ideally HCatalog will
> understand the different formats and partitions; I tried to search whether
> HCatalog supports bucket tables, but there is no related discussion. Could
> you please report a JIRA with your findings? Firstly we can fix the string
> index out of bounds error, and then look into the hive source issue.
>
> 2016-02-03 22:09 GMT+08:00 <[email protected]>:
>
>> Hi,
>>
>> we found the reason for the empty output files: the Hive table are
>> bucketed. It looks like Kylin does not support bucketed tables and is
>> looking in the wrong folder for the necessary files.
>>
>> Can anyone confirm this?
>>
>>
>> 2016-01-29 7:34 GMT+01:00 <[email protected]>:
>> > Hi,
>> >
>> > the output file is actually empty (that's probably the cause for "out
>> > of range -1" -> length (0)-1 = -1). There is no output logging which
>> > could be used to investigate why the file is actually empty. Any hints
>> > on how we can debug why it is empty?
>> >
>> >
>> > 2016-01-29 2:52 GMT+01:00 hongbin ma <[email protected]>:
>> >> HiveColumnCardinalityUpdateJob
>> >> desc in source code:
>> >>
>> >> /**
>> >>  * This job will update save the cardinality result into Kylin table
>> >>  * metadata store.
>> >>  * @author shaoshi
>> >>  */
>> >>
>> >> it does not belong to a cubing job, it's a separate task to help
>> >> modeling.
>> >> can you checkout the output in /tmp/kylin/cardinality/KYLIN_DK.DIM_DTM,
>> >> it seems the content format is not as expected:
>> >>
>> >> https://github.com/apache/kylin/blob/kylin-1.2/job/src/main/java/org/apache/kylin/job/hadoop/cardinality/HiveColumnCardinalityUpdateJob.java#L113
>> >>
>> >> --
>> >> Regards,
>> >>
>> >> *Bin Mahone | 马洪宾*
>> >> Apache Kylin: http://kylin.io
>> >> Github: https://github.com/binmahone
>
> --
> Best regards,
>
> Shaofeng Shi
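
In case it helps with the JIRA, here is a minimal Java sketch of how an empty output file could lead to the "out of range -1" error discussed in the quoted messages (length 0, minus 1, gives -1). This is not Kylin's actual code: the class and method names (`CardinalitySketch`, `joinNaive`, `joinGuarded`) and the tab-separated "column, cardinality" line format are assumptions made only for illustration.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

public class CardinalitySketch {

    // Hypothetical helper (not Kylin's code): takes the second tab-separated
    // field of every line, joins the values with commas, then strips the
    // trailing comma with substring(0, length - 1).
    public static String joinNaive(List<String> lines) {
        StringBuilder sb = new StringBuilder();
        for (String line : lines) {
            String[] parts = line.split("\t");
            sb.append(parts[1]).append(',');
        }
        String joined = sb.toString();
        // With no input lines, joined is "" and the end index becomes -1,
        // so substring throws StringIndexOutOfBoundsException.
        return joined.substring(0, joined.length() - 1);
    }

    // Same logic, but an empty input is reported explicitly instead of
    // surfacing as an index error.
    public static String joinGuarded(List<String> lines) {
        if (lines.isEmpty()) {
            throw new IllegalStateException("no cardinality lines found; is the output file empty?");
        }
        return joinNaive(lines);
    }

    public static void main(String[] args) {
        // Two made-up lines in the assumed "column<TAB>cardinality" format.
        System.out.println(joinNaive(Arrays.asList("COL_A\t3", "COL_B\t5")));
        try {
            joinNaive(Collections.<String>emptyList());
        } catch (StringIndexOutOfBoundsException e) {
            System.out.println("empty input failed with: " + e.getClass().getSimpleName());
        }
    }
}
```

A guard like `joinGuarded` would only make the failure readable; it would not explain why the file is empty for bucketed tables, which is the second issue.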
