[ https://issues.apache.org/jira/browse/HIVE-5631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13804510#comment-13804510 ]
Venki Korukanti commented on HIVE-5631:
---------------------------------------

org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_index_creation
This test failed because the SerDeInfo object was not initialized correctly in the patch. I have attached an updated patch.

org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_infer_bucket_sort_convert_join
The test failed in setup, but I don't see any error info in the logs. It also runs successfully on my machine.

org.apache.hcatalog.hbase.snapshot.lock.TestWriteLock.testRun
This seems to be a failure in starting the mini ZK server and doesn't look related to the change in this bug. Again, this runs successfully on my machine.

> Index creation on a skew table fails
> ------------------------------------
>
>                 Key: HIVE-5631
>                 URL: https://issues.apache.org/jira/browse/HIVE-5631
>             Project: Hive
>          Issue Type: Bug
>          Components: Database/Schema
>    Affects Versions: 0.12.0
>            Reporter: Venki Korukanti
>            Assignee: Venki Korukanti
>             Fix For: 0.13.0
>
>         Attachments: HIVE-5631.1.patch.txt, HIVE-5631.2.patch.txt
>
>
> REPRO STEPS:
> create database skewtest;
> use skewtest;
> create table skew (id bigint, acct string) skewed by (acct) on ('CC','CH');
> create index skew_indx on table skew (id) as
> 'org.apache.hadoop.hive.ql.index.compact.CompactIndexHandler' WITH DEFERRED
> REBUILD;
>
> The last DDL statement fails with the following error:
> FAILED: Execution Error, return code 1 from
> org.apache.hadoop.hive.ql.exec.DDLTask.
> InvalidObjectException(message:Invalid skew column [acct])
>
> When creating a table, Hive has sanity tests to make sure the columns have
> proper names and the skewed columns are a subset of the table columns. Here we
> fail because the index table has skewed column info. The index table's skewed
> columns include {acct} and its columns are {id, _bucketname, _offsets}. As
> the skewed column {acct} is not part of the table columns, Hive throws the
> exception.
>
> The reason the index table got skewed column info, even though its definition
> has no such info, is this: when creating the index table, a deep copy of the
> base table's StorageDescriptor (SD) (in this case 'skew') is made. In that
> copied SD, index-specific parameters are set and unrelated parameters are
> reset. Here the skewed column info is not reset (there are a few other params
> that are not reset either). That's why the index table contains the skewed
> column info.
>
> Fix: Instead of deep copying the base table's StorageDescriptor, create a new
> one from the gathered info. This way the index table does not inherit
> unnecessary SD properties from the base table.
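To illustrate the difference between the two approaches described above, here is a minimal, self-contained sketch. It does not use Hive's real classes: `SimpleSd`, `validate`, `indexSdByDeepCopy`, and `indexSdFromScratch` are hypothetical stand-ins invented for this example, and the exception type and message only imitate the error shown in the repro. It is meant to show the shape of the problem, not the code in the attached patch.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SkewedIndexSketch {

    // Hypothetical, heavily simplified stand-in for a storage descriptor.
    public static final class SimpleSd {
        public final List<String> cols = new ArrayList<>();
        public final List<String> skewedCols = new ArrayList<>();
        public String location;

        public SimpleSd deepCopy() {
            SimpleSd copy = new SimpleSd();
            copy.cols.addAll(cols);
            copy.skewedCols.addAll(skewedCols);
            copy.location = location;
            return copy;
        }
    }

    // Imitates the sanity check: every skewed column must be a table column.
    public static void validate(SimpleSd sd) {
        for (String c : sd.skewedCols) {
            if (!sd.cols.contains(c)) {
                throw new IllegalArgumentException("Invalid skew column [" + c + "]");
            }
        }
    }

    // Base table from the repro: skew (id, acct) skewed by (acct).
    public static SimpleSd baseTable() {
        SimpleSd sd = new SimpleSd();
        sd.cols.addAll(Arrays.asList("id", "acct"));
        sd.skewedCols.add("acct");
        sd.location = "/hypothetical/warehouse/skew";
        return sd;
    }

    // Deep copy, then overwrite some fields. skewedCols is never reset,
    // so the index table inherits it from the base table.
    public static SimpleSd indexSdByDeepCopy(SimpleSd base, List<String> indexCols) {
        SimpleSd sd = base.deepCopy();
        sd.cols.clear();
        sd.cols.addAll(indexCols);
        sd.location = null;
        return sd;
    }

    // Build a fresh descriptor from only the info the index table needs.
    public static SimpleSd indexSdFromScratch(List<String> indexCols) {
        SimpleSd sd = new SimpleSd();
        sd.cols.addAll(indexCols);
        return sd;
    }

    public static void main(String[] args) {
        List<String> indexCols = Arrays.asList("id", "_bucketname", "_offsets");
        try {
            validate(indexSdByDeepCopy(baseTable(), indexCols));
        } catch (IllegalArgumentException e) {
            System.out.println("deep copy: " + e.getMessage());
        }
        validate(indexSdFromScratch(indexCols));
        System.out.println("fresh descriptor: passes validation");
    }
}
```

With the deep copy, any field that is not explicitly reset leaks into the index table, so every new field added to the descriptor becomes a potential repeat of this bug. Building the descriptor from scratch makes the inherited set explicit.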