Multiple insert on same table with INSERT OVERWRITE + INSERT INTO ?

2014-06-30 Thread Furcy Pin
Hi all, Does anyone know what is the expected behavior on a query like this : FROM source INSERT OVERWRITE TABLE dest SELECT * INSERT INTO TABLE dest SELECT * is it the same as FROM source INSERT INTO TABLE dest SELECT * INSERT OVERWRITE TABLE dest SELECT * ? I'm asking because I ran a

Column selection in Hive

2014-06-30 Thread Chhaya Vishwakarma
Hi I have a Customer table with 200 columns I want to create another table from Customer table which should have only 190 columns. How can i skip few columns in hive from base table? Regards, Chhaya Vishwakarma The contents of this e-mail and any

ERROR 1115: Unsupported type: 5 in Pig's schema

2014-06-30 Thread Carlotta Hicks
Hi All! I created an hbase table and hcatalog table like the following: hive describe formatted client; OK # col_namedata_type comment a string from deserializer b

Alter location of database in Hive

2014-06-30 Thread Jon Bender
Hey all, I'm on Hive 0.10.0 on one of my clusters. We had a namenode hostname change, so I'm trying to point all of our tables, partitions and databases to the new locations. When i describe database mydb, the location shows up as hdfs://old_hostname/user/hive/warehouse/mydb.db, and i want to

Re: Alter location of database in Hive

2014-06-30 Thread Jon Bender
Answered my own question, no there is not. The way to do is is to modify the DB_LOCATION_URI field in metastore.DBS (at least if you're using MySQL) On Mon, Jun 30, 2014 at 5:14 PM, Jon Bender jonathan.ben...@gmail.com wrote: Hey all, I'm on Hive 0.10.0 on one of my clusters. We had a

Re: Alter location of database in Hive

2014-06-30 Thread Prem Yadav
I think you should be able to copy the data to a different location and then drop the old db, and create a new one with the new location. On Tue, Jul 1, 2014 at 1:54 AM, Jon Bender jonathan.ben...@gmail.com wrote: Answered my own question, no there is not. The way to do is is to modify the

Re: Column selection in Hive

2014-06-30 Thread Szehon Ho
Why dont you just use CTAS (Create table as select), putting the 190 columns in the select part? I guess you're maybe asking for any way to not have to type all the columns, but I'm not aware of any.

Calculation works wrong when hive.groupby.skewindata is true and count(*) count(distinct) group by work simultaneously

2014-06-30 Thread Chris Chen
【Phenomenon】 The query results are not the same as when hive.groupby.skewindata was setted to true and false. 【my question】 I want to calculate the count(*) and count(distinct) simultaneously ,otherwise it will cost 2 MR job to calculate. But when i set the hive.groupby.skewindata to be true,