Hi Team,
We need your expert input on resolving an issue while updating a record in a
Hive ORC bucketed table. I'm getting “FAILED: SemanticException [Error 10294]:
Attempt to do update or delete using transaction manager that does not support
these operations”. I'm on HDP 2.3.
Here are the steps I followed for the update:
STEP 1:
SET hive.support.concurrency = true;
SET hive.enforce.bucketing = true;
SET hive.exec.dynamic.partition.mode = nonstrict;
SET hive.txn.manager = org.apache.hadoop.hive.ql.lockmgr.DbTxnManager;
SET hive.compactor.initiator.on = true;
SET hive.compactor.worker.threads = 2;
STEP 2:
create table testTableNew(id int ,name string ) clustered by (id) into 2
buckets stored as orc TBLPROPERTIES('transactional'='true');
STEP 3:
insert into table testTableNew values (1,'row1'),(2,'row2'),(3,'row3');
-- 3 rows inserted successfully
STEP 4:
delete from testTable where id = 1;
FAILED: SemanticException [Error 10294]: Attempt to do update or delete using
transaction manager that does not support these operations.
STEP 5 :
insert into table testTable values (1,'row1'),(2,'row2');
FAILED: SemanticException [Error 10294]: Attempt to do update or delete using
transaction manager that does not support these operations.
Is there any setting I might be missing here? One small hunch: is it
necessary to have ZooKeeper running in order to perform an update or delete?
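For reference, checks along the following lines would show whether the session that fails is actually using the settings from STEP 1. This is only a sketch: it assumes it is run in the same Hive session as the failing statement and uses the table name from STEP 2.

```sql
-- Print the values currently in effect for this session
-- (SET with a key and no value echoes the current setting).
SET hive.txn.manager;
SET hive.support.concurrency;

-- Show the table parameters; 'transactional'='true' should be listed
-- under Table Parameters if the table was created as in STEP 2.
DESCRIBE FORMATTED testTableNew;

-- List open and aborted transactions and any queued compactions.
SHOW TRANSACTIONS;
SHOW COMPACTIONS;
```

If the first statement does not print DbTxnManager, the SET commands from STEP 1 did not apply to this session.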
The other question is about the performance of updates.
We have a scenario where we need to update a single row or a handful of rows,
say 10, in a Hive ORC table that contains almost 200 million records.
Can you share your thoughts on whether it is advisable to use “Update” or
“Delete” in Hive 0.14 (HDInsight 3.2)?
After the update, will the Hive ORC table remain as it was before?

Regards,
Mahender