[
https://issues.apache.org/jira/browse/HIVE-9938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14703879#comment-14703879
]
Eugene Koifman commented on HIVE-9938:
--------------------------------------
The infrastructure for this is in place. TxnHandler.isRetryable() needs to
have a clause added to check for this message/condition.
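
As a rough illustration of the kind of clause meant here, the check could look something like the sketch below. This is not the actual TxnHandler code: the class and method names (SerializationFailureCheck, isSerializationFailure) are hypothetical, and the real isRetryable() may be structured differently. It relies only on the standard SQLSTATE 40001 (serialization failure), which PostgreSQL reports for "could not serialize access due to concurrent update", with a message match as a fallback for cases where the SQLException has been wrapped and only the text survives.

```java
import java.sql.SQLException;

// Hypothetical helper, not part of Hive: shows one way to recognize the
// PostgreSQL serialization failure quoted in the issue description.
class SerializationFailureCheck {
    // Standard SQLSTATE for "serialization failure".
    static final String SERIALIZATION_FAILURE = "40001";
    // Fragment of the PostgreSQL error text from the metastore log.
    static final String MESSAGE_FRAGMENT = "could not serialize access";

    static boolean isSerializationFailure(Throwable t) {
        // Walk the cause chain; the depth limit guards against cyclic causes.
        int depth = 0;
        for (Throwable cur = t; cur != null && depth < 32; cur = cur.getCause(), depth++) {
            if (cur instanceof SQLException) {
                String state = ((SQLException) cur).getSQLState();
                if (SERIALIZATION_FAILURE.equals(state)) {
                    return true;
                }
            }
            // Fallback: the SQLState may be lost if the error was re-wrapped
            // into another exception type that carries only the message.
            String msg = cur.getMessage();
            if (msg != null && msg.contains(MESSAGE_FRAGMENT)) {
                return true;
            }
        }
        return false;
    }
}
```

Matching on SQLSTATE is generally more robust than matching on message text, which can vary with server locale; the message match is kept only as a fallback.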
> Add retry logic to DbTxnMgr instead of aborting transactions.
> -------------------------------------------------------------
>
> Key: HIVE-9938
> URL: https://issues.apache.org/jira/browse/HIVE-9938
> Project: Hive
> Issue Type: Improvement
> Affects Versions: 0.14.0
> Reporter: bharath v
>
> Sometimes parallel updates using DbTxnMgr result in the following error trace:
> {noformat}
> 5325 [main] INFO org.apache.hadoop.hive.ql.log.PerfLogger - <PERFLOG method=acquireReadWriteLocks from=org.apache.hadoop.hive.ql.Driver>
> 5351 [main] ERROR org.apache.hadoop.hive.ql.Driver - FAILED: Error in acquiring locks: Error communicating with the metastore
> org.apache.hadoop.hive.ql.lockmgr.LockException: Error communicating with the metastore
>   at org.apache.hadoop.hive.ql.lockmgr.DbLockManager.lock(DbLockManager.java:100)
>   at org.apache.hadoop.hive.ql.lockmgr.DbTxnManager.acquireLocks(DbTxnManager.java:194)
>
> {noformat}
> Looking at the Postgres logs internally, we see:
> {noformat}
> 2015-02-02 06:36:05,632 ERROR org.apache.hadoop.hive.metastore.RetryingHMSHandler: org.apache.thrift.TException: MetaException(message:Unable to update transaction database org.postgresql.util.PSQLException: ERROR: could not serialize access due to concurrent update
> {noformat}
> Ideally, we should add retry logic so that the failed transaction is retried instead of aborted.
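
A bounded retry of the kind the description asks for could be sketched as below. This is an illustration under stated assumptions, not Hive's implementation: TxnRetry, withRetry, maxAttempts and initialBackoffMs are hypothetical names, and the caller supplies the predicate that decides which failures are worth retrying. Note that after a PostgreSQL serialization failure the whole database transaction has to be rolled back and re-run from the start, so the operation passed in should cover the complete transaction.

```java
import java.util.concurrent.Callable;
import java.util.function.Predicate;

// Hypothetical helper, not part of Hive: retries an operation a bounded
// number of times, doubling the pause between attempts.
class TxnRetry {
    static <T> T withRetry(Callable<T> op,
                           Predicate<Exception> retryable,
                           int maxAttempts,
                           long initialBackoffMs) throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        long backoffMs = initialBackoffMs;
        for (int attempt = 1; ; attempt++) {
            try {
                // The operation should begin and commit its own transaction.
                return op.call();
            } catch (Exception e) {
                // Give up on the last attempt or on a non-retryable failure.
                if (attempt >= maxAttempts || !retryable.test(e)) {
                    throw e;
                }
                Thread.sleep(backoffMs);
                backoffMs *= 2;
            }
        }
    }
}
```

A caller would pass the transactional work as the operation and a predicate such as "the exception is a java.sql.SQLException with SQLSTATE 40001". Capping the number of attempts keeps a persistently conflicting workload from retrying forever.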