[ 
https://issues.apache.org/jira/browse/HIVE-19867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16516053#comment-16516053
 ] 

Steve Yeom edited comment on HIVE-19867 at 6/22/18 5:40 AM:
------------------------------------------------------------

I found the patch 03 does not address the cases of concurrent writes to a 
table. 
If writeId for transactional stats is saved into TBLS/PARTITIONS, then 
it can be used to figure out concurrent writes by comparing a write with its 
writeIdList.
Simply by checking it is in the list or not. 

I.e., if we assume two concurrent NSERTs are subsequently committed and updated 
stats 
of, let's say, table1. Then first INSERT's writeId is now saved and so can be 
used to see if it is
in second INSERT's writeIdList when second INSERT comes in. Also second 
INSERT's writeId
can be checked to see it is in first INSERT's writeIdList that is saved in 
TBLS/PARTITIONS.
If each writeId is contained in the other's writeIdList, we can say it is 
concurrent INSERT case
and turn the COLUMN_STATS_ACCURATE flag off.


was (Author: steveyeom2017):
I found the patch 03 does not address the cases of concurrent writes to a 
table. 
If writeId for transactional stats is saved into TBLS/PARTITIONS, then 
it can be used to figure out concurrent writes by comparing a write with its 
writeIdList.
Simply by checking it is in the list or not. 

> Test and verify Concurrent INSERTS  
> ------------------------------------
>
>                 Key: HIVE-19867
>                 URL: https://issues.apache.org/jira/browse/HIVE-19867
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Transactions
>    Affects Versions: 4.0.0
>            Reporter: Steve Yeom
>            Assignee: Steve Yeom
>            Priority: Major
>             Fix For: 4.0.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to