[
https://issues.apache.org/jira/browse/HIVE-21803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
jiaxin zou updated HIVE-21803:
------------------------------
Description:
Hi all,
I have a table A partitioned by day/hour, loaded with ( insert overwrite table A
partition(day,hour) select ... from table B where day =.. and hour =... ), but
count(*) of table A and table B is occasionally not equal (for example, for
hour = 12). When I rerun the job ( insert overwrite .. hour = 12 ), the count(*)
values are consistent, so the bug cannot be reproduced reliably.
I find that the number of map output records is not equal to the number of
reducer input records:
!企业微信截图_15591333565716.png!
> The result of "insert overwrite table" is inconsistent with the original table
> ------------------------------------------------------------------------------
>
> Key: HIVE-21803
> URL: https://issues.apache.org/jira/browse/HIVE-21803
> Project: Hive
> Issue Type: Bug
> Affects Versions: 2.3.0
> Reporter: jiaxin zou
> Priority: Major
> Attachments: 企业微信截图_15591333565716.png
>
>
> Hi all,
> I have a table A partitioned by day/hour, loaded with ( insert overwrite
> table A partition(day,hour) select ... from table B where day =.. and
> hour =... ), but count(*) of table A and table B is occasionally not equal
> (for example, for hour = 12). When I rerun the job ( insert overwrite ..
> hour = 12 ), the count(*) values are consistent, so the bug cannot be
> reproduced reliably.
> I find that the number of map output records is not equal to the number of
> reducer input records:
> !企业微信截图_15591333565716.png!
>
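For reference, the scenario reported above can be sketched in HiveQL roughly as follows. This is only an illustration of the described steps, not the reporter's actual job: the table names `table_a` and `table_b`, the column `col1`, and the `hivevar` variables `day` and `hour` are placeholders, because the original statement elides the select list and the partition values.

```sql
-- Placeholder names throughout: table_a, table_b, col1 and the hivevar
-- variables day/hour are assumptions, not taken from the report.

-- Dynamic-partition inserts need these settings when no partition
-- value is given statically.
SET hive.exec.dynamic.partition=true;
SET hive.exec.dynamic.partition.mode=nonstrict;

-- Rewrite one day/hour of table_a from table_b. The dynamic partition
-- columns (day, hour) must come last in the select list.
INSERT OVERWRITE TABLE table_a PARTITION (day, hour)
SELECT col1, day, hour
FROM table_b
WHERE day = '${hivevar:day}' AND hour = '${hivevar:hour}';

-- Compare the row counts of the same partition in both tables.
SELECT 'table_a' AS src, COUNT(*) AS cnt
FROM table_a
WHERE day = '${hivevar:day}' AND hour = '${hivevar:hour}'
UNION ALL
SELECT 'table_b' AS src, COUNT(*) AS cnt
FROM table_b
WHERE day = '${hivevar:day}' AND hour = '${hivevar:hour}';
```

If the two `cnt` values differ for a given hour, that hour shows the reported mismatch. Rerunning only the insert for that partition and repeating the count query is the re-check described in the report.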