Quanlong Huang created IMPALA-6830: -------------------------------------- Summary: HdfsScanner get stale data when Hive table is overwrited Key: IMPALA-6830 URL: https://issues.apache.org/jira/browse/IMPALA-6830 Project: IMPALA Issue Type: Bug Reporter: Quanlong Huang
In the minicluster: {code:bash} hive> create table tmp_parq (a int, b string, c int) stored as parquet; hive> insert overwrite table tmp_parq select 1, "abc", 2; impala> select * from tmp_parq; +---+-----+---+ | a | b | c | +---+-----+---+ | 1 | abc | 2 | +---+-----+---+ hive> insert overwrite table tmp_parq select 100, "ddd", 200; # # impala still gets old results: impala> select * from tmp_parq; +---+-----+---+ | a | b | c | +---+-----+---+ | 1 | abc | 2 | +---+-----+---+ # # It can be fixed after REFRESH impala> refresh tmp_parq; impala> select * from tmp_parq; +-----+-----+-----+ | a | b | c | +-----+-----+-----+ | 100 | ddd | 200 | +-----+-----+-----+ {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)