Quanlong Huang created IMPALA-6830:
--------------------------------------

             Summary: HdfsScanner get stale data when Hive table is overwrited
                 Key: IMPALA-6830
                 URL: https://issues.apache.org/jira/browse/IMPALA-6830
             Project: IMPALA
          Issue Type: Bug
            Reporter: Quanlong Huang


In the minicluster:
{code:bash}
hive> create table tmp_parq (a int, b string, c int) stored as parquet;
hive> insert overwrite table tmp_parq select 1, "abc", 2;
impala> select * from tmp_parq;
+---+-----+---+
| a | b   | c |
+---+-----+---+
| 1 | abc | 2 |
+---+-----+---+
hive> insert overwrite table tmp_parq select 100, "ddd", 200;

# # impala still gets old results:
impala> select * from tmp_parq;
+---+-----+---+
| a | b   | c |
+---+-----+---+
| 1 | abc | 2 |
+---+-----+---+

# # It can be fixed after REFRESH
impala> refresh tmp_parq;
impala> select * from tmp_parq;
+-----+-----+-----+
| a   | b   | c   |
+-----+-----+-----+
| 100 | ddd | 200 |
+-----+-----+-----+
{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to