Tim Armstrong created IMPALA-8458:
-------------------------------------

             Summary: Can't set maxSize and avgSize column stats with local 
catalog
                 Key: IMPALA-8458
                 URL: https://issues.apache.org/jira/browse/IMPALA-8458
             Project: IMPALA
          Issue Type: Bug
          Components: Catalog
    Affects Versions: Impala 3.3.0
            Reporter: Tim Armstrong
            Assignee: Tim Armstrong


Repro:

{noformat}
[tarmstrong-box2.ca.cloudera.com:21000] default> create table test_stats2(s 
string);
+-------------------------+
| summary                 |
+-------------------------+
| Table has been created. |
+-------------------------+
Fetched 1 row(s) in 0.36s
[tarmstrong-box2.ca.cloudera.com:21000] default> show column stats test_stats2;
+--------+--------+------------------+--------+----------+----------+
| Column | Type   | #Distinct Values | #Nulls | Max Size | Avg Size |
+--------+--------+------------------+--------+----------+----------+
| s      | STRING | -1               | -1     | -1       | -1       |
+--------+--------+------------------+--------+----------+----------+
Fetched 1 row(s) in 0.02s
[tarmstrong-box2.ca.cloudera.com:21000] default> alter table test_stats2 set 
column stats s('avgSize'='1234');
+-----------------------------------------+
| summary                                 |
+-----------------------------------------+
| Updated 0 partition(s) and 1 column(s). |
+-----------------------------------------+
Fetched 1 row(s) in 0.14s
[tarmstrong-box2.ca.cloudera.com:21000] default> show column stats test_stats2;
+--------+--------+------------------+--------+----------+----------+
| Column | Type   | #Distinct Values | #Nulls | Max Size | Avg Size |
+--------+--------+------------------+--------+----------+----------+
| s      | STRING | -1               | -1     | -1       | -1       |
+--------+--------+------------------+--------+----------+----------+
Fetched 1 row(s) in 0.02s
[tarmstrong-box2.ca.cloudera.com:21000] default> alter table test_stats2 set 
column stats s('maxSize'='1234');
+-----------------------------------------+
| summary                                 |
+-----------------------------------------+
| Updated 0 partition(s) and 1 column(s). |
+-----------------------------------------+
Fetched 1 row(s) in 0.10s
[tarmstrong-box2.ca.cloudera.com:21000] default> show column stats test_stats2;
+--------+--------+------------------+--------+----------+----------+
| Column | Type   | #Distinct Values | #Nulls | Max Size | Avg Size |
+--------+--------+------------------+--------+----------+----------+
| s      | STRING | -1               | -1     | -1       | -1       |
+--------+--------+------------------+--------+----------+----------+
Fetched 1 row(s) in 0.02s
[tarmstrong-box2.ca.cloudera.com:21000] default> invalidate metadata 
test_stats2;
Fetched 0 row(s) in 0.03s
[tarmstrong-box2.ca.cloudera.com:21000] default> show column stats test_stats2;
Query: show column stats test_stats2
+--------+--------+------------------+--------+----------+----------+
| Column | Type   | #Distinct Values | #Nulls | Max Size | Avg Size |
+--------+--------+------------------+--------+----------+----------+
| s      | STRING | -1               | -1     | -1       | -1       |
+--------+--------+------------------+--------+----------+----------+
Fetched 1 row(s) in 0.07s
{noformat}
I expected that the updates would take effect. Weirdly it doesn't happen for 
NDV and NULLS:

{noformat}
[tarmstrong-box2.ca.cloudera.com:21000] default> alter table test_stats2 set 
column stats s('numDVs'='1234','numNulls'='12345');
Query: alter table test_stats2 set column stats 
s('numDVs'='1234','numNulls'='12345')
+-----------------------------------------+
| summary                                 |
+-----------------------------------------+
| Updated 0 partition(s) and 1 column(s). |
+-----------------------------------------+
Fetched 1 row(s) in 0.12s
[tarmstrong-box2.ca.cloudera.com:21000] default> show column stats test_stats2;
Query: show column stats test_stats2
+--------+--------+------------------+--------+----------+----------+
| Column | Type   | #Distinct Values | #Nulls | Max Size | Avg Size |
+--------+--------+------------------+--------+----------+----------+
| s      | STRING | 1234             | 12345  | -1       | -1       |
+--------+--------+------------------+--------+----------+----------+
Fetched 1 row(s) in 0.02s
{noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to