[jira] [Resolved] (IMPALA-8458) Can't set numNull/maxSize/avgSize column stats with local catalog without also setting NDV
[ https://issues.apache.org/jira/browse/IMPALA-8458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon resolved IMPALA-8458. - Resolution: Fixed Fix Version/s: Impala 3.3.0 > Can't set numNull/maxSize/avgSize column stats with local catalog without > also setting NDV > -- > > Key: IMPALA-8458 > URL: https://issues.apache.org/jira/browse/IMPALA-8458 > Project: IMPALA > Issue Type: Bug > Components: Catalog >Affects Versions: Impala 3.3.0 >Reporter: Tim Armstrong >Assignee: Todd Lipcon >Priority: Critical > Fix For: Impala 3.3.0 > > > Repro: > {noformat} > [tarmstrong-box2.ca.cloudera.com:21000] default> create table test_stats2(s > string); > +-+ > | summary | > +-+ > | Table has been created. | > +-+ > Fetched 1 row(s) in 0.36s > [tarmstrong-box2.ca.cloudera.com:21000] default> show column stats > test_stats2; > +++--++--+--+ > | Column | Type | #Distinct Values | #Nulls | Max Size | Avg Size | > +++--++--+--+ > | s | STRING | -1 | -1 | -1 | -1 | > +++--++--+--+ > Fetched 1 row(s) in 0.02s > [tarmstrong-box2.ca.cloudera.com:21000] default> alter table test_stats2 set > column stats s('avgSize'='1234'); > +-+ > | summary | > +-+ > | Updated 0 partition(s) and 1 column(s). | > +-+ > Fetched 1 row(s) in 0.14s > [tarmstrong-box2.ca.cloudera.com:21000] default> show column stats > test_stats2; > +++--++--+--+ > | Column | Type | #Distinct Values | #Nulls | Max Size | Avg Size | > +++--++--+--+ > | s | STRING | -1 | -1 | -1 | -1 | > +++--++--+--+ > Fetched 1 row(s) in 0.02s > [tarmstrong-box2.ca.cloudera.com:21000] default> alter table test_stats2 set > column stats s('maxSize'='1234'); > +-+ > | summary | > +-+ > | Updated 0 partition(s) and 1 column(s). | > +-+ > Fetched 1 row(s) in 0.10s > [tarmstrong-box2.ca.cloudera.com:21000] default> show column stats > test_stats2; > +++--++--+--+ > | Column | Type | #Distinct Values | #Nulls | Max Size | Avg Size | > +++--++--+--+ > | s | STRING | -1 | -1 | -1 | -1 | > +++--++--+--+ > Fetched 1 row(s) in 0.02s > [tarmstrong-box2.ca.cloudera.com:21000] default> invalidate metadata > test_stats2; > Fetched 0 row(s) in 0.03s > [tarmstrong-box2.ca.cloudera.com:21000] default> show column stats > test_stats2; > Query: show column stats test_stats2 > +++--++--+--+ > | Column | Type | #Distinct Values | #Nulls | Max Size | Avg Size | > +++--++--+--+ > | s | STRING | -1 | -1 | -1 | -1 | > +++--++--+--+ > Fetched 1 row(s) in 0.07s > {noformat} > I expected that the updates would take effect. Weirdly it doesn't happen for > NDV and NULLS: > {noformat} > [tarmstrong-box2.ca.cloudera.com:21000] default> alter table test_stats2 set > column stats s('numDVs'='1234','numNulls'='12345'); > Query: alter table test_stats2 set column stats > s('numDVs'='1234','numNulls'='12345') > +-+ > | summary | > +-+ > | Updated 0 partition(s) and 1 column(s). | > +-+ > Fetched 1 row(s) in 0.12s > [tarmstrong-box2.ca.cloudera.com:21000] default> show column stats > test_stats2; > Query: show column stats test_stats2 > +++--++--+--+ > | Column | Type | #Distinct Values | #Nulls | Max Size | Avg Size | > +++--++--+--+ > | s | STRING | 1234 | 12345 | -1 | -1 | > +++--++--+--+ >
[jira] [Resolved] (IMPALA-8458) Can't set numNull/maxSize/avgSize column stats with local catalog without also setting NDV
[ https://issues.apache.org/jira/browse/IMPALA-8458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon resolved IMPALA-8458. - Resolution: Fixed Fix Version/s: Impala 3.3.0 > Can't set numNull/maxSize/avgSize column stats with local catalog without > also setting NDV > -- > > Key: IMPALA-8458 > URL: https://issues.apache.org/jira/browse/IMPALA-8458 > Project: IMPALA > Issue Type: Bug > Components: Catalog >Affects Versions: Impala 3.3.0 >Reporter: Tim Armstrong >Assignee: Todd Lipcon >Priority: Critical > Fix For: Impala 3.3.0 > > > Repro: > {noformat} > [tarmstrong-box2.ca.cloudera.com:21000] default> create table test_stats2(s > string); > +-+ > | summary | > +-+ > | Table has been created. | > +-+ > Fetched 1 row(s) in 0.36s > [tarmstrong-box2.ca.cloudera.com:21000] default> show column stats > test_stats2; > +++--++--+--+ > | Column | Type | #Distinct Values | #Nulls | Max Size | Avg Size | > +++--++--+--+ > | s | STRING | -1 | -1 | -1 | -1 | > +++--++--+--+ > Fetched 1 row(s) in 0.02s > [tarmstrong-box2.ca.cloudera.com:21000] default> alter table test_stats2 set > column stats s('avgSize'='1234'); > +-+ > | summary | > +-+ > | Updated 0 partition(s) and 1 column(s). | > +-+ > Fetched 1 row(s) in 0.14s > [tarmstrong-box2.ca.cloudera.com:21000] default> show column stats > test_stats2; > +++--++--+--+ > | Column | Type | #Distinct Values | #Nulls | Max Size | Avg Size | > +++--++--+--+ > | s | STRING | -1 | -1 | -1 | -1 | > +++--++--+--+ > Fetched 1 row(s) in 0.02s > [tarmstrong-box2.ca.cloudera.com:21000] default> alter table test_stats2 set > column stats s('maxSize'='1234'); > +-+ > | summary | > +-+ > | Updated 0 partition(s) and 1 column(s). | > +-+ > Fetched 1 row(s) in 0.10s > [tarmstrong-box2.ca.cloudera.com:21000] default> show column stats > test_stats2; > +++--++--+--+ > | Column | Type | #Distinct Values | #Nulls | Max Size | Avg Size | > +++--++--+--+ > | s | STRING | -1 | -1 | -1 | -1 | > +++--++--+--+ > Fetched 1 row(s) in 0.02s > [tarmstrong-box2.ca.cloudera.com:21000] default> invalidate metadata > test_stats2; > Fetched 0 row(s) in 0.03s > [tarmstrong-box2.ca.cloudera.com:21000] default> show column stats > test_stats2; > Query: show column stats test_stats2 > +++--++--+--+ > | Column | Type | #Distinct Values | #Nulls | Max Size | Avg Size | > +++--++--+--+ > | s | STRING | -1 | -1 | -1 | -1 | > +++--++--+--+ > Fetched 1 row(s) in 0.07s > {noformat} > I expected that the updates would take effect. Weirdly it doesn't happen for > NDV and NULLS: > {noformat} > [tarmstrong-box2.ca.cloudera.com:21000] default> alter table test_stats2 set > column stats s('numDVs'='1234','numNulls'='12345'); > Query: alter table test_stats2 set column stats > s('numDVs'='1234','numNulls'='12345') > +-+ > | summary | > +-+ > | Updated 0 partition(s) and 1 column(s). | > +-+ > Fetched 1 row(s) in 0.12s > [tarmstrong-box2.ca.cloudera.com:21000] default> show column stats > test_stats2; > Query: show column stats test_stats2 > +++--++--+--+ > | Column | Type | #Distinct Values | #Nulls | Max Size | Avg Size | > +++--++--+--+ > | s | STRING | 1234 | 12345 | -1 | -1 | > +++--++--+--+ >