[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ashutosh Chauhan updated HIVE-4561:
-----------------------------------
       Resolution: Fixed
    Fix Version/s: 0.14.0
           Status: Resolved  (was: Patch Available)

Committed to trunk. Thanks, Navis!

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.12.0, 0.13.0
>         Reporter: caofangkun
>         Assignee: Navis
>          Fix For: 0.14.0
>
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch.txt, HIVE-4561.5.patch.txt, HIVE-4561.6.patch.txt,
> HIVE-4561.7.patch.txt, HIVE-4561.8.patch.txt, HIVE-4561.9.patch.txt
>
>
> if all column values larger than 0.0 DOUBLE_LOW_VALUE always will be 0.0
> or if all column values less than 0.0, DOUBLE_HIGH_VALUE will always be
> hive (default)> create table src_test (price double);
> hive (default)> load data local inpath './test.txt' into table src_test;
> hive (default)> select * from src_test;
> OK
> 1.0
> 2.0
> 3.0
> Time taken: 0.313 seconds, Fetched: 3 row(s)
> hive (default)> analyze table src_test compute statistics for columns price;
> mysql> select * from TAB_COL_STATS \G;
>                  CS_ID: 16
>                DB_NAME: default
>             TABLE_NAME: src_test
>            COLUMN_NAME: price
>            COLUMN_TYPE: double
>                 TBL_ID: 2586
>         LONG_LOW_VALUE: 0
>        LONG_HIGH_VALUE: 0
>       DOUBLE_LOW_VALUE: 0. # Wrong Result ! Expected is 1.
>      DOUBLE_HIGH_VALUE: 3.
>  BIG_DECIMAL_LOW_VALUE: NULL
> BIG_DECIMAL_HIGH_VALUE: NULL
>              NUM_NULLS: 0
>          NUM_DISTINCTS: 1
>            AVG_COL_LEN: 0.
>            MAX_COL_LEN: 0
>              NUM_TRUES: 0
>             NUM_FALSES: 0
>          LAST_ANALYZED: 1368596151
> 2 rows in set (0.00 sec)

--
This message was sent by Atlassian JIRA
(v6.2#6252)
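The symptom reported above (a minimum stuck at 0 for all-positive data, or a maximum stuck at 0 for all-negative data) is what one would see if the running bounds were seeded with 0 instead of with "no value seen yet". The ticket itself does not state the root cause, so that reading is an assumption. The sketch below is Python for illustration only; it is not Hive code, and `range_zero_seeded` and `range_nullable` are hypothetical names. The nullable variant follows the direction mentioned in this thread (allowing null for min/max), under the same assumption.

```python
from typing import Iterable, Optional, Tuple


def range_zero_seeded(values: Iterable[float]) -> Tuple[float, float]:
    """Hypothetical buggy accumulator: both bounds start at 0.0 (assumed cause)."""
    low, high = 0.0, 0.0
    for v in values:
        low = min(low, v)    # never rises above the 0.0 seed
        high = max(high, v)  # never drops below the 0.0 seed
    return low, high


def range_nullable(
    values: Iterable[float],
) -> Tuple[Optional[float], Optional[float]]:
    """Bounds start as None, meaning no value has been seen yet."""
    low: Optional[float] = None
    high: Optional[float] = None
    for v in values:
        low = v if low is None else min(low, v)
        high = v if high is None else max(high, v)
    return low, high


prices = [1.0, 2.0, 3.0]            # the rows loaded into src_test in the report
print(range_zero_seeded(prices))    # (0.0, 3.0)  low bound is wrong
print(range_nullable(prices))       # (1.0, 3.0)
print(range_nullable([]))           # (None, None) for an empty input
```

With the nullable form, an empty input has no low or high value at all, rather than a misleading 0.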
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Navis updated HIVE-4561:
------------------------
    Attachment: HIVE-4561.9.patch.txt

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.12.0, 0.13.0
>         Reporter: caofangkun
>         Assignee: Navis
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch.txt, HIVE-4561.5.patch.txt, HIVE-4561.6.patch.txt,
> HIVE-4561.7.patch.txt, HIVE-4561.8.patch.txt, HIVE-4561.9.patch.txt
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Navis updated HIVE-4561:
------------------------
    Attachment: HIVE-4561.8.patch.txt

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.12.0, 0.13.0
>         Reporter: caofangkun
>         Assignee: Navis
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch.txt, HIVE-4561.5.patch.txt, HIVE-4561.6.patch.txt,
> HIVE-4561.7.patch.txt, HIVE-4561.8.patch.txt
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Navis updated HIVE-4561:
------------------------
    Status: Patch Available  (was: Open)

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.13.0, 0.12.0
>         Reporter: caofangkun
>         Assignee: Navis
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch.txt, HIVE-4561.5.patch.txt, HIVE-4561.6.patch.txt,
> HIVE-4561.7.patch.txt
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Navis updated HIVE-4561:
------------------------
    Attachment: HIVE-4561.7.patch.txt

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.12.0, 0.13.0
>         Reporter: caofangkun
>         Assignee: Navis
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch.txt, HIVE-4561.5.patch.txt, HIVE-4561.6.patch.txt,
> HIVE-4561.7.patch.txt
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Navis updated HIVE-4561:
------------------------
    Status: Open  (was: Patch Available)

Ok, I'll take a look.

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.13.0, 0.12.0
>         Reporter: caofangkun
>         Assignee: Navis
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch.txt, HIVE-4561.5.patch.txt, HIVE-4561.6.patch.txt
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Navis updated HIVE-4561:
------------------------
    Attachment: HIVE-4561.6.patch.txt

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.12.0, 0.13.0
>         Reporter: caofangkun
>         Assignee: Navis
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch.txt, HIVE-4561.5.patch.txt, HIVE-4561.6.patch.txt
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Navis updated HIVE-4561:
------------------------
    Status: Patch Available  (was: Open)

Fixed result of metadata_only_queries_with_filters.q, but cannot reproduce other failures. Rerunning test.

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.13.0, 0.12.0
>         Reporter: caofangkun
>         Assignee: Navis
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch.txt, HIVE-4561.5.patch.txt, HIVE-4561.6.patch.txt
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ashutosh Chauhan updated HIVE-4561:
-----------------------------------
    Affects Version/s: 0.13.0
               Status: Open  (was: Patch Available)

Failures need to be looked at

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.13.0, 0.12.0
>         Reporter: caofangkun
>         Assignee: Navis
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch.txt, HIVE-4561.5.patch.txt
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Navis updated HIVE-4561:
------------------------
    Attachment: HIVE-4561.5.patch.txt

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.12.0
>         Reporter: caofangkun
>         Assignee: Navis
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch.txt, HIVE-4561.5.patch.txt
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Navis updated HIVE-4561:
------------------------
    Status: Patch Available  (was: Open)

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.12.0
>         Reporter: caofangkun
>         Assignee: Navis
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch.txt, HIVE-4561.5.patch.txt
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ashutosh Chauhan updated HIVE-4561:
-----------------------------------
    Assignee: Navis  (was: caofangkun)
      Status: Open  (was: Patch Available)

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.12.0
>         Reporter: caofangkun
>         Assignee: Navis
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch.txt
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Navis updated HIVE-4561:
------------------------
    Attachment: HIVE-4561.4.patch.txt

Changed to allow null for min/max value of long/double/decimal stats. Let's see what will happen.

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.12.0
>         Reporter: caofangkun
>         Assignee: caofangkun
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch.txt
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ashutosh Chauhan updated HIVE-4561:
-----------------------------------
    Assignee: caofangkun  (was: Ashutosh Chauhan)

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.12.0
>         Reporter: caofangkun
>         Assignee: caofangkun
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Zhuoluo (Clark) Yang updated HIVE-4561:
---------------------------------------
    Attachment:     (was: HIVE-4561.4.patch)

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.12.0
>         Reporter: caofangkun
>         Assignee: Zhuoluo (Clark) Yang
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Zhuoluo (Clark) Yang updated HIVE-4561:
---------------------------------------
    Status: Patch Available  (was: Open)

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.12.0
>         Reporter: caofangkun
>         Assignee: Zhuoluo (Clark) Yang
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Zhuoluo (Clark) Yang updated HIVE-4561:
---------------------------------------
    Attachment: HIVE-4561.4.patch

Update patch, make HIGH/LOW values of empty tables return null.

> Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ------------------------------------------------------------------------
>
>              Key: HIVE-4561
>              URL: https://issues.apache.org/jira/browse/HIVE-4561
>          Project: Hive
>       Issue Type: Bug
>       Components: Statistics
> Affects Versions: 0.12.0
>         Reporter: caofangkun
>         Assignee: Zhuoluo (Clark) Yang
>      Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch,
> HIVE-4561.4.patch
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhuoluo (Clark) Yang updated HIVE-4561: --- Status: Open (was: Patch Available) [~ashutoshc] The values sound quite strange; I will try to make a new patch.
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhuoluo (Clark) Yang updated HIVE-4561: --- Status: Patch Available (was: Open)
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhuoluo (Clark) Yang updated HIVE-4561: --- Attachment: HIVE-4561.3.patch Fix the compute_stats_empty_table.q test results.
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashutosh Chauhan updated HIVE-4561: --- Status: Open (was: Patch Available) Test {{compute_stats_empty_table.q}} failed.
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhuoluo (Clark) Yang updated HIVE-4561: --- Attachment: HIVE-4561.2.patch Uploaded a new patch. When all the long values are positive, we now get the correct min; when all the values are negative, we now get the correct max. The unit test "compute_stats_long.q" reads its values from data/files/int.txt, all of which are above zero. The original test output recorded a min value of "0", but the correct min value is "4". This patch fixes the bug.
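The symptom reported here (a min of "0" for all-positive values, or a max of 0 for all-negative values) is what one would see if a running minimum or maximum started at 0. That cause is an assumption for illustration; the patch itself is not reproduced in this thread. A minimal Java sketch of the pattern, with hypothetical method names and made-up sample values:

```java
// Hypothetical illustration only; not taken from the Hive source or the patch.
class MinMaxSketch {
    // Problematic pattern: a running minimum that starts at 0 can never
    // rise above 0, so all-positive input reports 0.
    static long zeroInitMin(long[] values) {
        long min = 0;
        for (long v : values) {
            min = Math.min(min, v);
        }
        return min;
    }

    // Same problem mirrored for the maximum with all-negative input.
    static long zeroInitMax(long[] values) {
        long max = 0;
        for (long v : values) {
            max = Math.max(max, v);
        }
        return max;
    }

    // One possible remedy: start from the extreme of the type so that the
    // first real value always replaces the initial bound.
    static long sentinelMin(long[] values) {
        long min = Long.MAX_VALUE;
        for (long v : values) {
            min = Math.min(min, v);
        }
        return min;
    }

    static long sentinelMax(long[] values) {
        long max = Long.MIN_VALUE;
        for (long v : values) {
            max = Math.max(max, v);
        }
        return max;
    }

    public static void main(String[] args) {
        long[] positives = {4, 7, 9};   // sample data, all above zero
        long[] negatives = {-3, -1};    // sample data, all below zero
        System.out.println(zeroInitMin(positives)); // prints 0 (wrong)
        System.out.println(sentinelMin(positives)); // prints 4
        System.out.println(zeroInitMax(negatives)); // prints 0 (wrong)
        System.out.println(sentinelMax(negatives)); // prints -1
    }
}
```

Note that the sentinel variant returns Long.MAX_VALUE or Long.MIN_VALUE for empty input, so an empty table needs separate handling.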
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashutosh Chauhan updated HIVE-4561: --- Assignee: Zhuoluo (Clark) Yang
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhuoluo (Clark) Yang updated HIVE-4561: --- Attachment: HIVE-4561.1.patch A quick fix.
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhuoluo (Clark) Yang updated HIVE-4561: --- Status: Patch Available (was: Open) A quick fix. Would anybody assign the issue to me?
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhuoluo (Clark) Yang updated HIVE-4561: --- Priority: Major (was: Minor)
[jira] [Updated] (HIVE-4561) Column stats : LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the column values larger than 0.0 (or if all column values smaller than 0.0)
[ https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] caofangkun updated HIVE-4561: - Description: if all column values larger than 0.0 DOUBLE_LOW_VALUE always will be 0.0 or if all column values less than 0.0, DOUBLE_HIGH_VALUE will always be hive (default)> create table src_test (price double); hive (default)> load data local inpath './test.txt' into table src_test; hive (default)> select * from src_test; OK 1.0 2.0 3.0 Time taken: 0.313 seconds, Fetched: 3 row(s) hive (default)> analyze table src_test compute statistics for columns price; mysql> select * from TAB_COL_STATS \G; CS_ID: 16 DB_NAME: default TABLE_NAME: src_test COLUMN_NAME: price COLUMN_TYPE: double TBL_ID: 2586 LONG_LOW_VALUE: 0 LONG_HIGH_VALUE: 0 DOUBLE_LOW_VALUE: 0. # Wrong Result ! Expected is 1. DOUBLE_HIGH_VALUE: 3. BIG_DECIMAL_LOW_VALUE: NULL BIG_DECIMAL_HIGH_VALUE: NULL NUM_NULLS: 0 NUM_DISTINCTS: 1 AVG_COL_LEN: 0. MAX_COL_LEN: 0 NUM_TRUES: 0 NUM_FALSES: 0 LAST_ANALYZED: 1368596151 2 rows in set (0.00 sec) was: if all column values larger than 0.0 DOUBLE_LOW_VALUE always will be 0.0 or if all column values less than 0.0, DOUBLE_HIGH_VALUE will always be hive (default)> create table src_test (price double); hive (default)> load data local inpath './test.txt' into table src_test; hive (default)> select * from src_test; OK 1.0 2.0 3.0 Time taken: 0.313 seconds, Fetched: 3 row(s) hive (default)> analyze table src_test compute statistics for columns price; mysql> select * from TAB_COL_STATS \G; *** 1. row *** CS_ID: 16 DB_NAME: default TABLE_NAME: src_test COLUMN_NAME: price COLUMN_TYPE: double TBL_ID: 2586 LONG_LOW_VALUE: 0 LONG_HIGH_VALUE: 0 DOUBLE_LOW_VALUE: 0. # Wrong Result ! Expected is 1. DOUBLE_HIGH_VALUE: 3. BIG_DECIMAL_LOW_VALUE: NULL BIG_DECIMAL_HIGH_VALUE: NULL NUM_NULLS: 0 NUM_DISTINCTS: 1 AVG_COL_LEN: 0. 
MAX_COL_LEN: 0 NUM_TRUES: 0 NUM_FALSES: 0 LAST_ANALYZED: 1368596151 2 rows in set (0.00 sec)