[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HIVE-15680: - Resolution: Fixed Fix Version/s: 2.4.0 3.0.0 Status: Resolved (was: Patch Available) Committed to master and branch-2. Thanks [~erwaman] for the initial patches and [~thejas] for the review! > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Anthony Hsu >Priority: Major > Fix For: 3.0.0, 2.4.0 > > Attachments: HIVE-15680.1.patch, HIVE-15680.2.patch, > HIVE-15680.3.patch, HIVE-15680.4.patch, HIVE-15680.5.patch, > HIVE-15680.6.patch, HIVE-15680.7.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HIVE-15680: - Attachment: HIVE-15680.7.patch > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Prasanth Jayachandran >Priority: Major > Attachments: HIVE-15680.1.patch, HIVE-15680.2.patch, > HIVE-15680.3.patch, HIVE-15680.4.patch, HIVE-15680.5.patch, > HIVE-15680.6.patch, HIVE-15680.7.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HIVE-15680: - Attachment: (was: HIVE-15680.7.patch) > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Prasanth Jayachandran >Priority: Major > Attachments: HIVE-15680.1.patch, HIVE-15680.2.patch, > HIVE-15680.3.patch, HIVE-15680.4.patch, HIVE-15680.5.patch, HIVE-15680.6.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth Jayachandran updated HIVE-15680: - Attachment: HIVE-15680.7.patch > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Prasanth Jayachandran >Priority: Major > Attachments: HIVE-15680.1.patch, HIVE-15680.2.patch, > HIVE-15680.3.patch, HIVE-15680.4.patch, HIVE-15680.5.patch, > HIVE-15680.6.patch, HIVE-15680.7.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Hsu updated HIVE-15680: --- Status: Open (was: Patch Available) > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Anthony Hsu > Attachments: HIVE-15680.1.patch, HIVE-15680.2.patch, > HIVE-15680.3.patch, HIVE-15680.4.patch, HIVE-15680.5.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Hsu updated HIVE-15680: --- Status: Patch Available (was: Open) > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Anthony Hsu > Attachments: HIVE-15680.1.patch, HIVE-15680.2.patch, > HIVE-15680.3.patch, HIVE-15680.4.patch, HIVE-15680.5.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Hsu updated HIVE-15680: --- Attachment: HIVE-15680.5.patch Uploaded new patch. > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Anthony Hsu > Attachments: HIVE-15680.1.patch, HIVE-15680.2.patch, > HIVE-15680.3.patch, HIVE-15680.4.patch, HIVE-15680.5.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Hsu updated HIVE-15680: --- Attachment: HIVE-15680.4.patch Fixed NPEs in LLAP tests, uploaded new patch, and updated [RB|https://reviews.apache.org/r/55816/]. > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Anthony Hsu > Attachments: HIVE-15680.1.patch, HIVE-15680.2.patch, > HIVE-15680.3.patch, HIVE-15680.4.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Hsu updated HIVE-15680: --- Status: Patch Available (was: Open) > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Anthony Hsu > Attachments: HIVE-15680.1.patch, HIVE-15680.2.patch, > HIVE-15680.3.patch, HIVE-15680.4.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Hsu updated HIVE-15680: --- Status: Open (was: Patch Available) > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Anthony Hsu > Attachments: HIVE-15680.1.patch, HIVE-15680.2.patch, > HIVE-15680.3.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Hsu updated HIVE-15680: --- Status: Patch Available (was: Open) > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Anthony Hsu > Attachments: HIVE-15680.1.patch, HIVE-15680.2.patch, > HIVE-15680.3.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Hsu updated HIVE-15680: --- Attachment: HIVE-15680.3.patch Added some missing null checks. Uploaded new patch and updated RB: https://reviews.apache.org/r/55816/ > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Anthony Hsu > Attachments: HIVE-15680.1.patch, HIVE-15680.2.patch, > HIVE-15680.3.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Hsu updated HIVE-15680: --- Status: Open (was: Patch Available) > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Anthony Hsu > Attachments: HIVE-15680.1.patch, HIVE-15680.2.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Hsu updated HIVE-15680: --- Attachment: HIVE-15680.1.patch Uploaded patch. Also posted RB at https://reviews.apache.org/r/55816/. > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Anthony Hsu > Attachments: HIVE-15680.1.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-15680) Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
[ https://issues.apache.org/jira/browse/HIVE-15680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Hsu updated HIVE-15680: --- Status: Patch Available (was: Open) > Incorrect results when hive.optimize.index.filter=true and same ORC table is > referenced twice in query > -- > > Key: HIVE-15680 > URL: https://issues.apache.org/jira/browse/HIVE-15680 > Project: Hive > Issue Type: Bug >Affects Versions: 1.1.0, 2.2.0 >Reporter: Anthony Hsu >Assignee: Anthony Hsu > Attachments: HIVE-15680.1.patch > > > To repro: > {noformat} > set hive.optimize.index.filter=true; > create table test_table(number int) stored as ORC; > -- Two insertions will create two files, with one stripe each > insert into table test_table VALUES (1); > insert into table test_table VALUES (2); > -- This should and does return 2 records > select * from test_table; > -- These should and do each return 1 record > select * from test_table where number = 1; > select * from test_table where number = 2; > -- This should return 2 records but only returns 1 record > select * from test_table where number = 1 > union all > select * from test_table where number = 2; > {noformat} > What's happening is only the last predicate is being pushed down. -- This message was sent by Atlassian JIRA (v6.3.4#6332)