[jira] [Updated] (HIVE-24323) JDBC driver fails when using Kerberos due to missing dependencies
[ https://issues.apache.org/jira/browse/HIVE-24323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] N Campbell updated HIVE-24323: -- Description: *The Apache Hive web pages historically implied that only 3-JAR files are required* hadoop-auth hadoop-common hive-jdbc *If a connection is attempted using Kerberos authentication, it will fail due to several missing dependencies* hadoop-auth-3.1.1.3.1.5.0-152.jar hadoop-common-3.1.1.3.1.5.0-152.jar hive-jdbc-3.1.0.3.1.5.0-152-standalone.jar *Dependencies* commons-collections-3.2.2.jar commons-configuration2.jar commons-lang-2.6.jar guava-29.0-jre.jar log4j-1.2.17.jar slf4j-api-1.7.25.jar *It is unclear if the intent of the standalone JAR is to include these dependencies or not.* But does not seem to be any documentation either way. *It also appears that dependencies are not being shaded, which can result in conflicts with guava or wstx jar files in the class path.* Such as noted by ORACLE {color:#00}Doc ID 2650046.1{color} {color:#00} com.ctc.wstx.io.StreamBootstrapper.getInstance(Ljava/lang/String;Lcom/ctc/wstx/io/SystemId;Ljava/io/InputStream;)Lcom/ctc/wstx/io/StreamBootstrapper; ] java.lang.NoSuchMethodError: com.ctc.wstx.io.StreamBootstrapper.getInstance(Ljava/lang/String;Lcom/ctc/wstx/io/SystemId;Ljava/io/InputStream;)Lcom/ctc/wstx/io/StreamBootstrapper; at org.apache.hadoop.conf.Configuration.parse(Configuration.java:2918) at org.apache.hadoop.conf.Configuration.parse(Configuration.java:2901){color} was: The Apache Hive web pages historically implied that only 3-JAR files are required hadoop-auth hadoop-common hive-jdbc If a connection is attempted using Kerberos authentication, it will fail due to several missing dependencies hadoop-auth-3.1.1.3.1.5.0-152.jar hadoop-common-3.1.1.3.1.5.0-152.jar hive-jdbc-3.1.0.3.1.5.0-152-standalone.jar It is unclear if the intent of the standalone JAR is to include these dependencies or not. But does not seem to be any documentation either way. It also appears that dependencies are not being shaded, which can result in conflicts with guava or wstx jar files in the class path. Such as noted by ORACLE {color:#00}Doc ID 2650046.1{color} commons-collections-3.2.2.jar commons-configuration2.jar commons-lang-2.6.jar guava-29.0-jre.jar log4j-1.2.17.jar slf4j-api-1.7.25.jar > JDBC driver fails when using Kerberos due to missing dependencies > - > > Key: HIVE-24323 > URL: https://issues.apache.org/jira/browse/HIVE-24323 > Project: Hive > Issue Type: Bug > Components: JDBC >Affects Versions: 3.1.0 >Reporter: N Campbell >Priority: Major > > *The Apache Hive web pages historically implied that only 3-JAR files are > required* > hadoop-auth > hadoop-common > hive-jdbc > *If a connection is attempted using Kerberos authentication, it will fail due > to several missing dependencies* > hadoop-auth-3.1.1.3.1.5.0-152.jar > hadoop-common-3.1.1.3.1.5.0-152.jar > hive-jdbc-3.1.0.3.1.5.0-152-standalone.jar > *Dependencies* > commons-collections-3.2.2.jar > commons-configuration2.jar > commons-lang-2.6.jar > guava-29.0-jre.jar > log4j-1.2.17.jar > slf4j-api-1.7.25.jar > *It is unclear if the intent of the standalone JAR is to include these > dependencies or not.* > But does not seem to be any documentation either way. > *It also appears that dependencies are not being shaded, which can result in > conflicts with guava or wstx jar files in the class path.* > Such as noted by ORACLE {color:#00}Doc ID 2650046.1{color} > {color:#00} > com.ctc.wstx.io.StreamBootstrapper.getInstance(Ljava/lang/String;Lcom/ctc/wstx/io/SystemId;Ljava/io/InputStream;)Lcom/ctc/wstx/io/StreamBootstrapper; > ] > java.lang.NoSuchMethodError: > com.ctc.wstx.io.StreamBootstrapper.getInstance(Ljava/lang/String;Lcom/ctc/wstx/io/SystemId;Ljava/io/InputStream;)Lcom/ctc/wstx/io/StreamBootstrapper; > at org.apache.hadoop.conf.Configuration.parse(Configuration.java:2918) > at > org.apache.hadoop.conf.Configuration.parse(Configuration.java:2901){color} -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (HIVE-15561) JDBC driver does not implement getArray method for array types
[ https://issues.apache.org/jira/browse/HIVE-15561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] N Campbell updated HIVE-15561: -- Component/s: JDBC > JDBC driver does not implement getArray method for array types > -- > > Key: HIVE-15561 > URL: https://issues.apache.org/jira/browse/HIVE-15561 > Project: Hive > Issue Type: Bug > Components: JDBC >Affects Versions: 1.2.1 >Reporter: N Campbell > > getDriverVersion: 1.2.1000.2.5.0.0-1245 > The JDBC driver returns a JDBC type enum 2003 which is an array type. If an > application then attempts to call the getArray() method the driver will throw > a not implemented exception. > Array z = rs.getArray("carrint"); > create table if not exists TARRSINT (RNUM int , CARRSINT ARRAY< SMALLINT >) > ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' COLLECTION ITEMS TERMINATED > BY '^' LINES TERMINATED BY '\n' > STORED AS textfile ; -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-15553) expression to compute running sum fails with Failed to breakup Windowing invocations into Groups
[ https://issues.apache.org/jira/browse/HIVE-15553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] N Campbell updated HIVE-15553: -- Attachment: TOLAP > expression to compute running sum fails with Failed to breakup Windowing > invocations into Groups > > > Key: HIVE-15553 > URL: https://issues.apache.org/jira/browse/HIVE-15553 > Project: Hive > Issue Type: Bug > Components: SQL >Affects Versions: 2.1.0 >Reporter: N Campbell > Attachments: TOLAP > > > The following statement will fail in Hive.Expecting to compute the following > as is possible in Oracle, Db2, Teradata, ... > select c1, sum ( sum ( c3 ) ) over ( order by c1 ) from certtext.tolap > group by c1 > Error: Error while compiling statement: FAILED: SemanticException Failed to > breakup Windowing invocations into Groups. At least 1 group must only depend > on input columns. Also check for circular dependencies. > Underlying error: org.apache.hadoop.hive.ql.parse.SemanticException: Line > 1:18 Expression not in GROUP BY key 'c3' > SQLState: 42000 > ErrorCode: 4 > Note: if you use the following statement it can encounter a Tez error when c1 > is a char vs string type. > select c1, sum( c3 ), sum ( sum ( c3 ) ) over ( order by c1 ) from > certtext.tolap group by c1 > C1SUM(C3) SUM(SUM(C3))OVER(ORDERBYC1) > AAA 50 50 > BBB 20 70 > CCC 30 100 > DDD 40 140 > 50 190 > Table definition > create table if not exists TOLAP (RNUM int , C1 char(3), C2 char(2), C3 int, > C4 int) > ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS textfile ; -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-15548) TEZ exception error when NULL ordering specification used on cursor or window agg
[ https://issues.apache.org/jira/browse/HIVE-15548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15804555#comment-15804555 ] N Campbell commented on HIVE-15548: --- Another case not involving null order specification SELECT `D1`.`C0` AS `rnum`, `D1`.`C1` AS `c1`, `D1`.`C2` AS `c2`, SUM(`D1`.`C3`) OVER( PARTITION BY `D1`.`C4` ) AS `cx` FROM ( SELECT `rnum` AS `C0`, `c1` AS `C1`, `c2` AS `C2`, `c3` AS `C3`, SUM(`c3`) OVER( PARTITION BY `c1` ORDER BY `c1` ASC ) AS `C4` FROM `tolap` ) `D1` > TEZ exception error when NULL ordering specification used on cursor or window > agg > - > > Key: HIVE-15548 > URL: https://issues.apache.org/jira/browse/HIVE-15548 > Project: Hive > Issue Type: Bug > Components: SQL >Affects Versions: 2.1.0 >Reporter: N Campbell > Attachments: TOLAP, TSET1 > > > Hive server: 2.1.0.2.5.3.0-37 > select c1, c2 from tset1 order by c1 asc nulls last, c2 asc nulls first > select rnum , c1 , c2 , sum( c3 ) over (partition by sum( c3 ) over > (partition by c1 order by c1 )) from tolap > i.e. > Error: Error while processing statement: FAILED: Execution Error, return code > 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, > vertexName=Reducer 2, vertexId=vertex_1483461312952_0011_11_01, > diagnostics=[Task failed, taskId=task_1483461312952_0011_11_01_00, > diagnostics=[TaskAttempt 0 failed, info=[Error: Error while running task ( > failure ) : > attempt_1483461312952_0011_11_01_00_0:java.lang.RuntimeException: > java.lang.RuntimeException: java.io.EOFException: Detail: > "java.io.EOFException" occured for field 0 of 2 fields (INT, CHAR) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:211) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:168) > at > org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1724) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37) > at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) > at > org.apache.hadoop.hive.llap.daemon.impl.StatsRecordingThreadPool$WrappedCallable.call(StatsRecordingThreadPool.java:110) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > table definitions: > create table if not exists TSET1 (RNUM int , C1 int, C2 char(3)) > ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS textfile ; > create table if not exists TOLAP (RNUM int , C1 char(3), C2 char(2), C3 int, > C4 int) > ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS textfile ; -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-15548) TEZ exception error when NULL ordering specification used on cursor or window agg
[ https://issues.apache.org/jira/browse/HIVE-15548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] N Campbell updated HIVE-15548: -- Description: Hive server: 2.1.0.2.5.3.0-37 select c1, c2 from tset1 order by c1 asc nulls last, c2 asc nulls first select rnum , c1 , c2 , sum( c3 ) over (partition by sum( c3 ) over (partition by c1 order by c1 )) from tolap i.e. Error: Error while processing statement: FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, vertexName=Reducer 2, vertexId=vertex_1483461312952_0011_11_01, diagnostics=[Task failed, taskId=task_1483461312952_0011_11_01_00, diagnostics=[TaskAttempt 0 failed, info=[Error: Error while running task ( failure ) : attempt_1483461312952_0011_11_01_00_0:java.lang.RuntimeException: java.lang.RuntimeException: java.io.EOFException: Detail: "java.io.EOFException" occured for field 0 of 2 fields (INT, CHAR) at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:211) at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:168) at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370) at org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73) at org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:422) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1724) at org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61) at org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37) at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) at org.apache.hadoop.hive.llap.daemon.impl.StatsRecordingThreadPool$WrappedCallable.call(StatsRecordingThreadPool.java:110) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) at java.lang.Thread.run(Thread.java:745) table definitions: create table if not exists TSET1 (RNUM int , C1 int, C2 char(3)) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' STORED AS textfile ; create table if not exists TOLAP (RNUM int , C1 char(3), C2 char(2), C3 int, C4 int) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' STORED AS textfile ; was: select c1, c2 from tset1 order by c1 asc nulls last, c2 asc nulls first select rnum , c1 , c2 , sum( c3 ) over (partition by sum( c3 ) over (partition by c1 order by c1 )) from tolap i.e. Error: Error while processing statement: FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, vertexName=Reducer 2, vertexId=vertex_1483461312952_0011_11_01, diagnostics=[Task failed, taskId=task_1483461312952_0011_11_01_00, diagnostics=[TaskAttempt 0 failed, info=[Error: Error while running task ( failure ) : attempt_1483461312952_0011_11_01_00_0:java.lang.RuntimeException: java.lang.RuntimeException: java.io.EOFException: Detail: "java.io.EOFException" occured for field 0 of 2 fields (INT, CHAR) at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:211) at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:168) at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370) at org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73) at org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:422) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1724) at org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61) at org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37) at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) at org.apache.hadoop.hive.llap.daemon.impl.StatsRecordingThreadPool$WrappedCallable.call(StatsRecordingThreadPool.java:110) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) at java.
[jira] [Updated] (HIVE-15548) TEZ exception error when NULL ordering specification used on cursor or window agg
[ https://issues.apache.org/jira/browse/HIVE-15548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] N Campbell updated HIVE-15548: -- Attachment: TSET1 TOLAP > TEZ exception error when NULL ordering specification used on cursor or window > agg > - > > Key: HIVE-15548 > URL: https://issues.apache.org/jira/browse/HIVE-15548 > Project: Hive > Issue Type: Bug > Components: SQL >Affects Versions: 2.1.0 >Reporter: N Campbell > Attachments: TOLAP, TSET1 > > > select c1, c2 from tset1 order by c1 asc nulls last, c2 asc nulls first > select rnum , c1 , c2 , sum( c3 ) over (partition by sum( c3 ) over > (partition by c1 order by c1 )) from tolap > i.e. > Error: Error while processing statement: FAILED: Execution Error, return code > 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask. Vertex failed, > vertexName=Reducer 2, vertexId=vertex_1483461312952_0011_11_01, > diagnostics=[Task failed, taskId=task_1483461312952_0011_11_01_00, > diagnostics=[TaskAttempt 0 failed, info=[Error: Error while running task ( > failure ) : > attempt_1483461312952_0011_11_01_00_0:java.lang.RuntimeException: > java.lang.RuntimeException: java.io.EOFException: Detail: > "java.io.EOFException" occured for field 0 of 2 fields (INT, CHAR) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:211) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:168) > at > org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1724) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37) > at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) > at > org.apache.hadoop.hive.llap.daemon.impl.StatsRecordingThreadPool$WrappedCallable.call(StatsRecordingThreadPool.java:110) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > table definitions: > create table if not exists TSET1 (RNUM int , C1 int, C2 char(3)) > ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS textfile ; > create table if not exists TOLAP (RNUM int , C1 char(3), C2 char(2), C3 int, > C4 int) > ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS textfile ; -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-15547) invalid result set when order by includes nulls sort last
[ https://issues.apache.org/jira/browse/HIVE-15547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] N Campbell updated HIVE-15547: -- Description: Query that attempts to sort nulls last produces an incorrect result set. Data is correct when order by has no null order specification or specification is FIRST. Hive server version: 2.1.0.2.5.3.0-37 Query: SELECT `tint`.`rnum`, `tint`.`cint` FROM `tint` ORDER BY `tint`.`rnum` ASC NULLS LAST Results: tint.rnum tint.cint -1 0 1 10 Source data rnumcint 0 1 -1 2 0 3 1 4 10 Table create table if not exists TINT ( RNUM int , CINT int ) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' STORED AS textfile ; was: Query that attempts to sort nulls last produces an incorrect result set. (1) first column contains all null values which is wrong (2) second column has not sorted the only null value as the last row Hive server version: 2.1.0.2.5.3.0-37 Query: SELECT `tint`.`rnum`, `tint`.`cint` FROM `tint` ORDER BY `tint`.`rnum` ASC NULLS LAST Results: tint.rnum tint.cint -1 0 1 10 Source data rnumcint 0 1 -1 2 0 3 1 4 10 Table create table if not exists TINT ( RNUM int , CINT int ) ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' STORED AS textfile ; > invalid result set when order by includes nulls sort last > - > > Key: HIVE-15547 > URL: https://issues.apache.org/jira/browse/HIVE-15547 > Project: Hive > Issue Type: Bug > Components: SQL >Affects Versions: 2.1.0 >Reporter: N Campbell > > Query that attempts to sort nulls last produces an incorrect result set. > Data is correct when order by has no null order specification or > specification is FIRST. > Hive server version: 2.1.0.2.5.3.0-37 > Query: > SELECT `tint`.`rnum`, `tint`.`cint` FROM `tint` ORDER BY `tint`.`rnum` ASC > NULLS LAST > Results: > tint.rnum tint.cint > > -1 > 0 > 1 > 10 > Source data > rnum cint > 0 > 1 -1 > 2 0 > 3 1 > 4 10 > Table > create table if not exists TINT ( RNUM int , CINT int ) > ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS textfile ; -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-15547) invalid result set when order by includes nulls sort last
[ https://issues.apache.org/jira/browse/HIVE-15547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] N Campbell updated HIVE-15547: -- Summary: invalid result set when order by includes nulls sort last (was: nulls not sorted last on cursor specification) > invalid result set when order by includes nulls sort last > - > > Key: HIVE-15547 > URL: https://issues.apache.org/jira/browse/HIVE-15547 > Project: Hive > Issue Type: Bug > Components: SQL >Affects Versions: 2.1.0 >Reporter: N Campbell > > Query that attempts to sort nulls last produces an incorrect result set. > (1) first column contains all null values which is wrong > (2) second column has not sorted the only null value as the last row > Hive server version: 2.1.0.2.5.3.0-37 > Query: > SELECT `tint`.`rnum`, `tint`.`cint` FROM `tint` ORDER BY `tint`.`rnum` ASC > NULLS LAST > Results: > tint.rnum tint.cint > > -1 > 0 > 1 > 10 > Source data > rnum cint > 0 > 1 -1 > 2 0 > 3 1 > 4 10 > Table > create table if not exists TINT ( RNUM int , CINT int ) > ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS textfile ; -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-9534) incorrect result set for query that projects a windowed aggregate
[ https://issues.apache.org/jira/browse/HIVE-9534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14901818#comment-14901818 ] N Campbell commented on HIVE-9534: -- should say Netezza 7.2 and ORACLE 12. > incorrect result set for query that projects a windowed aggregate > - > > Key: HIVE-9534 > URL: https://issues.apache.org/jira/browse/HIVE-9534 > Project: Hive > Issue Type: Bug > Components: SQL >Reporter: N Campbell >Assignee: Chaoyu Tang > > Result set returned by Hive has one row instead of 5 > {code} > select avg(distinct tsint.csint) over () from tsint > create table if not exists TSINT (RNUM int , CSINT smallint) > ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS TEXTFILE; > 0|\N > 1|-1 > 2|0 > 3|1 > 4|10 > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-9534) incorrect result set for query that projects a windowed aggregate
[ https://issues.apache.org/jira/browse/HIVE-9534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14901809#comment-14901809 ] N Campbell commented on HIVE-9534: -- IBM Netezza 7.2, IBM Big Insights, SAP Hana SP10, ORACLE 12 etc all return the expected result. > incorrect result set for query that projects a windowed aggregate > - > > Key: HIVE-9534 > URL: https://issues.apache.org/jira/browse/HIVE-9534 > Project: Hive > Issue Type: Bug > Components: SQL >Reporter: N Campbell >Assignee: Chaoyu Tang > > Result set returned by Hive has one row instead of 5 > {code} > select avg(distinct tsint.csint) over () from tsint > create table if not exists TSINT (RNUM int , CSINT smallint) > ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS TEXTFILE; > 0|\N > 1|-1 > 2|0 > 3|1 > 4|10 > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-11372) join with betwee predicate comparing integer types returns no rows when ORC format used
[ https://issues.apache.org/jira/browse/HIVE-11372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] N Campbell updated HIVE-11372: -- Attachment: TSINT TINT > join with betwee predicate comparing integer types returns no rows when ORC > format used > --- > > Key: HIVE-11372 > URL: https://issues.apache.org/jira/browse/HIVE-11372 > Project: Hive > Issue Type: Bug >Affects Versions: 1.2.0 >Reporter: N Campbell > Attachments: TINT, TSINT > > > getDatabaseProductNameApache Hive > getDatabaseProductVersion 1.2.1.2.3.0.0-2557 > getDriverName Hive JDBC > getDriverVersion 1.2.1.2.3.0.0-2557 > getDriverMajorVersion 1 > getDriverMinorVersion 2 > select tint.rnum, tsint.rnum from tint , tsint where tint.cint between > tsint.csint and tsint.csint > when ORC used no rows returned versus TEXT > create table if not exists TSINT ( RNUM int , CSINT smallint ) > -- ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS orc ; > create table if not exists TINT ( RNUM int , CINT int ) > -- ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS orc ; -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-11371) Null pointer exception for nested table query when using ORC versus text
[ https://issues.apache.org/jira/browse/HIVE-11371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] N Campbell updated HIVE-11371: -- Attachment: TJOIN4 TJOIN3 TJOIN2 TJOIN1 > Null pointer exception for nested table query when using ORC versus text > > > Key: HIVE-11371 > URL: https://issues.apache.org/jira/browse/HIVE-11371 > Project: Hive > Issue Type: Bug >Affects Versions: 1.2.0 >Reporter: N Campbell > Attachments: TJOIN1, TJOIN2, TJOIN3, TJOIN4 > > > Following query will fail if the file format is ORC > select tj1rnum, tj2rnum, tjoin3.rnum as rnumt3 from (select tjoin1.rnum > tj1rnum, tjoin2.rnum tj2rnum, tjoin2.c1 tj2c1 from tjoin1 left outer join > tjoin2 on tjoin1.c1 = tjoin2.c1 ) tj left outer join tjoin3 on tj2c1 = > tjoin3.c1 > aused by: java.lang.NullPointerException > at > org.apache.hadoop.hive.ql.exec.vector.VectorCopyRow$LongCopyRow.copy(VectorCopyRow.java:60) > at > org.apache.hadoop.hive.ql.exec.vector.VectorCopyRow.copyByReference(VectorCopyRow.java:260) > at > org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinGenerateResultOperator.generateHashMapResultMultiValue(VectorMapJoinGenerateResultOperator.java:238) > at > org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinOuterGenerateResultOperator.finishOuter(VectorMapJoinOuterGenerateResultOperator.java:495) > at > org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinOuterLongOperator.process(VectorMapJoinOuterLongOperator.java:430) > ... 22 more > ]], Vertex did not succeed due to OWN_TASK_FAILURE, failedTasks:1 > killedTasks:0, Vertex vertex_1437788144883_0004_2_02 [Map 1] killed/failed > due to:null]DAG did not succeed due to VERTEX_FAILURE. failedVertices:1 > killedVertices:0 > SQLState: 08S01 > ErrorCode: 2 > getDatabaseProductNameApache Hive > getDatabaseProductVersion 1.2.1.2.3.0.0-2557 > getDriverName Hive JDBC > getDriverVersion 1.2.1.2.3.0.0-2557 > getDriverMajorVersion 1 > getDriverMinorVersion 2 > create table if not exists TJOIN1 (RNUM int , C1 int, C2 int) > -- ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS orc; > create table if not exists TJOIN2 (RNUM int , C1 int, C2 char(2)) > -- ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS orc ; > create table if not exists TJOIN3 (RNUM int , C1 int, C2 char(2)) > -- ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS orc ; > create table if not exists TJOIN4 (RNUM int , C1 int, C2 char(2)) > -- ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS orc ; -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-10936) incorrect result set when hive.vectorized.execution.enabled = true with predicate casting to CHAR or VARCHAR
[ https://issues.apache.org/jira/browse/HIVE-10936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] N Campbell updated HIVE-10936: -- Attachment: GO_TIME_DIM.zip > incorrect result set when hive.vectorized.execution.enabled = true with > predicate casting to CHAR or VARCHAR > > > Key: HIVE-10936 > URL: https://issues.apache.org/jira/browse/HIVE-10936 > Project: Hive > Issue Type: Bug > Components: Hive >Affects Versions: 0.14.0 > Environment: In this case using HDP install of Hive - 0.14.0.2.2.4.2-2 >Reporter: N Campbell > Attachments: GO_TIME_DIM.zip > > > Query returns data when set hive.vectorized.execution.enabled = false -or- if > target of CAST is STRING and not CHAR/VARCHAR > set hive.vectorized.execution.enabled = true; > select > `GO_TIME_DIM`.`day_key` > from > `gosalesdw1021`.`go_time_dim` `GO_TIME_DIM` > where > CAST(`GO_TIME_DIM`.`current_year` AS CHAR(4)) = '2010' > group by > `GO_TIME_DIM`.`day_key`; > create table GO_TIME_DIM ( DAY_KEY int , DAY_DATE timestamp , MONTH_KEY int , > CURRENT_MONTH smallint , MONTH_NUMBER int , QUARTER_KEY int , CURRENT_QUARTER > smallint , CURRENT_YEAR smallint , DAY_OF_WEEK smallint , DAY_OF_MONTH > smallint , DAYS_IN_MONTH smallint , DAY_OF_YEAR smallint , WEEK_OF_MONTH > smallint , WEEK_OF_QUARTER smallint , WEEK_OF_YEAR smallint , MONTH_EN string > , WEEKDAY_EN string , MONTH_DE string , WEEKDAY_DE string , MONTH_FR string , > WEEKDAY_FR string , MONTH_JA string , WEEKDAY_JA string , MONTH_AR string , > WEEKDAY_AR string , MONTH_CS string , WEEKDAY_CS string , MONTH_DA string , > WEEKDAY_DA string , MONTH_EL string , WEEKDAY_EL string , MONTH_ES string , > WEEKDAY_ES string , MONTH_FI string , WEEKDAY_FI string , MONTH_HR string , > WEEKDAY_HR string , MONTH_HU string , WEEKDAY_HU string , MONTH_ID string , > WEEKDAY_ID string , MONTH_IT string , WEEKDAY_IT string , MONTH_KK string , > WEEKDAY_KK string , MONTH_KO string , WEEKDAY_KO string , MONTH_MS string , > WEEKDAY_MS string , MONTH_NL string , WEEKDAY_NL string , MONTH_NO string , > WEEKDAY_NO string , MONTH_PL string , WEEKDAY_PL string , MONTH_PT string , > WEEKDAY_PT string , MONTH_RO string , WEEKDAY_RO string , MONTH_RU string , > WEEKDAY_RU string , MONTH_SC string , WEEKDAY_SC string , MONTH_SL string , > WEEKDAY_SL string , MONTH_SV string , WEEKDAY_SV string , MONTH_TC string , > WEEKDAY_TC string , MONTH_TH string , WEEKDAY_TH string , MONTH_TR string , > WEEKDAY_TR string ) > ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS TEXTFILE > LOCATION '../GO_TIME_DIM'; > Then create an ORC equivalent table and load it > insert overwrite table > GO_TIME_DIM > select * from TEXT.GO_TIME_DIM > ; -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-9534) incorrect result set for query that projects a windowed aggregate
[ https://issues.apache.org/jira/browse/HIVE-9534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14529608#comment-14529608 ] N Campbell commented on HIVE-9534: -- re your comment about ORACLE select avg(distinct tsint.csint) over () from tsint null, -1, 0, 1, 10 ORACLE Oracle Database 12c Enterprise Edition ( 12.1.0.2.0) returns 2.5, 2.5, 2.5, 2.5, 2.5 > incorrect result set for query that projects a windowed aggregate > - > > Key: HIVE-9534 > URL: https://issues.apache.org/jira/browse/HIVE-9534 > Project: Hive > Issue Type: Bug > Components: SQL >Reporter: N Campbell >Assignee: Chaoyu Tang > > Result set returned by Hive has one row instead of 5 > {code} > select avg(distinct tsint.csint) over () from tsint > create table if not exists TSINT (RNUM int , CSINT smallint) > ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS TEXTFILE; > 0|\N > 1|-1 > 2|0 > 3|1 > 4|10 > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-10488) cast DATE as TIMESTAMP returns incorrect values
[ https://issues.apache.org/jira/browse/HIVE-10488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14514263#comment-14514263 ] N Campbell commented on HIVE-10488: --- Cast was using a column of type date and not literals or value expression. Variance was the format the data was in. > cast DATE as TIMESTAMP returns incorrect values > --- > > Key: HIVE-10488 > URL: https://issues.apache.org/jira/browse/HIVE-10488 > Project: Hive > Issue Type: Bug > Components: SQL >Affects Versions: 0.13.1 >Reporter: N Campbell >Assignee: Chaoyu Tang > > same data in textfile works > same data loaded into an ORC table does not > connection property of tez/mr makes no difference. > select rnum, cdt, cast (cdt as timestamp) from tdt > 0 > 1 1996-01-01 1969-12-31 19:00:09.496 > 2 2000-01-01 1969-12-31 19:00:10.957 > 3 2000-12-31 1969-12-31 19:00:11.322 > vs > 0 > 1 1996-01-01 1996-01-01 00:00:00.0 > 2 2000-01-01 2000-01-01 00:00:00.0 > 3 2000-12-31 2000-12-31 00:00:00.0 > create table if not exists TDT ( RNUM int , CDT date ) > STORED AS orc ; > insert overwrite table TDT select * from text.TDT; > 0|\N > 1|1996-01-01 > 2|2000-01-01 > 3|2000-12-31 -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-9534) incorrect result set for query that projects a windowed aggregate
[ https://issues.apache.org/jira/browse/HIVE-9534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512997#comment-14512997 ] N Campbell commented on HIVE-9534: -- While Postgres may not support ISO-SQL syntax as shown other vendors do. That said, if the engine does not attempt to support distinct aggregation then a parsing error should be thrown -and- the documentation improved at https://cwiki.apache.org/confluence/display/Hive/LanguageManual+WindowingAndAnalytics > incorrect result set for query that projects a windowed aggregate > - > > Key: HIVE-9534 > URL: https://issues.apache.org/jira/browse/HIVE-9534 > Project: Hive > Issue Type: Bug > Components: SQL >Reporter: N Campbell > > Result set returned by Hive has one row instead of 5 > {code} > select avg(distinct tsint.csint) over () from tsint > create table if not exists TSINT (RNUM int , CSINT smallint) > ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' > STORED AS TEXTFILE; > 0|\N > 1|-1 > 2|0 > 3|1 > 4|10 > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)