from:"N Campbell $Jira$"

N Campbell created HIVE-9739:


 Summary: Quantified query fails with Tez/ORC file 
org.apache.hadoop.hive.ql.exec.tez.TezTask
 Key: HIVE-9739
 URL: https://issues.apache.org/jira/browse/HIVE-9739
 Project: Hive
  Issue Type: Bug
  Components: SQL
Reporter: N Campbell


This fails when using Tez and ORC. 

It will run when text files are used or text/ORC and MapReduce and not Tez used.

Is this another example of a type issue per 
https://issues.apache.org/jira/browse/HIVE-9735

select rnum, c1, c2 from tset1 as t1 where exists ( select c1 from tset2 where 
c1 = t1.c1 )

This will run in both Tez and MapReduce using a text file

select rnum, c1, c2 from t_tset1 as t1 where exists ( select c1 from t_tset2 
where c1 = t1.c1 )


Caused by: java.lang.RuntimeException: 
org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while 
processing row 
at 
org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:91)
at 
org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:68)
at 
org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:294)
at 
org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:163)
... 13 more
Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error 
while processing row 
at 
org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:52)
at 
org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:83)
... 16 more
Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Unexpected 
exception: org.apache.hadoop.hive.serde2.io.HiveCharWritable cannot be cast to 
org.apache.hadoop.hive.common.type.HiveChar
at 
org.apache.hadoop.hive.ql.exec.MapJoinOperator.processOp(MapJoinOperator.java:311)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.processOp(VectorMapJoinOperator.java:249)
at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:815)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorFilterOperator.processOp(VectorFilterOperator.java:111)
at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:815)
at 
org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:95)
at 
org.apache.hadoop.hive.ql.exec.MapOperator$MapOpCtx.forward(MapOperator.java:157)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
... 17 more
Caused by: java.lang.ClassCastException: 
org.apache.hadoop.hive.serde2.io.HiveCharWritable cannot be cast to 
org.apache.hadoop.hive.common.type.HiveChar
at 
org.apache.hadoop.hive.ql.exec.vector.VectorColumnAssignFactory$18.assignObjectValue(VectorColumnAssignFactory.java:432)
at 
org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.internalForward(VectorMapJoinOperator.java:196)
at 
org.apache.hadoop.hive.ql.exec.CommonJoinOperator.genAllOneUniqueJoinObject(CommonJoinOperator.java:670)
at 
org.apache.hadoop.hive.ql.exec.CommonJoinOperator.checkAndGenObject(CommonJoinOperator.java:748)
at 
org.apache.hadoop.hive.ql.exec.MapJoinOperator.processOp(MapJoinOperator.java:299)
... 24 more

create table  if not exists T_TSET1 (RNUM int , C1 int, C2 char(3))
 ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
 STORED AS textfile ;

create table  if not exists T_TSET1 (RNUM int , C1 int, C2 char(3))
 STORED AS ORC ;


create table  if not exists T_TSET2 (RNUM int , C1 int, C2 char(3))
 ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
 STORED AS textfile ;


TSET1 data

0|10|AAA
1|10|AAA
2|10|AAA
3|20|BBB
4|30|CCC
5|40|DDD
6|50|\N
7|60|\N
8|\N|AAA
9|\N|AAA
10|\N|\N
11|\N|\N

TSET2 DATA

0|10|AAA
1|10|AAA
2|40|DDD
3|50|EEE
4|60|FFF




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-9537) string expressions on a fixed length character do not preserve trailing spaces


[ 
https://issues.apache.org/jira/browse/HIVE-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14328992#comment-14328992
 ] 

N Campbell commented on HIVE-9537:
--

I saying that if the current implementation is by design and will not change 
that it be more completely documented. In particular, to show examples of how 
the design may deviate from other systems which implement fixed length 
character types as many vendors do (Teradata, DB2, Informix, Netezza, ). 

Unfortunately, the current implementation feels like cases where vendors have 
added syntax that other products have but the implementation is not the 
same/has various gaps. The SQL documentation on the Apache Hive WIKI has 
limited discussion on the implementation etc. 

As for the last comment: ISO-SQL concatenation CHAR(x) || CHAR(y) should return 
CHAR(x+y) with implementation details re what happens when x+y  maximum 
precision of CHAR that a vendor provides. 


 string expressions on a fixed length character do not preserve trailing spaces
 --

 Key: HIVE-9537
 URL: https://issues.apache.org/jira/browse/HIVE-9537
 Project: Hive
  Issue Type: Bug
  Components: SQL
Reporter: N Campbell
Assignee: Aihua Xu

 When a string expression such as upper or lower is applied to a fixed length 
 column the trailing spaces of the fixed length character are not preserved.
 {code:sql}
 CREATE TABLE  if not exists TCHAR ( 
 RNUM int, 
 CCHAR char(32)
 )
 ROW FORMAT DELIMITED 
 FIELDS TERMINATED BY '|' 
 LINES TERMINATED BY '\n' 
 STORED AS TEXTFILE;
 {code}
 {{cchar}} as a {{char(32)}}.
 {code:sql}
 select cchar, concat(cchar, cchar), concat(lower(cchar), cchar), 
 concat(upper(cchar), cchar) 
 from tchar;
 {code}
 0|\N
 1|
 2| 
 3|BB
 4|EE
 5|FF



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-9735) aggregate ( smalllint ) fails when ORC file used ava.lang.ClassCastException: java.lang.Long cannot be cast to java.lang.Short


[ 
https://issues.apache.org/jira/browse/HIVE-9735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14329533#comment-14329533
 ] 

N Campbell commented on HIVE-9735:
--

Using JDBC with SQLSquirrel

 aggregate ( smalllint ) fails when ORC file used ava.lang.ClassCastException: 
 java.lang.Long cannot be cast to java.lang.Short
 --

 Key: HIVE-9735
 URL: https://issues.apache.org/jira/browse/HIVE-9735
 Project: Hive
  Issue Type: Bug
  Components: SQL
Affects Versions: 0.14.0
Reporter: N Campbell

 select min( tsint.csint ) from tsint 
 select max( tsint.csint ) from tsint
 ava.lang.ClassCastException: java.lang.Long cannot be cast to java.lang.Short
 select min( t_tsint.csint ) from t_tsint 
 create table  if not exists T_TSINT ( RNUM int , CSINT smallint   )
  ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
  STORED AS textfile  ;
 create table  if not exists TSINT ( RNUM int , CSINT smallint   )
 TERMINATED BY '\n' 
  STORED AS orc  ;
 input data loaded into text file and then inserted into ORC table from text 
 based table
 0|\N
 1|-1
 2|0
 3|1
 4|10



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-9739) Various queries fails with Tez/ORC file org.apache.hadoop.hive.ql.exec.tez.TezTask


 [ 
https://issues.apache.org/jira/browse/HIVE-9739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-9739:
-
Summary: Various queries fails with Tez/ORC file 
org.apache.hadoop.hive.ql.exec.tez.TezTask  (was: Quantified query fails with 
Tez/ORC file org.apache.hadoop.hive.ql.exec.tez.TezTask)

 Various queries fails with Tez/ORC file 
 org.apache.hadoop.hive.ql.exec.tez.TezTask
 --

 Key: HIVE-9739
 URL: https://issues.apache.org/jira/browse/HIVE-9739
 Project: Hive
  Issue Type: Bug
  Components: SQL
Reporter: N Campbell

 This fails when using Tez and ORC. 
 It will run when text files are used or text/ORC and MapReduce and not Tez 
 used.
 Is this another example of a type issue per 
 https://issues.apache.org/jira/browse/HIVE-9735
 select rnum, c1, c2 from tset1 as t1 where exists ( select c1 from tset2 
 where c1 = t1.c1 )
 This will run in both Tez and MapReduce using a text file
 select rnum, c1, c2 from t_tset1 as t1 where exists ( select c1 from t_tset2 
 where c1 = t1.c1 )
 Caused by: java.lang.RuntimeException: 
 org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while 
 processing row 
   at 
 org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:91)
   at 
 org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:68)
   at 
 org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:294)
   at 
 org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:163)
   ... 13 more
 Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime 
 Error while processing row 
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:52)
   at 
 org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:83)
   ... 16 more
 Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Unexpected 
 exception: org.apache.hadoop.hive.serde2.io.HiveCharWritable cannot be cast 
 to org.apache.hadoop.hive.common.type.HiveChar
   at 
 org.apache.hadoop.hive.ql.exec.MapJoinOperator.processOp(MapJoinOperator.java:311)
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.processOp(VectorMapJoinOperator.java:249)
   at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:815)
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorFilterOperator.processOp(VectorFilterOperator.java:111)
   at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:815)
   at 
 org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:95)
   at 
 org.apache.hadoop.hive.ql.exec.MapOperator$MapOpCtx.forward(MapOperator.java:157)
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
   ... 17 more
 Caused by: java.lang.ClassCastException: 
 org.apache.hadoop.hive.serde2.io.HiveCharWritable cannot be cast to 
 org.apache.hadoop.hive.common.type.HiveChar
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorColumnAssignFactory$18.assignObjectValue(VectorColumnAssignFactory.java:432)
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.internalForward(VectorMapJoinOperator.java:196)
   at 
 org.apache.hadoop.hive.ql.exec.CommonJoinOperator.genAllOneUniqueJoinObject(CommonJoinOperator.java:670)
   at 
 org.apache.hadoop.hive.ql.exec.CommonJoinOperator.checkAndGenObject(CommonJoinOperator.java:748)
   at 
 org.apache.hadoop.hive.ql.exec.MapJoinOperator.processOp(MapJoinOperator.java:299)
   ... 24 more
 create table  if not exists T_TSET1 (RNUM int , C1 int, C2 char(3))
  ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
  STORED AS textfile ;
 create table  if not exists T_TSET1 (RNUM int , C1 int, C2 char(3))
  STORED AS ORC ;
 create table  if not exists T_TSET2 (RNUM int , C1 int, C2 char(3))
  ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
  STORED AS textfile ;
 TSET1 data
 0|10|AAA
 1|10|AAA
 2|10|AAA
 3|20|BBB
 4|30|CCC
 5|40|DDD
 6|50|\N
 7|60|\N
 8|\N|AAA
 9|\N|AAA
 10|\N|\N
 11|\N|\N
 TSET2 DATA
 0|10|AAA
 1|10|AAA
 2|40|DDD
 3|50|EEE
 4|60|FFF



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-9739) Various queries fails with Tez/ORC file org.apache.hadoop.hive.ql.exec.tez.TezTask due to Caused by: java.lang.ClassCastException


 [ 
https://issues.apache.org/jira/browse/HIVE-9739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-9739:
-
Summary: Various queries fails with Tez/ORC file 
org.apache.hadoop.hive.ql.exec.tez.TezTask due to Caused by: 
java.lang.ClassCastException  (was: Various queries fails with Tez/ORC file 
org.apache.hadoop.hive.ql.exec.tez.TezTask)

 Various queries fails with Tez/ORC file 
 org.apache.hadoop.hive.ql.exec.tez.TezTask due to Caused by: 
 java.lang.ClassCastException
 -

 Key: HIVE-9739
 URL: https://issues.apache.org/jira/browse/HIVE-9739
 Project: Hive
  Issue Type: Bug
  Components: SQL
Reporter: N Campbell

 This fails when using Tez and ORC. 
 It will run when text files are used or text/ORC and MapReduce and not Tez 
 used.
 Is this another example of a type issue per 
 https://issues.apache.org/jira/browse/HIVE-9735
 select rnum, c1, c2 from tset1 as t1 where exists ( select c1 from tset2 
 where c1 = t1.c1 )
 This will run in both Tez and MapReduce using a text file
 select rnum, c1, c2 from t_tset1 as t1 where exists ( select c1 from t_tset2 
 where c1 = t1.c1 )
 Caused by: java.lang.RuntimeException: 
 org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while 
 processing row 
   at 
 org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:91)
   at 
 org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:68)
   at 
 org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:294)
   at 
 org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:163)
   ... 13 more
 Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime 
 Error while processing row 
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:52)
   at 
 org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:83)
   ... 16 more
 Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Unexpected 
 exception: org.apache.hadoop.hive.serde2.io.HiveCharWritable cannot be cast 
 to org.apache.hadoop.hive.common.type.HiveChar
   at 
 org.apache.hadoop.hive.ql.exec.MapJoinOperator.processOp(MapJoinOperator.java:311)
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.processOp(VectorMapJoinOperator.java:249)
   at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:815)
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorFilterOperator.processOp(VectorFilterOperator.java:111)
   at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:815)
   at 
 org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:95)
   at 
 org.apache.hadoop.hive.ql.exec.MapOperator$MapOpCtx.forward(MapOperator.java:157)
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
   ... 17 more
 Caused by: java.lang.ClassCastException: 
 org.apache.hadoop.hive.serde2.io.HiveCharWritable cannot be cast to 
 org.apache.hadoop.hive.common.type.HiveChar
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorColumnAssignFactory$18.assignObjectValue(VectorColumnAssignFactory.java:432)
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.internalForward(VectorMapJoinOperator.java:196)
   at 
 org.apache.hadoop.hive.ql.exec.CommonJoinOperator.genAllOneUniqueJoinObject(CommonJoinOperator.java:670)
   at 
 org.apache.hadoop.hive.ql.exec.CommonJoinOperator.checkAndGenObject(CommonJoinOperator.java:748)
   at 
 org.apache.hadoop.hive.ql.exec.MapJoinOperator.processOp(MapJoinOperator.java:299)
   ... 24 more
 create table  if not exists T_TSET1 (RNUM int , C1 int, C2 char(3))
  ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
  STORED AS textfile ;
 create table  if not exists T_TSET1 (RNUM int , C1 int, C2 char(3))
  STORED AS ORC ;
 create table  if not exists T_TSET2 (RNUM int , C1 int, C2 char(3))
  ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
  STORED AS textfile ;
 TSET1 data
 0|10|AAA
 1|10|AAA
 2|10|AAA
 3|20|BBB
 4|30|CCC
 5|40|DDD
 6|50|\N
 7|60|\N
 8|\N|AAA
 9|\N|AAA
 10|\N|\N
 11|\N|\N
 TSET2 DATA
 0|10|AAA
 1|10|AAA
 2|40|DDD
 3|50|EEE
 4|60|FFF



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HIVE-9743) incorrect result set for left outer join when executed with tez versus mapreduce

N Campbell created HIVE-9743:


 Summary: incorrect result set for left outer join when executed 
with tez versus mapreduce
 Key: HIVE-9743
 URL: https://issues.apache.org/jira/browse/HIVE-9743
 Project: Hive
  Issue Type: Bug
  Components: SQL
Affects Versions: 0.14.0
Reporter: N Campbell


This query is supposed to return 3 rows and will when run without Tez but 
returns 2 rows when run with Tez.

select tjoin1.rnum, tjoin1.c1, tjoin1.c2, tjoin2.c2 as c2j2 from tjoin1 left 
outer join tjoin2 on ( tjoin1.c1 = tjoin2.c1 and tjoin1.c2  15 )

tjoin1.rnum tjoin1.c1   tjoin1.c2   c2j2
1   20  25  null
2   null  50  null

instead of

tjoin1.rnum tjoin1.c1   tjoin1.c2   c2j2
0   10  15  null
1   20  25  null
2   null  50  null

create table  if not exists TJOIN1 (RNUM int , C1 int, C2 int)
 STORED AS orc ;

0|10|15
1|20|25
2|\N|50

create table  if not exists TJOIN2 (RNUM int , C1 int, C2 char(2))
ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
 STORED AS TEXTFILE ;

0|10|BB
1|15|DD
2|\N|EE
3|10|FF



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HIVE-9745) predicate evaluation of character fields with spaces and literals with spaces returns unexpected result

N Campbell created HIVE-9745:


 Summary: predicate evaluation of character fields with spaces and 
literals with spaces returns unexpected result
 Key: HIVE-9745
 URL: https://issues.apache.org/jira/browse/HIVE-9745
 Project: Hive
  Issue Type: Bug
  Components: SQL
Affects Versions: 0.14.0
Reporter: N Campbell


The following query should return 5 rows but Hive returns 3


select rnum, tchar.cchar from tchar where not (  tchar.cchar = ' ' or ( 
tchar.cchar is null and ' ' is null ))

Consider the following project of the base table


select rnum, tchar.cchar, 
case tchar.cchar when ' ' then 'space' else 'not space' end, 
case when tchar.cchar is null then 'is null' else 'not null' end, case when ' ' 
is null then 'is null' else 'not null' end
from tchar
order by rnum

Row 0 is a NULL
Row 1 was loaded with a zero length string ''
Row 2 was loaded with a single space ' '

rnumtchar.cchar _c2 _c3 _c4
0   null  not space   is null not null
1   not space   not null
not null
2   not space   not null
not null
3   BB  not space   not null
not null
4   EE  not space   not null
not null
5   FF  not space   not null
not null

Explicitly type cast the literal which many  SQL developers would not expect 
need to do.

select rnum, tchar.cchar, 
case tchar.cchar when cast(' ' as char(1)) then 'space' else 'not space' end, 
case when tchar.cchar is null then 'is null' else 'not null' end, case when 
cast( ' ' as char(1)) is null then 'is null' else 'not null' end
from tchar
order by rnum

rnumtchar.cchar _c2 _c3 _c4
0   null  not space   is null not null
1   space   not nullnot null
2   space   not nullnot null
3   BB  not space   not null
not null
4   EE  not space   not null
not null
5   FF  not space   not null
not null


create table  if not exists T_TCHAR ( RNUM int , CCHAR char(32 ))
ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
 STORED AS TEXTFILE  ;

0|\N
1|
2| 
3|BB
4|EE
5|FF


create table  if not exists TCHAR ( RNUM int , CCHAR char(32 ))
 STORED AS orc  ;

insert overwrite table TCHAR select * from  T_TCHAR;




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-9537) string expressions on a fixed length character do not preserve trailing spaces

2015-02-19 Thread N Campbell (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14327408#comment-14327408
 ] 

N Campbell commented on HIVE-9537:
--

I wouldn't suggest that use MySQL and Postgres alone is ideal to cross check 
SQL semantics. 

Should the Apache implementation for CHAR(n) stay this way going forward I 
recommend you get the documentation improved. 

Having seen companies migrate applications across vendors where this sort of 
thing burns them. Many may not know what ISO-SQL 20xx states (let alone care) 
and will presume that their source vendor(s) have followed the specification 
etc. 

Given the server side string operations etc, they may as well stay with VARCHAR 
and know that trailing spaces are not preserved vs the 'appears' to have spaces 
when projected but not in other cases. 

See discussion: 
http://www.postgresql.org/docs/9.1/static/datatype-character.html

See discussion of MySQL JDBC: 
https://dev.mysql.com/doc/connector-j/en/connector-j-reference-configuration-properties.html
 re padCharsWithSpace

If you apply a fold (upper/lower) the type of the result is supposed to be 
from the character value expression

Length should be the number of characters in the string value expression - 
where number of  characters is based upon the semantics of the character set of 
the string value expression

If both operands are fixed length character strings, concatenation result is a 
fixed length character string with a length equal to the sum of the lengths of 
the operands where the length cannot exceed the maximum allowed for a fixed 
length character string.

Cheers.

 string expressions on a fixed length character do not preserve trailing spaces
 --

 Key: HIVE-9537
 URL: https://issues.apache.org/jira/browse/HIVE-9537
 Project: Hive
  Issue Type: Bug
  Components: SQL
Reporter: N Campbell
Assignee: Aihua Xu

 When a string expression such as upper or lower is applied to a fixed length 
 column the trailing spaces of the fixed length character are not preserved.
 {code:sql}
 CREATE TABLE  if not exists TCHAR ( 
 RNUM int, 
 CCHAR char(32)
 )
 ROW FORMAT DELIMITED 
 FIELDS TERMINATED BY '|' 
 LINES TERMINATED BY '\n' 
 STORED AS TEXTFILE;
 {code}
 {{cchar}} as a {{char(32)}}.
 {code:sql}
 select cchar, concat(cchar, cchar), concat(lower(cchar), cchar), 
 concat(upper(cchar), cchar) 
 from tchar;
 {code}
 0|\N
 1|
 2| 
 3|BB
 4|EE
 5|FF



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HIVE-9734) Correlating expression cannot contain unqualified column references

2015-02-19 Thread N Campbell (JIRA)

N Campbell created HIVE-9734:


 Summary: Correlating expression cannot contain unqualified column 
references
 Key: HIVE-9734
 URL: https://issues.apache.org/jira/browse/HIVE-9734
 Project: Hive
  Issue Type: Bug
  Components: SQL
Affects Versions: 0.14.0
Reporter: N Campbell
Priority: Minor


The following valid ISO-SQL constructs will not parse

Unsupported SubQuery Expression 'c1': Correlating expression cannot contain 
unqualified column references

The statement will parse if the predicate is changed to where tjoin1.c1 in ( 
)

select tjoin1.rnum, tjoin1.c1, tjoin1.c2 from tjoin1 where c1 in ( select c1 
from tjoin1) 

select tjoin1.rnum, tjoin1.c1, tjoin1.c2 from tjoin1 where c1 in ( select 
tjoin1.c1 from tjoin1) 

select tjoin1.rnum, tjoin1.c1, tjoin1.c2 from tjoin1 where c1 in ( select x.c1 
from tjoin1 x) 




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HIVE-9735) aggregate ( smalllint ) fails when ORC file used ava.lang.ClassCastException: java.lang.Long cannot be cast to java.lang.Short

2015-02-19 Thread N Campbell (JIRA)

N Campbell created HIVE-9735:


 Summary: aggregate ( smalllint ) fails when ORC file used 
ava.lang.ClassCastException: java.lang.Long cannot be cast to java.lang.Short
 Key: HIVE-9735
 URL: https://issues.apache.org/jira/browse/HIVE-9735
 Project: Hive
  Issue Type: Bug
  Components: SQL
Affects Versions: 0.14.0
Reporter: N Campbell


select min( tsint.csint ) from tsint 
select max( tsint.csint ) from tsint
ava.lang.ClassCastException: java.lang.Long cannot be cast to java.lang.Short

select min( t_tsint.csint ) from t_tsint 

create table  if not exists T_TSINT ( RNUM int , CSINT smallint   )
 ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
 STORED AS textfile  ;

create table  if not exists TSINT ( RNUM int , CSINT smallint   )
TERMINATED BY '\n' 
 STORED AS orc  ;

input data loaded into text file and then inserted into ORC table from text 
based table

0|\N
1|-1
2|0
3|1
4|10




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Reopened] (HIVE-9537) string expressions on a fixed length character do not preserve trailing spaces

2015-02-18 Thread N Campbell (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell reopened HIVE-9537:
--

The Hive documentation is vague at best with respect to when padding is 
preserved/ignored:

Char types are similar to Varchar but they are fixed-length meaning that 
values shorter than the specified length value are padded with spaces but 
trailing spaces are not important during comparisons. The maximum length is 
fixed at 255. 

There is no discussion on non-comparison operations such as upper, lower, 
concat etc.

Consider the following, the driver may return CCHAR will trailing blanks but a 
string operation such as concat fails to preserve them. Should an application 
locally perform a scalar operation on the returned value such as LEN, LOWER etc 
then it may retain the spaces. Meanwhile server side an 'equivalent' expression 
is not blank preserving.

select rnum, cchar, concat( concat( concat( cchar,'...'), cchar),'...') from 
tchar. 

So the driver will return BBspaces and then BB...BB... for the 2nd and 3rd 
projected item. Similarly length(cchar) returns 2 and not 5 etc.

Customers using technologies such as Hana, DB2, Netezza, ... will expect the 
blank padded behaviour. To all intents and purposes most SQL persons would not 
consider the implementation to be fixed length character.

i.e length(cchar) returns 32

i.e cchar || '...' . returns 'BB  ...BB 
 ...'

Should this be the design intent of Hive I would ask for the documentation to 
be far more comprehensive is stating the semantics. 



 string expressions on a fixed length character do not preserve trailing spaces
 --

 Key: HIVE-9537
 URL: https://issues.apache.org/jira/browse/HIVE-9537
 Project: Hive
  Issue Type: Bug
  Components: SQL
Reporter: N Campbell
Assignee: Aihua Xu

 When a string expression such as upper or lower is applied to a fixed length 
 column the trailing spaces of the fixed length character are not preserved.
 {code:sql}
 CREATE TABLE  if not exists TCHAR ( 
 RNUM int, 
 CCHAR char(32)
 )
 ROW FORMAT DELIMITED 
 FIELDS TERMINATED BY '|' 
 LINES TERMINATED BY '\n' 
 STORED AS TEXTFILE;
 {code}
 {{cchar}} as a {{char(32)}}.
 {code:sql}
 select cchar, concat(cchar, cchar), concat(lower(cchar), cchar), 
 concat(upper(cchar), cchar) 
 from tchar;
 {code}
 0|\N
 1|
 2| 
 3|BB
 4|EE
 5|FF



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HIVE-9598) java.lang.IllegalMonitorStateException/java.util.concurrent.locks.ReentrantLock$Sync.tryRelease if ResultSet.closed called after Statement.close called

2015-02-06 Thread N Campbell (JIRA)

N Campbell created HIVE-9598:


 Summary: 
java.lang.IllegalMonitorStateException/java.util.concurrent.locks.ReentrantLock$Sync.tryRelease
 if ResultSet.closed called after Statement.close called
 Key: HIVE-9598
 URL: https://issues.apache.org/jira/browse/HIVE-9598
 Project: Hive
  Issue Type: Bug
  Components: JDBC
Affects Versions: 0.13.0
Reporter: N Campbell


http://docs.oracle.com/javase/7/docs/api/java/sql/ResultSet.html#close()
http://docs.oracle.com/javase/7/docs/api/java/sql/Statement.html#close()

Statement stmt;
try {
stmt = dbConnection.createStatement();
stmt.executeQuery(select* from t);
ResultSet rs = stmt.getResultSet();
stmt.close();
if (rs != null) {
System.out.println(IS NOT NULL);
// Hive does not implement isClosed()
//  if (!rs.isClosed()) {
//  System.out.println(IS NOT CLOSED);
//  }
rs.close();
}
} catch (SQLException e) {
// TODO Auto-generated catch block
e.printStackTrace();
}

Exception in thread main java.lang.IllegalMonitorStateException
at 
java.util.concurrent.locks.ReentrantLock$Sync.tryRelease(ReentrantLock.java:166)
at 
java.util.concurrent.locks.AbstractQueuedSynchronizer.release(AbstractQueuedSynchronizer.java:1271)
at 
java.util.concurrent.locks.ReentrantLock.unlock(ReentrantLock.java:471)
at 
org.apache.hive.jdbc.HiveStatement.closeClientOperation(HiveStatement.java:175)
at 
org.apache.hive.jdbc.HiveQueryResultSet.close(HiveQueryResultSet.java:293)

/D:/JDBC/Hortonworks_Hive13/commons-configuration-1.6.jar
/D:/JDBC/Hortonworks_Hive13/commons-logging-1.1.3.jar
/D:/JDBC/Hortonworks_Hive13/hadoop-common-2.4.0.2.1.1.0-385.jar
/D:/JDBC/Hortonworks_Hive13/hive-exec-0.13.0.2.1.1.0-385.jar
/D:/JDBC/Hortonworks_Hive13/hive-jdbc-0.13.0.2.1.1.0-385.jar
/D:/JDBC/Hortonworks_Hive13/hive-service-0.13.0.2.1.1.0-385.jar
/D:/JDBC/Hortonworks_Hive13/httpclient-4.2.5.jar
/D:/JDBC/Hortonworks_Hive13/httpcore-4.2.5.jar
/D:/JDBC/Hortonworks_Hive13/libfb303-0.9.0.jar
/D:/JDBC/Hortonworks_Hive13/libthrift-0.9.0.jar
/D:/JDBC/Hortonworks_Hive13/log4j-1.2.16.jar
/D:/JDBC/Hortonworks_Hive13/slf4j-api-1.7.5.jar
/D:/JDBC/Hortonworks_Hive13/slf4j-log4j12-1.7.5.jar



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HIVE-9535) add support for NULL ORDER specification in windowed aggregates

N Campbell created HIVE-9535:


 Summary: add support for NULL ORDER specification in windowed 
aggregates
 Key: HIVE-9535
 URL: https://issues.apache.org/jira/browse/HIVE-9535
 Project: Hive
  Issue Type: Improvement
  Components: SQL
Reporter: N Campbell


Cannot explicitly state how nulls order in windowed aggregates. 

select rnum, c1, c2, c3, rank() over(partition by c1,c2 order by c3 desc nulls 
last),  rank() over(partition by c1,c2 order by c3 desc nulls first)  from tolap

while faking with an expression may simulate the intent supporting the ISO 
standard would be preferred.

select rnum, c1, c2, c3, rank() over(partition by c1,c2 order by c3 desc nulls 
last),  rank() over(partition by c1,c2 order by c3 desc nulls first)  from tolap



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HIVE-9530) constant * column is null interpreted as constant * boolean

N Campbell created HIVE-9530:


 Summary: constant * column is null interpreted as constant * 
boolean
 Key: HIVE-9530
 URL: https://issues.apache.org/jira/browse/HIVE-9530
 Project: Hive
  Issue Type: Bug
  Components: SQL
Affects Versions: 0.14.0
Reporter: N Campbell
Priority: Minor


select c1 from tversion where 1 * cnnull is null

FAILED: SemanticException [Error 10014]: Line 1:30 Wrong arguments 'cnnull': No 
matching method for class 
org.apache.hadoop.hive.ql.udf.generic.GenericUDFOPMultiply with (int, boolean)

create table  if not exists TVERSION (RNUM int , C1 int, CVER char(6), CNNULL 
int, CCNULL char(1))
 ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
 STORED AS TEXTFILE ;



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HIVE-9532) add support for quantified predicates

N Campbell created HIVE-9532:


 Summary: add support for quantified predicates 
 Key: HIVE-9532
 URL: https://issues.apache.org/jira/browse/HIVE-9532
 Project: Hive
  Issue Type: Improvement
  Components: SQL
Reporter: N Campbell


allow a quantified predicate using ALL/ANY/SOME

select rnum, c1, c2 from tjoin2 where 20  some ( select c1 from tjoin1)

Error while compiling statement: FAILED: ParseException line 1:50 cannot 
recognize input near 'select' 'c1' 'from' in function specification



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HIVE-9531) remove requirement that EXISTS subquery must be correlated

N Campbell created HIVE-9531:


 Summary: remove requirement that EXISTS subquery must be correlated
 Key: HIVE-9531
 URL: https://issues.apache.org/jira/browse/HIVE-9531
 Project: Hive
  Issue Type: Improvement
  Components: SQL
Reporter: N Campbell


lift the documented restriction that a exists/not exists must involve a 
correlated sub-query

SQL: select rnum, c1, c2 from tjoin2 where exists ( select c1 from tjoin1)
FAILED: SemanticException Line 1:62 Invalid SubQuery expression 'c1' in 
definition of SubQuery sq_1 [
exists ( select c1 from tjoin1)
] used as sq_1 at Line 1:38: For Exists/Not Exists operator SubQuery must be 
Correlated.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HIVE-9534) incorrect result set for query that projects a windowed aggregate

N Campbell created HIVE-9534:


 Summary: incorrect result set for query that projects a windowed 
aggregate
 Key: HIVE-9534
 URL: https://issues.apache.org/jira/browse/HIVE-9534
 Project: Hive
  Issue Type: Bug
  Components: SQL
Reporter: N Campbell


Result set returned by Hive has one row instead of 5

select avg( distinct tsint.csint ) over () from tsint 

create table  if not exists TSINT ( RNUM int , CSINT smallint   )
 ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
 STORED AS TEXTFILE  ;

0|\N
1|-1
2|0
3|1
4|10




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HIVE-9536) precision of a string concatenation becomes huge precision

N Campbell created HIVE-9536:


 Summary: precision of a string concatenation becomes huge precision
 Key: HIVE-9536
 URL: https://issues.apache.org/jira/browse/HIVE-9536
 Project: Hive
  Issue Type: Bug
  Components: SQL
Reporter: N Campbell


where CCHAR is CHAR(32) and CVCHAR is VARCHAR(32)

select concat ( cchar, cchar ) from t returns a result set where the precision 
of the projected column is 64

meanwhile the following becomes 2147483647. 
select concat('1234567890' , cchar)  from t

meanwhile the following returns 64
select concat(lower(cchar), cchar) from t

create table  if not exists TCHAR ( RNUM int , CCHAR char(32 ))
 ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
 STORED AS TEXTFILE  ;

create table  if not exists TVCHAR ( RNUM int , CVCHAR varchar(32 ))
 ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
 STORED AS TEXTFILE  ;




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HIVE-9537) string expressions on a fixed length character do not preserve trailing spaces

N Campbell created HIVE-9537:


 Summary: string expressions on a fixed length character do not 
preserve trailing spaces
 Key: HIVE-9537
 URL: https://issues.apache.org/jira/browse/HIVE-9537
 Project: Hive
  Issue Type: Bug
  Components: SQL
Reporter: N Campbell


cchar is char(32). when a string expression such as upper or lower is applied 
to a fixed length column the trailing spaces of the fixed length character are 
not preserved.

select cchar, concat(cchar, cchar), concat(lower(cchar), cchar), 
concat(upper(cchar), cchar) from tchar

create table  if not exists TCHAR ( RNUM int , CCHAR char(32 ))
 ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n' 
 STORED AS TEXTFILE  ;

0|\N
1|
2| 
3|BB
4|EE
5|FF




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HIVE-9533) remove restriction that windowed aggregate ORDER BY can only have one key

N Campbell created HIVE-9533:


 Summary: remove restriction that windowed aggregate ORDER BY can 
only have one key
 Key: HIVE-9533
 URL: https://issues.apache.org/jira/browse/HIVE-9533
 Project: Hive
  Issue Type: Improvement
Reporter: N Campbell


current restriction makes the existing support very limited.

select rnum, c1, c2, c3, sum( c3 ) over(partition by c1 order by c2 , c3) from 
tolap

Error: Error while compiling statement: FAILED: SemanticException Range based 
Window Frame can have only 1 Sort Key
SQLState:  42000
ErrorCode: 4



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-9533) remove restriction that windowed aggregate ORDER BY can only have one key


 [ 
https://issues.apache.org/jira/browse/HIVE-9533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-9533:
-
Component/s: SQL

 remove restriction that windowed aggregate ORDER BY can only have one key
 -

 Key: HIVE-9533
 URL: https://issues.apache.org/jira/browse/HIVE-9533
 Project: Hive
  Issue Type: Improvement
  Components: SQL
Reporter: N Campbell

 current restriction makes the existing support very limited.
 select rnum, c1, c2, c3, sum( c3 ) over(partition by c1 order by c2 , c3) 
 from tolap
 Error: Error while compiling statement: FAILED: SemanticException Range based 
 Window Frame can have only 1 Sort Key
 SQLState:  42000
 ErrorCode: 4



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-3245) UTF encoded data not displayed correctly by Hive driver

2013-12-07 Thread N Campbell (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-3245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13842258#comment-13842258
 ] 

N Campbell commented on HIVE-3245:
--

Re the prior comment. If encoding information has to be specified, it should be 
expressed via the connection URL as a name-value property and not assuming that 
a mid-tier application will be started with a collection of -D options.


 UTF encoded data not displayed correctly by Hive driver
 ---

 Key: HIVE-3245
 URL: https://issues.apache.org/jira/browse/HIVE-3245
 Project: Hive
  Issue Type: Bug
  Components: JDBC
Affects Versions: 0.8.0
Reporter: N Campbell
Assignee: Szehon Ho
 Attachments: ASF.LICENSE.NOT.GRANTED--screenshot-1.jpg, CERT.TLJA.txt


 various foreign language data (i.e. japanese, thai etc) is loaded into string 
 columns via tab delimited text files. A simple projection of the columns in 
 the table is not displaying the correct data. Exporting the data from Hive 
 and looking at the files implies the data is loaded properly. it appears to 
 be an encoding issue at the driver but unaware of any required URL connection 
 properties re encoding that Hive JDBC requires.
 create table if not exists CERT.TLJA_JP_E ( RNUM int , C1 string, ORD int)
 row format delimited
 fields terminated by '\t'
 stored as textfile;
 create table if not exists CERT.TLJA_JP ( RNUM int , C1 string, ORD int)
 stored as sequencefile;
 load data local inpath '/home/hadoopadmin/jdbc-cert/CERT/CERT.TLJA_JP.txt'
 overwrite into table CERT.TLJA_JP_E;
 insert overwrite table CERT.TLJA_JP  select * from CERT.TLJA_JP_E;



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-4662) first_value can't have more than one order by column

2013-10-13 Thread N Campbell (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-4662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13793651#comment-13793651
 ] 

N Campbell commented on HIVE-4662:
--

That is a fairly significant limitation.

 first_value can't have more than one order by column
 

 Key: HIVE-4662
 URL: https://issues.apache.org/jira/browse/HIVE-4662
 Project: Hive
  Issue Type: Bug
  Components: SQL
Affects Versions: 0.11.0
Reporter: Frans Drijver

 In the current implementation of the first_value function, it's not allowed 
 to have more than one (1) order by column, as so:
 {quote}
 select distinct 
 first_value(kastr.DEWNKNR) over ( partition by kastr.DEKTRNR order by 
 kastr.DETRADT, kastr.DEVPDNR )
 from RTAVP_DRKASTR kastr
 ;
 {quote}
 Error given:
 {quote}
 FAILED: SemanticException Range based Window Frame can have only 1 Sort Key
 {quote}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-4662) first_value can't have more than one order by column

2013-10-13 Thread N Campbell (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-4662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13793652#comment-13793652
 ] 

N Campbell commented on HIVE-4662:
--

not just first value

select c1, c2, sum ( c3 ) over ( partition by c1 order by c2, c3 ) from t

 first_value can't have more than one order by column
 

 Key: HIVE-4662
 URL: https://issues.apache.org/jira/browse/HIVE-4662
 Project: Hive
  Issue Type: Bug
  Components: SQL
Affects Versions: 0.11.0
Reporter: Frans Drijver

 In the current implementation of the first_value function, it's not allowed 
 to have more than one (1) order by column, as so:
 {quote}
 select distinct 
 first_value(kastr.DEWNKNR) over ( partition by kastr.DEKTRNR order by 
 kastr.DETRADT, kastr.DEVPDNR )
 from RTAVP_DRKASTR kastr
 ;
 {quote}
 Error given:
 {quote}
 FAILED: SemanticException Range based Window Frame can have only 1 Sort Key
 {quote}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-3745) Hive does improper = based string comparisons for strings with trailing whitespaces

2013-10-13 Thread N Campbell (JIRA)

[
https://issues.apache.org/jira/browse/HIVE-3745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13793658#comment-13793658
]

N Campbell commented on HIVE-3745:
--

The ISO-SQL standard is very clear as to what a vendor may choose to do re
blank padding semantics. Similarly, how operations such as min, max, distinct
etc operate on a variable length character type. Persons simply comparing to
another RDBMS need to compare what ISO states, where it allows 'vendor
implementation' and to see what a given vendor claims.

For example, if you were to use Postgres and other vendors derived from it. You
will find various differences with respect to

length( char (n) ) vs varchar(n)
group by
min
distinct/union

To some persons trailing spaces are of no interest and they may assume that one
general string type will 'ignore' spaces. Others may state to their business
application that trailing spaces are significant. That is distinct from what a
given standard states or perhaps what a vendor chooses to implement
irrespective of any given standard.

It would help either way if the Hive QL documentation could be improved to
state intent of a given construct/feature.

Hive does improper = based string comparisons for strings with trailing
whitespaces
-

Key: HIVE-3745
URL: https://issues.apache.org/jira/browse/HIVE-3745
Project: Hive
Issue Type: Bug
Components: SQL
Affects Versions: 0.9.0
Reporter: Harsh J
Assignee: Kevin Wilfong

Compared to other systems such as DB2, MySQL, etc., which disregard trailing
whitespaces in a string used when comparing two strings with the {{=}}
relational operator, Hive does not do this.
For example, note the following line from the MySQL manual:
http://dev.mysql.com/doc/refman/5.1/en/char.html
{quote}
All MySQL collations are of type PADSPACE. This means that all CHAR and
VARCHAR values in MySQL are compared without regard to any trailing spaces.
{quote}
Hive still is whitespace sensitive and regards trailing spaces of a string as
worthy elements when comparing. Ideally {{LIKE}} should consider this
strongly, but {{=}} should not.
Is there a specific reason behind this difference of implementation in Hive's
SQL?

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5529) precision and scale for DECIMAL type p=2147483647 s=2147483647

2013-10-12 Thread N Campbell (JIRA)

N Campbell created HIVE-5529:


 Summary: precision and scale for DECIMAL type p=2147483647 
s=2147483647
 Key: HIVE-5529
 URL: https://issues.apache.org/jira/browse/HIVE-5529
 Project: Hive
  Issue Type: Bug
  Components: JDBC
Affects Versions: 0.11.0
Reporter: N Campbell
Priority: Critical


If you declare a column of type DECIMAL you cannot define a precision and scale 
(per ISO-SQL). Should you attempt to describe a column in a resulset etc which 
is a decimal type it reports p=2147483647 s=2147483647.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-3173) implement getTypeInfo database metadata method

2013-10-12 Thread N Campbell (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-3173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13793491#comment-13793491
 ] 

N Campbell commented on HIVE-3173:
--

Hive 0.11 fails to define prefix and suffix information etc.

TYPE_NAME   DATA_TYPE   PRECISION   LITERAL_PREFIX  LITERAL_SUFFIX  
CREATE_PARAMS   NULLABLECASE_SENSITIVE  SEARCHABLE  
UNSIGNED_ATTRIBUTE  FIXED_PREC_SCALEAUTO_INCREMENT  LOCAL_TYPE_NAME 
MINIMUM_SCALE   MAXIMUM_SCALE   SQL_DATA_TYPE   SQL_DATETIME_SUB
NUM_PREC_RADIX
ARRAY   12 [VARCHAR]null  null  null  null  truefalse   no 
support  truefalse   false   null  0   0   null  null  
null
BIGINT  -5 [BIGINT] 19  null  null  null  truefalse   
supports all WHERE  false   false   false   null  0   0   null  
null  10
BINARY  -2 [BINARY] null  null  null  null  truefalse   
supports all WHERE  truefalse   false   null  0   0   null  
null  null
BOOLEAN 16 [BOOLEAN]null  null  null  null  truefalse   
supports all WHERE  truefalse   false   null  0   0   null  
null  null
DECIMAL 3 [DECIMAL] null  null  null  null  truefalse   
supports all WHERE  false   false   false   null  0   0   null  
null  null
DOUBLE  8 [DOUBLE]  15  null  null  null  truefalse   
supports all WHERE  false   false   false   null  0   0   null  
null  2
FLOAT   6 [FLOAT]   7   null  null  null  truefalse   
supports all WHERE  false   false   false   null  0   0   null  
null  2
INT 4 [INTEGER] 10  null  null  null  truefalse   
supports all WHERE  false   false   false   null  0   0   null  
null  10
MAP 12 [VARCHAR]null  null  null  null  truefalse   no 
support  truefalse   false   null  0   0   null  null  
null
SMALLINT5 [SMALLINT]5   null  null  null  truefalse   
supports all WHERE  false   false   false   null  0   0   null  
null  10
STRING  12 [VARCHAR]null  null  null  null  truetrue
supports all WHERE  truefalse   false   null  0   0   null  
null  null
STRUCT  12 [VARCHAR]null  null  null  null  truefalse   no 
support  truefalse   false   null  0   0   null  null  
null
TIMESTAMP   93 [TIMESTAMP]  null  null  null  null  truefalse   
supports all WHERE  truefalse   false   null  0   0   null  
null  null
TINYINT -6 [TINYINT]3   null  null  null  truefalse   
supports all WHERE  false   false   false   null  0   0   null  
null  10
UNIONTYPE   12 [VARCHAR]null  null  null  null  truefalse   
no support  truefalse   false   null  0   0   null  null  
null
null  12 [VARCHAR]null  null  null  null  truefalse   no 
support  truefalse   false   null  0   0   null  null  
null


 implement getTypeInfo database metadata method 
 ---

 Key: HIVE-3173
 URL: https://issues.apache.org/jira/browse/HIVE-3173
 Project: Hive
  Issue Type: Bug
  Components: JDBC
Affects Versions: 0.8.1
Reporter: N Campbell
 Attachments: Hive-3173.patch.txt


 The JDBC driver does not implement the database metadata method getTypeInfo. 
 Hence, an application cannot dynamically determine the available type 
 information and associated properties. 



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5530) null pointer exception when case returns null

2013-10-12 Thread N Campbell (JIRA)

N Campbell created HIVE-5530:


 Summary: null pointer exception when case returns null
 Key: HIVE-5530
 URL: https://issues.apache.org/jira/browse/HIVE-5530
 Project: Hive
  Issue Type: Bug
  Components: SQL
Affects Versions: 0.11.0
Reporter: N Campbell
Priority: Minor


The following expression will cause an NPE
 
select case when 1 = 1 then null end  from t



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5473) sqrt of -ve value returns null instead of throwing an error

2013-10-07 Thread N Campbell (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-5473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13788638#comment-13788638
 ] 

N Campbell commented on HIVE-5473:
--

other cases are

select 0 / 0 from tversion
select sqrt ( -4 ) from tversion
select power( 0, -1) from tversion

note, in ISO these should be exceptions
in the first case you get a garbage value lack and infinity in the others
so even if design intent in hive the docs make no statement etc.

 sqrt of -ve value returns null instead of throwing an error
 ---

 Key: HIVE-5473
 URL: https://issues.apache.org/jira/browse/HIVE-5473
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.11.0
Reporter: N Campbell
Priority: Minor

 select sqrt( -4 ) from t
 will return a null instead of throwing an  exception. 
 no discussion on web page that this would be by design to not throw an error.
 https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF#LanguageManualUDF-DateFunctions



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5460) invalid offsets in lag lead should return an exception (per ISO-SQL)

N Campbell created HIVE-5460:


 Summary: invalid offsets in lag lead should return an exception 
(per ISO-SQL) 
 Key: HIVE-5460
 URL: https://issues.apache.org/jira/browse/HIVE-5460
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.11.0
Reporter: N Campbell
Priority: Minor


ISO-SQL 2012 defines how lag and lead should behave when invalid offsets are 
provided to the functions.

i.e. select tint.rnum,tint.cint, lag( tint.cint, -100 )  over ( order by 
tint.rnum) from tint tint 

Instead of a meaningful error (as other vendors will emit) you get 

Error: Query returned non-zero code: 2, cause: FAILED: Execution Error, return 
code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask
SQLState:  08S01
ErrorCode: 2



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5461) lag lead do not offer ISO-SQL 2012 null treatment

N Campbell created HIVE-5461:


 Summary: lag lead do not offer ISO-SQL 2012 null treatment 
 Key: HIVE-5461
 URL: https://issues.apache.org/jira/browse/HIVE-5461
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.11.0
Reporter: N Campbell
Priority: Minor


ISO-SQL lag and lead provide clauses that allow the author to denote how nulls 
are treated (respect or ignore nulls). Hive does not.




--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5462) improve error when an approximate numeric literal provided to lag lead

N Campbell created HIVE-5462:


 Summary: improve error when an approximate numeric literal 
provided to lag lead 
 Key: HIVE-5462
 URL: https://issues.apache.org/jira/browse/HIVE-5462
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.11.0
Reporter: N Campbell
Priority: Minor


ISO SQL would define this as an error.

select vint.rnum,vint.cint, lead( vint.cint, 1.70 )  over ( order by vint.rnum) 
from vint vint 

The issue is the error message returned by Hive

Query returned non-zero code: 4, cause: FAILED: SemanticException Failed to 
breakup Windowing invocations into Groups. At least 1 group must only depend on 
input columns. Also check for circular dependencies.
Underlying error: org.apache.hadoop.hive.ql.exec.UDFArgumentTypeException: Lead 
amount must be a integer value double was passed as parameter 1.




--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5463) support hypothetical ranking aggregates

N Campbell created HIVE-5463:


 Summary: support hypothetical ranking aggregates
 Key: HIVE-5463
 URL: https://issues.apache.org/jira/browse/HIVE-5463
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.11.0
Reporter: N Campbell
Priority: Minor


existing function support does not include forms as 

select rank ( 5 ) within group ( order by vsint.csint ) from vsint vsint



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5464) allow OR conditions in table join

N Campbell created HIVE-5464:


 Summary: allow OR conditions in table join
 Key: HIVE-5464
 URL: https://issues.apache.org/jira/browse/HIVE-5464
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.11.0
Reporter: N Campbell


select tjoin1.c1, tjoin1.c2, tjoin2.c2 as c2j2 from tjoin1 tjoin1 inner join 
tjoin2 tjoin2 on ( tjoin1.c1 = 10 or tjoin1.c1=20 )

Query returned non-zero code: 10019, cause: FAILED: SemanticException [Error 
10019]: Line 1:96 OR not supported in JOIN currently '20'



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5465) unable to express nested join

N Campbell created HIVE-5465:


 Summary: unable to express nested join 
 Key: HIVE-5465
 URL: https://issues.apache.org/jira/browse/HIVE-5465
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.11.0
Reporter: N Campbell


select tjoin2.rnum, tjoin1.c1, tjoin1.c2, tjoin2.c2 as c2j2, tjoin3.c2 as 
c2j3,tjoin4.c2 as c2j4 from  (tjoin1 tjoin1 right outer join (tjoin2 tjoin2 
left outer join tjoin3 tjoin3 on tjoin2.c1=tjoin3.c1) on (tjoin1.c1=tjoin2.c1)) 
left outer join tjoin4 tjoin4 on (tjoin1.c1=tjoin4.c1)

Query returned non-zero code: 4, cause: FAILED: ParseException line 1:104 
cannot recognize input near 'tjoin1' 'tjoin1' 'right' in subquery source




--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5466) add support for table and row constructors in a statement

N Campbell created HIVE-5466:


 Summary: add support for table and row constructors in a statement
 Key: HIVE-5466
 URL: https://issues.apache.org/jira/browse/HIVE-5466
 Project: Hive
  Issue Type: Improvement
Reporter: N Campbell
Priority: Minor


Various filtering requirements can be expressed and performed more efficiently 
when row and table constructors can be pushed into a statement 

where (c1, c2 ) .. ( z1, z2) 

 select t1.c1, t1.c2 from (values (10,'BB')) t1(c1,c2)

etc



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-5460) invalid offsets in lag lead should return an exception (per ISO-SQL)


[ 
https://issues.apache.org/jira/browse/HIVE-5460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13787673#comment-13787673
 ] 

N Campbell commented on HIVE-5460:
--

You would need to go to the ISO web site 
http://www.iso.org/iso/home/store/catalogue_ics/catalogue_detail_ics.htm?csnumber=53681

 invalid offsets in lag lead should return an exception (per ISO-SQL) 
 -

 Key: HIVE-5460
 URL: https://issues.apache.org/jira/browse/HIVE-5460
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.11.0
Reporter: N Campbell
Assignee: Edward Capriolo
Priority: Minor

 ISO-SQL 2012 defines how lag and lead should behave when invalid offsets are 
 provided to the functions.
 i.e. select tint.rnum,tint.cint, lag( tint.cint, -100 )  over ( order by 
 tint.rnum) from tint tint 
 Instead of a meaningful error (as other vendors will emit) you get 
 Error: Query returned non-zero code: 2, cause: FAILED: Execution Error, 
 return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask
 SQLState:  08S01
 ErrorCode: 2



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5460) invalid offsets in lag lead should return an exception (per ISO-SQL)


 [ 
https://issues.apache.org/jira/browse/HIVE-5460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-5460:
-

Description: 
ISO-SQL 2011 defines how lag and lead should behave when invalid offsets are 
provided to the functions.

i.e. select tint.rnum,tint.cint, lag( tint.cint, -100 )  over ( order by 
tint.rnum) from tint tint 

Instead of a meaningful error (as other vendors will emit) you get 

Error: Query returned non-zero code: 2, cause: FAILED: Execution Error, return 
code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask
SQLState:  08S01
ErrorCode: 2

  was:
ISO-SQL 2012 defines how lag and lead should behave when invalid offsets are 
provided to the functions.

i.e. select tint.rnum,tint.cint, lag( tint.cint, -100 )  over ( order by 
tint.rnum) from tint tint 

Instead of a meaningful error (as other vendors will emit) you get 

Error: Query returned non-zero code: 2, cause: FAILED: Execution Error, return 
code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask
SQLState:  08S01
ErrorCode: 2


 invalid offsets in lag lead should return an exception (per ISO-SQL) 
 -

 Key: HIVE-5460
 URL: https://issues.apache.org/jira/browse/HIVE-5460
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.11.0
Reporter: N Campbell
Assignee: Edward Capriolo
Priority: Minor

 ISO-SQL 2011 defines how lag and lead should behave when invalid offsets are 
 provided to the functions.
 i.e. select tint.rnum,tint.cint, lag( tint.cint, -100 )  over ( order by 
 tint.rnum) from tint tint 
 Instead of a meaningful error (as other vendors will emit) you get 
 Error: Query returned non-zero code: 2, cause: FAILED: Execution Error, 
 return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask
 SQLState:  08S01
 ErrorCode: 2



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5467) allow null ordering specification in a windowed aggregate

N Campbell created HIVE-5467:


 Summary: allow null ordering specification in a windowed aggregate
 Key: HIVE-5467
 URL: https://issues.apache.org/jira/browse/HIVE-5467
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.11.0
Reporter: N Campbell


A business user may require that nulls are sorted first with respect to an 
analytical expression as opposed to last. Hive does not support the null 
ordering specification.

select rnum, c1, c2, c3, rank() over(order  c1 desc nulls first  ) from tolap 
order by rnum
Error: Query returned non-zero code: 4, cause: FAILED: ParseException line 
1:37 cannot recognize input near 'order' 'c1' 'desc' in order by clause


You can fake is by ordering on a case expression

select rnum, c1, c2, c3, rank() over(order by case when c1 is null then 0 else 
1 end, c1 desc  ) from tolap order by rnum




--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5468) allow null ordering to be defined on cursor specification

N Campbell created HIVE-5468:


 Summary: allow null ordering to be defined on cursor specification
 Key: HIVE-5468
 URL: https://issues.apache.org/jira/browse/HIVE-5468
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.11.0
Reporter: N Campbell


Similar comment to HIVE-5467

select rnum, c1, c2 from tset1 tset1 order by c1 nulls first, c2 nulls last




--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5469) support nullif

N Campbell created HIVE-5469:


 Summary: support nullif
 Key: HIVE-5469
 URL: https://issues.apache.org/jira/browse/HIVE-5469
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.11.0
Reporter: N Campbell
Priority: Minor


Have to express case expression to work around lack of NULLIF

select nullif(cint, 1) from tint

select cint, case when cint = 1 then null else cint end from tint





--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-3194) implement the JDBC canonical/ISO-SQL 2011 scalar functions


 [ 
https://issues.apache.org/jira/browse/HIVE-3194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-3194:
-

Summary: implement the JDBC canonical/ISO-SQL 2011 scalar functions  (was: 
implement the JDBC canonical/ISO-SQL 2012 scalar functions)

 implement the JDBC canonical/ISO-SQL 2011 scalar functions
 --

 Key: HIVE-3194
 URL: https://issues.apache.org/jira/browse/HIVE-3194
 Project: Hive
  Issue Type: Improvement
  Components: JDBC, SQL, UDF
Affects Versions: 0.8.0, 0.11.0
Reporter: N Campbell
 Attachments: Hive-3194.patch


 The Hive driver does not return any canonical JDBC scalar functions (per JDBC 
 3.0/4.0). Similarly, while Hive has various 'similar' scalar functions their 
 signature (name etc) is not the same as ISO-SQL equivalent. It would be 
 better for portable dynamic SQL applications to see more alignment by Hive 
 etc to both standards. In some cases it is small syntactic changes/tweaks 
 (i.e. see mod, substring etc).



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-5461) lag lead do not offer ISO-SQL 2011 null treatment


 [ 
https://issues.apache.org/jira/browse/HIVE-5461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-5461:
-

Summary: lag lead do not offer ISO-SQL 2011 null treatment   (was: lag lead 
do not offer ISO-SQL 2012 null treatment )

 lag lead do not offer ISO-SQL 2011 null treatment 
 --

 Key: HIVE-5461
 URL: https://issues.apache.org/jira/browse/HIVE-5461
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.11.0
Reporter: N Campbell
Priority: Minor

 ISO-SQL lag and lead provide clauses that allow the author to denote how 
 nulls are treated (respect or ignore nulls). Hive does not.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-3188) allow cast to use double precision per ISO-SQL 2011


 [ 
https://issues.apache.org/jira/browse/HIVE-3188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-3188:
-

Summary: allow cast to use double precision per ISO-SQL 2011  (was: allow 
cast to use double precision per ISO-SQL 2012)

 allow cast to use double precision per ISO-SQL 2011
 ---

 Key: HIVE-3188
 URL: https://issues.apache.org/jira/browse/HIVE-3188
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.8.0
Reporter: N Campbell

 While ISO-SQL allows cast('10.3' as double precision) Hive forces you to say 
  cast('10.3' as double ). Accept the additional keyword.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-3190) allow INTEGER as a type name in a column/cast expression (per ISO-SQL 2011)


 [ 
https://issues.apache.org/jira/browse/HIVE-3190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-3190:
-

Summary: allow INTEGER as a type name in a column/cast expression (per 
ISO-SQL 2011)  (was: allow INTEGER as a type name in a column/cast expression 
(per ISO-SQL 2012))

 allow INTEGER as a type name in a column/cast expression (per ISO-SQL 2011)
 ---

 Key: HIVE-3190
 URL: https://issues.apache.org/jira/browse/HIVE-3190
 Project: Hive
  Issue Type: Improvement
  Components: SQL
Affects Versions: 0.8.0
Reporter: N Campbell

 Just extend the parser to allow INTEGER instead of making folks use INT
 select cast('10' as integer) from cert.tversion tversion
 FAILED: Parse Error: line 1:20 cannot recognize input near 'integer' ')' 
 'from' in primitive type specification



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-3184) remove existing restriction that union must be defined within a derived table (follow ISO-SQL 2011)


 [ 
https://issues.apache.org/jira/browse/HIVE-3184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-3184:
-

Summary: remove existing restriction that union must be defined within a 
derived table (follow ISO-SQL 2011)  (was: remove existing restriction that 
union must be defined within a derived table (follow ISO-SQL 2012))

 remove existing restriction that union must be defined within a derived table 
 (follow ISO-SQL 2011)
 ---

 Key: HIVE-3184
 URL: https://issues.apache.org/jira/browse/HIVE-3184
 Project: Hive
  Issue Type: Improvement
  Components: SQL
Affects Versions: 0.8.0
Reporter: N Campbell

 Don't make SQL applications have to wrap the union operation within a derived 
 table per the SQL standard. Failing to do so results in Top level UNION is 
 not supported currently; use a subquery
 i.e.
 select c1, c2 from cert.tset1 tset1 union all select c1, c2 from cert.tset2 
 tset2
 vs 
 select * from ( select c1, c2 from cert.tset1 tset1 union all select c1, c2 
 from cert.tset2 tset2 ) T 



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-3185) allow union set operation without ALL keyword (align to ISO-SQL 2011 specification)


 [ 
https://issues.apache.org/jira/browse/HIVE-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-3185:
-

Summary: allow union set operation without ALL keyword (align to ISO-SQL 
2011 specification)  (was: allow union set operation without ALL keyword (align 
to ISO-SQL 2012 specification))

 allow union set operation without ALL keyword (align to ISO-SQL 2011 
 specification)
 ---

 Key: HIVE-3185
 URL: https://issues.apache.org/jira/browse/HIVE-3185
 Project: Hive
  Issue Type: Improvement
  Components: SQL
Affects Versions: 0.8.0
Reporter: N Campbell

 Allow a union operation to be specified as qb1 union qb2 in addition to 
 th qb1 union all qb2
 Common requirement of queries to project two sets where it is not always the 
 case you want distinct rows in the set. Failing to do so obviously will 
 result in the current exception:
 mismatched input 'select' expecting ALL near 'union' in query operator



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HIVE-3186) support having without a grouping operation (per ISO-SQL 2011)


 [ 
https://issues.apache.org/jira/browse/HIVE-3186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-3186:
-

Summary: support having without a grouping operation (per ISO-SQL 2011)  
(was: support having without a grouping operation (per ISO-SQL 2012))

 support having without a grouping operation (per ISO-SQL 2011)
 --

 Key: HIVE-3186
 URL: https://issues.apache.org/jira/browse/HIVE-3186
 Project: Hive
  Issue Type: Improvement
  Components: SQL
Affects Versions: 0.8.0
Reporter: N Campbell

 select count(*) from cert.tset1 tset1 having count(*)  2
 granted you can manually write a grouped derived table and filter etc but it 
 valid to define a having statement (implicit grouped table).
 HAVING specified without GROUP BY



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5470) support extract function

N Campbell created HIVE-5470:


 Summary: support extract function
 Key: HIVE-5470
 URL: https://issues.apache.org/jira/browse/HIVE-5470
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.11.0
Reporter: N Campbell
Priority: Minor


You have the equivalent scalar functions but do not offer ISO-SQL extract which 
should be a small enhancement.

Note, the second() scalar does not return the fractional seconds in a timestamp 
with ISO-SQL would expect extract (second) to implement.

select extract (year from cts), 

vs 

select rnum, year (cts), month( cts), day( cts), hour(cts), minute (cts), 
second( cts), cts from tts



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5471) allow timestamp literal per ISO-SQL

N Campbell created HIVE-5471:


 Summary: allow timestamp literal per ISO-SQL
 Key: HIVE-5471
 URL: https://issues.apache.org/jira/browse/HIVE-5471
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.11.0
Reporter: N Campbell


Support the ISO-SQL timestamp literal specification

select timestamp '1999-01-01 12:23.30.01' from tversion


instead of having to use

select cast('1999-01-01 12:23:30.01' as timestamp) from tversion





--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5472) support a simple scalar which returns the current timestamp

N Campbell created HIVE-5472:


 Summary: support a simple scalar which returns the current 
timestamp
 Key: HIVE-5472
 URL: https://issues.apache.org/jira/browse/HIVE-5472
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.11.0
Reporter: N Campbell


ISO-SQL has two forms of functions

local and current timestamp where the former is a TIMESTAMP WITHOUT TIMEZONE 
and the latter with TIME ZONE

select cast ( unix_timestamp() as timestamp ) from T

implement a function which computes LOCAL TIMESTAMP which would be the current 
timestamp for the users session time zone.





--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-3183) case expression should allow different types per ISO-SQL 2011


[ 
https://issues.apache.org/jira/browse/HIVE-3183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13787715#comment-13787715
 ] 

N Campbell commented on HIVE-3183:
--

I am not planning to commit code changes to Apache. I'll leave that to
Cloudera, Hortonworks and others who are incrementally improving Hive-QL :-)

The intent is to enable others in the public domain to understand the bugs
and opportunities that exist re Hive-QL types and SQL support distinct from
working within the the supplied grammar or hand writing work arounds etc








 case expression should allow different types per ISO-SQL 2011
 -

 Key: HIVE-3183
 URL: https://issues.apache.org/jira/browse/HIVE-3183
 Project: Hive
  Issue Type: Bug
  Components: SQL
Affects Versions: 0.8.0
Reporter: N Campbell
 Attachments: Hive-3183.patch.txt, udf_when_type_wrong2.q.out, 
 udf_when_type_wrong3.q.out


 The ISO-SQL standard specification for CASE allows the specification to 
 include different types in the WHEN and ELSE blocks including this example 
 which mixes smallint and integer types
 select case when vsint.csint is not null then vsint.csint else 1 end from 
 cert.vsint vsint 
 The Apache Hive docs do not state how it deviates from the standard or any 
 given restrictions so unsure if this is a bug vs an enhancement. Many SQL 
 applications mix so this seems to be a restrictive implementation if this is 
 by design.
 Argument type mismatch '1': The expression after ELSE should have the same 
 type as those after THEN: smallint is expected but int is found



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HIVE-5473) sqrt of -ve value returns null instead of throwing an error

N Campbell created HIVE-5473:


 Summary: sqrt of -ve value returns null instead of throwing an 
error
 Key: HIVE-5473
 URL: https://issues.apache.org/jira/browse/HIVE-5473
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.11.0
Reporter: N Campbell
Priority: Minor


select sqrt( -4 ) from t

will return a null instead of throwing an  exception. 

no discussion on web page that this would be by design to not throw an error.

https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF#LanguageManualUDF-DateFunctions





--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HIVE-3121) JDBC driver's getCatalogs() method returns schema/db information

2013-07-03 Thread N Campbell (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-3121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13699651#comment-13699651
 ] 

N Campbell commented on HIVE-3121:
--

does this mean that getTables will work correctly now when NULL is provided for 
the catalog and schema parameters?

we found in 0.8, 0.9 and 0.10 etc that it is not default context for 
catalog/schema and returns nothing. 

an application is forced to work around this by parsing the connection URL and 
stripping any db info provided or a default name.


 JDBC driver's getCatalogs() method returns schema/db information
 

 Key: HIVE-3121
 URL: https://issues.apache.org/jira/browse/HIVE-3121
 Project: Hive
  Issue Type: Bug
  Components: JDBC
Affects Versions: 0.9.0
Reporter: Carl Steinbach
Assignee: Richard Ding
 Attachments: hive-3121_1.patch, hive-3121.patch




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-4573) JDBC Compliance getTables

2013-07-03 Thread N Campbell (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-4573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13699670#comment-13699670
 ] 

N Campbell commented on HIVE-4573:
--

while it is true that getTableTypes lists the types which exist, the Hive JDBC 
documentation continues to reference you to the standard JDBC pages where those 
types have existed since early days of JDBC.

where was the change in types relative to prior releases etc documented re the 
change and purpose?

as in ODBC, calling getTableTypes might help an application determine what 
types it 'expects' are shown to be supported or 'not'. However, for other types 
it becomes a crap shoot as to what a dynamic SQL application would presume to 
do re types listed that it is not familiar with. 


https://hive.apache.org/docs/r0.10.0/api/org/apache/hadoop/hive/jdbc/HiveDatabaseMetaData.html
and follow the links from getTables().

http://docs.oracle.com/javase/6/docs/api/java/sql/DatabaseMetaData.html?is-external=true#getTables(java.lang.String,
 java.lang.String, java.lang.String, java.lang.String[])

 JDBC Compliance getTables
 -

 Key: HIVE-4573
 URL: https://issues.apache.org/jira/browse/HIVE-4573
 Project: Hive
  Issue Type: Bug
  Components: JDBC
Affects Versions: 0.10.0
Reporter: Johndee Burks
Priority: Minor

 The getTables jdbc function no longer returns information when using normal 
 JDBC table types like TABLE or VIEW. You must now use a more specific type 
 such as MANAGED_TABLE or VIRTUAL_VIEW. An example application that will fail 
 to return results against 0.10 is below, works without issue in 0.9. In my 
 0.10 test I used HS2. 
 {code}
 import java.sql.SQLException;
 import java.sql.Connection;
 import java.sql.ResultSet;
 import java.sql.Statement;
 import java.sql.DriverManager;
 import org.apache.hive.jdbc.HiveDriver;
 import java.sql.DatabaseMetaData;
 public class TestGet {
   private static String driverName = org.apache.hive.jdbc.HiveDriver;
   /**
  * @param args
  * @throws SQLException
*/
   public static void main(String[] args) throws SQLException {
   try {
   Class.forName(driverName);
 } catch (ClassNotFoundException e) {
   // TODO Auto-generated catch block
   e.printStackTrace();
   System.exit(1);
 }
 Connection con = 
 DriverManager.getConnection(jdbc:hive2://hostname:1/default);
 DatabaseMetaData dbmd = con.getMetaData();
 String[] types = {TABLE};
 ResultSet rs = dbmd.getTables(null, null, %, types);
   while (rs.next()) {
 System.out.println(rs.getString(TABLE_NAME));
   }
 }
   }
 }
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-4256) JDBC2 HiveConnection does not use the specified database

2013-07-03 Thread N Campbell (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-4256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13699675#comment-13699675
 ] 

N Campbell commented on HIVE-4256:
--

we have seen 'similar' issues with Hive 0.8, 0.9 and 0.10 re metadata methods 
such as getTables etc failing to work when null is provided as parameter values 
for schema etc. 

 JDBC2 HiveConnection does not use the specified database
 

 Key: HIVE-4256
 URL: https://issues.apache.org/jira/browse/HIVE-4256
 Project: Hive
  Issue Type: Bug
  Components: HiveServer2, JDBC
Affects Versions: 0.11.0
Reporter: Chris Drome
Assignee: Chris Drome

 HiveConnection ignores the database specified in the connection string when 
 configuring the connection.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-3176) implement returning values for SQLException getSQLState()

2013-05-07 Thread N Campbell (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-3176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13650764#comment-13650764
 ] 

N Campbell commented on HIVE-3176:
--

I'd expect the Hive driver to align to the JDBC 3.x or 4.x API specification 
with respect to exception handling re transient and non-transient exceptions 
and the associated SQL state codes

http://docs.oracle.com/javase/7/docs/api/java/sql/package-summary.html

 implement returning values for SQLException getSQLState()
 -

 Key: HIVE-3176
 URL: https://issues.apache.org/jira/browse/HIVE-3176
 Project: Hive
  Issue Type: Improvement
  Components: JDBC
Affects Versions: 0.8.1
Reporter: N Campbell
  Labels: newbie, patch
 Attachments: HIVE-3176.patch.txt


 a dynamic SQL application should be able to check the values returned by 
 getSQLState on a SQLException object. Currently the Hive driver is not doing 
 this (throws exceptions etc).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3245) UTF encoded data not displayed correctly by Hive driver

N Campbell created HIVE-3245:


 Summary: UTF encoded data not displayed correctly by Hive driver
 Key: HIVE-3245
 URL: https://issues.apache.org/jira/browse/HIVE-3245
 Project: Hive
  Issue Type: Bug
  Components: JDBC
Affects Versions: 0.8.0
Reporter: N Campbell


various foreign language data (i.e. japanese, thai etc) is loaded into string 
columns via tab delimited text files. A simple projection of the columns in the 
table is not displaying the correct data. Exporting the data from Hive and 
looking at the files implies the data is loaded properly. it appears to be an 
encoding issue at the driver but unaware of any required URL connection 
properties re encoding that Hive JDBC requires

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3245) UTF encoded data not displayed correctly by Hive driver


 [ 
https://issues.apache.org/jira/browse/HIVE-3245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-3245:
-

Attachment: CERT.TLJA.txt

example JP data

 UTF encoded data not displayed correctly by Hive driver
 ---

 Key: HIVE-3245
 URL: https://issues.apache.org/jira/browse/HIVE-3245
 Project: Hive
  Issue Type: Bug
  Components: JDBC
Affects Versions: 0.8.0
Reporter: N Campbell
 Attachments: CERT.TLJA.txt


 various foreign language data (i.e. japanese, thai etc) is loaded into string 
 columns via tab delimited text files. A simple projection of the columns in 
 the table is not displaying the correct data. Exporting the data from Hive 
 and looking at the files implies the data is loaded properly. it appears to 
 be an encoding issue at the driver but unaware of any required URL connection 
 properties re encoding that Hive JDBC requires

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3245) UTF encoded data not displayed correctly by Hive driver

[
https://issues.apache.org/jira/browse/HIVE-3245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

N Campbell updated HIVE-3245:
-

Description:
various foreign language data (i.e. japanese, thai etc) is loaded into string
columns via tab delimited text files. A simple projection of the columns in the
table is not displaying the correct data. Exporting the data from Hive and
looking at the files implies the data is loaded properly. it appears to be an
encoding issue at the driver but unaware of any required URL connection
properties re encoding that Hive JDBC requires.

create table if not exists CERT.TLJA_JP_E ( RNUM int , C1 string, ORD int)
row format delimited
fields terminated by '\t'
stored as textfile;

create table if not exists CERT.TLJA_JP ( RNUM int , C1 string, ORD int)
stored as sequencefile;

load data local inpath '/home/hadoopadmin/jdbc-cert/CERT/CERT.TLJA_JP.txt'
overwrite into table CERT.TLJA_JP_E;
insert overwrite table CERT.TLJA_JP select * from CERT.TLJA_JP_E;

was:various foreign language data (i.e. japanese, thai etc) is loaded into
string columns via tab delimited text files. A simple projection of the columns
in the table is not displaying the correct data. Exporting the data from Hive
and looking at the files implies the data is loaded properly. it appears to be
an encoding issue at the driver but unaware of any required URL connection
properties re encoding that Hive JDBC requires

UTF encoded data not displayed correctly by Hive driver
---

Key: HIVE-3245
URL: https://issues.apache.org/jira/browse/HIVE-3245
Project: Hive
Issue Type: Bug
Components: JDBC
Affects Versions: 0.8.0
Reporter: N Campbell
Attachments: CERT.TLJA.txt

various foreign language data (i.e. japanese, thai etc) is loaded into string
columns via tab delimited text files. A simple projection of the columns in
the table is not displaying the correct data. Exporting the data from Hive
and looking at the files implies the data is loaded properly. it appears to
be an encoding issue at the driver but unaware of any required URL connection
properties re encoding that Hive JDBC requires.
create table if not exists CERT.TLJA_JP_E ( RNUM int , C1 string, ORD int)
row format delimited
fields terminated by '\t'
stored as textfile;
create table if not exists CERT.TLJA_JP ( RNUM int , C1 string, ORD int)
stored as sequencefile;
load data local inpath '/home/hadoopadmin/jdbc-cert/CERT/CERT.TLJA_JP.txt'
overwrite into table CERT.TLJA_JP_E;
insert overwrite table CERT.TLJA_JP select * from CERT.TLJA_JP_E;

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3245) UTF encoded data not displayed correctly by Hive driver


 [ 
https://issues.apache.org/jira/browse/HIVE-3245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-3245:
-

Attachment: screenshot-1.jpg

example of test shown in SQLSquirrel

 UTF encoded data not displayed correctly by Hive driver
 ---

 Key: HIVE-3245
 URL: https://issues.apache.org/jira/browse/HIVE-3245
 Project: Hive
  Issue Type: Bug
  Components: JDBC
Affects Versions: 0.8.0
Reporter: N Campbell
 Attachments: CERT.TLJA.txt, screenshot-1.jpg


 various foreign language data (i.e. japanese, thai etc) is loaded into string 
 columns via tab delimited text files. A simple projection of the columns in 
 the table is not displaying the correct data. Exporting the data from Hive 
 and looking at the files implies the data is loaded properly. it appears to 
 be an encoding issue at the driver but unaware of any required URL connection 
 properties re encoding that Hive JDBC requires.
 create table if not exists CERT.TLJA_JP_E ( RNUM int , C1 string, ORD int)
 row format delimited
 fields terminated by '\t'
 stored as textfile;
 create table if not exists CERT.TLJA_JP ( RNUM int , C1 string, ORD int)
 stored as sequencefile;
 load data local inpath '/home/hadoopadmin/jdbc-cert/CERT/CERT.TLJA_JP.txt'
 overwrite into table CERT.TLJA_JP_E;
 insert overwrite table CERT.TLJA_JP  select * from CERT.TLJA_JP_E;

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3239) exception will be thrown for timestamp column in resultset from getColumns metadata method

2012-07-06 Thread N Campbell (JIRA)

N Campbell created HIVE-3239:


 Summary: exception will be thrown for timestamp column in 
resultset from getColumns metadata method
 Key: HIVE-3239
 URL: https://issues.apache.org/jira/browse/HIVE-3239
 Project: Hive
  Issue Type: Bug
Reporter: N Campbell


create table cert.tts ( x int, y timestamp )

describe the table column metadata using

rs = meta.getColumns(null, cert, tts, %);
..
while (rs.next()) {
for (int i = 1; i = rsmd.getColumnCount(); i++) {
 if (i  1) {
 System.out.print(,);
 }
System.out.print(String.format((%s), rs.getObject(i)));
.

java.sql.SQLException: Unrecognized column type: timestamp
at org.apache.hadoop.hive.jdbc.Utils.hiveTypeToSqlType(Utils.java:56)
at org.apache.hadoop.hive.jdbc.JdbcColumn.getSqlType(JdbcColumn.java:62)
at 
org.apache.hadoop.hive.jdbc.HiveDatabaseMetaData$2.next(HiveDatabaseMetaData.java:244)
at bug.main(bug.java:232)


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3239) exception will be thrown for timestamp column in resultset from getColumns metadata method

2012-07-06 Thread N Campbell (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-3239:
-

  Component/s: JDBC
Affects Version/s: 0.8.1

 exception will be thrown for timestamp column in resultset from getColumns 
 metadata method
 --

 Key: HIVE-3239
 URL: https://issues.apache.org/jira/browse/HIVE-3239
 Project: Hive
  Issue Type: Bug
  Components: JDBC
Affects Versions: 0.8.1
Reporter: N Campbell

 create table cert.tts ( x int, y timestamp )
 describe the table column metadata using
 rs = meta.getColumns(null, cert, tts, %);
 ..
 while (rs.next()) {
   for (int i = 1; i = rsmd.getColumnCount(); i++) {
if (i  1) {
System.out.print(,);
}
   System.out.print(String.format((%s), rs.getObject(i)));
 .
 java.sql.SQLException: Unrecognized column type: timestamp
   at org.apache.hadoop.hive.jdbc.Utils.hiveTypeToSqlType(Utils.java:56)
   at org.apache.hadoop.hive.jdbc.JdbcColumn.getSqlType(JdbcColumn.java:62)
   at 
 org.apache.hadoop.hive.jdbc.HiveDatabaseMetaData$2.next(HiveDatabaseMetaData.java:244)
   at bug.main(bug.java:232)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-3216) DateDiff UDF overload with a unit parameter


[ 
https://issues.apache.org/jira/browse/HIVE-3216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13407002#comment-13407002
 ] 

N Campbell commented on HIVE-3216:
--

Note, ISO-SQL does not have DateDiff rather timestamp - timestamp returns 
an interval type where you can say ( ... ) interval month etc to control what 
interval type you want. The DateDiff like function if semantically equivalent 
would be a syntactic short form of the equivalent intent however the return 
type would be a numeric type and not an interval.


 DateDiff UDF overload with a unit parameter
 ---

 Key: HIVE-3216
 URL: https://issues.apache.org/jira/browse/HIVE-3216
 Project: Hive
  Issue Type: Improvement
  Components: SQL, UDF
Affects Versions: 0.10.0, 0.9.1
Reporter: Shefali Vohra
Priority: Minor
  Labels: date, patch, sql, timestamp, udf
 Fix For: 0.9.1


 *Parameters*
 This function overloads the current DateDiff(expr1, expr2) by adding another 
 parameter to specify the units. It takes 3 parameters. The first two are 
 timestamps, and the formats accepted are:
 -MM-dd
 -MM-dd HH:mm:ss
 -MM-dd HH:mm:ss.milli
 These are the formats accepted by the current DateDiff(expr1, expr2) function 
 and allow for that consistency. The accepted data types for the timestamp 
 will be Text, TimestampWritable, Date, and String, just as with the already 
 existing function.
 The third parameter is the units the user wants the response to be in. 
 Acceptable units are:
 Microsecond
 Millisecond
 Second
 Minute
 Hour
 Day
 Week
 Month
 Quarter
 Year
 When calculating the difference, the full timestamp is used when the 
 specified unit is hour or smaller (microsecond, millisecond, second, minute, 
 hour), and only the date part is used if the unit is day or larger (day, 
 week, month, quarter, year). If for the smaller units the time is not 
 specified and the format -MM-dd is used, the time 00:00:00.0 is used. 
 Leap years are accounted for by the Calendar class in Java, which inherently 
 addresses the issue.
 The assumption is made that all these time parameters are in the same time 
 zone.
 *Return Value*
 The function returns expr1 -  expr2 expressed as an int in the units 
 specified.
 *Hive vs. SQL*
 SQL also has a DateDiff() function with some more acceptable units. The order 
 of parameters is different between SQL and Hive. The reason for this is that 
 Hive already has a DateDiff() function with the same first two parameters, 
 and having this order here allows for that consistency within Hive.
 *Example Query*
 hive  DATEDIFF(DATE_FIELD, '2012-06-01', ‘day’); 
 *Diagnostic Error Messages*
 Invalid table alias or column name reference
 Table not found

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3228) unable to load null values that represent a timestamp value

N Campbell created HIVE-3228:


 Summary: unable to load null values that represent a timestamp 
value
 Key: HIVE-3228
 URL: https://issues.apache.org/jira/browse/HIVE-3228
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: N Campbell


Attempting to load delimited data into a table with one or more timestamp 
columns will fail when null values are represented in the input set.


load data local inpath 'CERT.TTS.txt'
overwrite into table CERT.TTS_E;
insert overwrite table CERT.TTS  select * from CERT.TTS_E;

Error: Query returned non-zero code: 9, cause: FAILED: Execution Error, return 
code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask
SQLState:  08S01
ErrorCode: 9

create table if not exists CERT.TTS_E ( RNUM int , CTS timestamp)
row format delimited
fields terminated by '\t'
stored as textfile;


create table if not exists CERT.TTS ( RNUM int , CTS timestamp) 
stored as sequencefile;



0   
1   1996-01-01 00:00:00.0
2   1996-01-01 12:00:00.0
3   1996-01-01 23:59:30.12300
4   2000-01-01 00:00:00.0
5   2000-01-01 12:00:00.0
6   2000-01-01 23:59:30.12300
7   2000-12-31 00:00:00.0
8   2000-12-31 12:00:00.0
9   2000-12-31 12:15:30.12300


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3228) unable to load null values that represent a timestamp value


 [ 
https://issues.apache.org/jira/browse/HIVE-3228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-3228:
-

Attachment: CERT.TTS.txt

input data file using tab delimited columns

 unable to load null values that represent a timestamp value
 ---

 Key: HIVE-3228
 URL: https://issues.apache.org/jira/browse/HIVE-3228
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: N Campbell
 Attachments: CERT.TTS.txt


 Attempting to load delimited data into a table with one or more timestamp 
 columns will fail when null values are represented in the input set.
 load data local inpath 'CERT.TTS.txt'
 overwrite into table CERT.TTS_E;
 insert overwrite table CERT.TTS  select * from CERT.TTS_E;
 Error: Query returned non-zero code: 9, cause: FAILED: Execution Error, 
 return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask
 SQLState:  08S01
 ErrorCode: 9
 create table if not exists CERT.TTS_E ( RNUM int , CTS timestamp)
 row format delimited
 fields terminated by '\t'
 stored as textfile;
 create table if not exists CERT.TTS ( RNUM int , CTS timestamp) 
 stored as sequencefile;
 0 
 1 1996-01-01 00:00:00.0
 2 1996-01-01 12:00:00.0
 3 1996-01-01 23:59:30.12300
 4 2000-01-01 00:00:00.0
 5 2000-01-01 12:00:00.0
 6 2000-01-01 23:59:30.12300
 7 2000-12-31 00:00:00.0
 8 2000-12-31 12:00:00.0
 9 2000-12-31 12:15:30.12300

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3229) null values being loaded as non-null values into Hive

N Campbell created HIVE-3229:


 Summary: null values being loaded as non-null values into Hive
 Key: HIVE-3229
 URL: https://issues.apache.org/jira/browse/HIVE-3229
 Project: Hive
  Issue Type: Bug
Reporter: N Campbell
 Attachments: CERT.TSET1.txt

various tab delimited input files contain one or more columns that represent 
null values in rows. the data appears to load (without an error such as in JIRA 
3228) however the resulting values are now non-null values which is incorrect.



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3229) null values being loaded as non-null values into Hive


 [ 
https://issues.apache.org/jira/browse/HIVE-3229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-3229:
-

Attachment: CERT.TSET1.txt

One example

 null values being loaded as non-null values into Hive
 -

 Key: HIVE-3229
 URL: https://issues.apache.org/jira/browse/HIVE-3229
 Project: Hive
  Issue Type: Bug
Reporter: N Campbell
 Attachments: CERT.TSET1.txt


 various tab delimited input files contain one or more columns that represent 
 null values in rows. the data appears to load (without an error such as in 
 JIRA 3228) however the resulting values are now non-null values which is 
 incorrect.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-3229) null values being loaded as non-null values into Hive


 [ 
https://issues.apache.org/jira/browse/HIVE-3229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

N Campbell updated HIVE-3229:
-

Description: 
various tab delimited input files contain one or more columns that represent 
null values in rows. the data appears to load (without an error such as in JIRA 
3228) however the resulting values are now non-null values which is incorrect.

create table if not exists CERT.TSET1_E ( RNUM int , C1 int, C2 string)
row format delimited
fields terminated by '\t'
stored as textfile;

create table if not exists CERT.TSET1 ( RNUM int , C1 int, C2 string)
stored as sequencefile;


load data local inpath 'CERT.TSET1.txt'
overwrite into table CERT.TSET1_E;
insert overwrite table CERT.TSET1  select * from CERT.TSET1_E;

  was:
various tab delimited input files contain one or more columns that represent 
null values in rows. the data appears to load (without an error such as in JIRA 
3228) however the resulting values are now non-null values which is incorrect.




 null values being loaded as non-null values into Hive
 -

 Key: HIVE-3229
 URL: https://issues.apache.org/jira/browse/HIVE-3229
 Project: Hive
  Issue Type: Bug
Reporter: N Campbell
 Attachments: CERT.TSET1.txt


 various tab delimited input files contain one or more columns that represent 
 null values in rows. the data appears to load (without an error such as in 
 JIRA 3228) however the resulting values are now non-null values which is 
 incorrect.
 create table if not exists CERT.TSET1_E ( RNUM int , C1 int, C2 string)
 row format delimited
 fields terminated by '\t'
 stored as textfile;
 create table if not exists CERT.TSET1 ( RNUM int , C1 int, C2 string)
 stored as sequencefile;
 load data local inpath 'CERT.TSET1.txt'
 overwrite into table CERT.TSET1_E;
 insert overwrite table CERT.TSET1  select * from CERT.TSET1_E;

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-2703) ResultSetMetaData.getColumnType() always returns VARCHAR(string) for partition columns irrespective of partition column type

2012-06-28 Thread N Campbell (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-2703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13403048#comment-13403048
 ] 

N Campbell commented on HIVE-2703:
--

Is this fix (or any other) addressing similar issues with respect to map, 
array, struct... where the type is always returned and described a string type?


 ResultSetMetaData.getColumnType() always returns VARCHAR(string) for 
 partition columns irrespective of partition column type
 

 Key: HIVE-2703
 URL: https://issues.apache.org/jira/browse/HIVE-2703
 Project: Hive
  Issue Type: Bug
  Components: JDBC
Affects Versions: 0.8.0
Reporter: Mythili Gopalakrishnan
Assignee: tamtam180
Priority: Critical
 Attachments: HIVE-2703.D2829.1.patch


 ResultSetMetaData.getColumnType() always returns VARCHAR(string) as column 
 type, no matter what the column type is for the partition column.
 However DatabaseMetadata.getColumnType() returns correct type. 
 Create a table with a partition column having a type other than string, you 
 will see that ResultSet.getColumnType() always returns string as the type for 
 int or boolean or float columns...

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3196) date_add of a timestamp returning a string representation of the date portion

2012-06-25 Thread N Campbell (JIRA)

N Campbell created HIVE-3196:


 Summary: date_add of a timestamp returning a string representation 
of the date portion
 Key: HIVE-3196
 URL: https://issues.apache.org/jira/browse/HIVE-3196
 Project: Hive
  Issue Type: Bug
  Components: SQL
Affects Versions: 0.8.0
Reporter: N Campbell


the following expressions are expected to derive a new timestamp where the 
input has been incremented by n days. The resulting value is a string 
representation of the date portion (time component truncated)

select date_add(cast(from_unixtime(unix_timestamp()) as timestamp), 5) from 
cert.tversion

select date_add(from_unixtime(unix_timestamp()), 5) from cert.tversion

the apache docs do not clarify if the is an implicit type conversion to date or 
the docs need updating to day they will support a timestamp type as an input 
(which I would expect given the claim to offer timestamp types).

https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3183) case expression should allow different types per ISO-SQL 2012

N Campbell created HIVE-3183:


 Summary: case expression should allow different types per ISO-SQL 
2012
 Key: HIVE-3183
 URL: https://issues.apache.org/jira/browse/HIVE-3183
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: N Campbell


The ISO-SQL standard specification for CASE allows the specification to include 
different types in the WHEN and ELSE blocks including this example which mixes 
smallint and integer types

select case when vsint.csint is not null then vsint.csint else 1 end from 
cert.vsint vsint 

The Apache Hive docs do not state how it deviates from the standard or any 
given restrictions so unsure if this is a bug vs an enhancement. Many SQL 
applications mix so this seems to be a restrictive implementation if this is by 
design.

Argument type mismatch '1': The expression after ELSE should have the same type 
as those after THEN: smallint is expected but int is found

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3184) remove existing restriction that union must be defined within a derived table (follow ISO-SQL 2012)

N Campbell created HIVE-3184:


 Summary: remove existing restriction that union must be defined 
within a derived table (follow ISO-SQL 2012)
 Key: HIVE-3184
 URL: https://issues.apache.org/jira/browse/HIVE-3184
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.8.0
Reporter: N Campbell


Don't make SQL applications have to wrap the union operation within a derived 
table per the SQL standard. Failing to do so results in Top level UNION is not 
supported currently; use a subquery

i.e.

select c1, c2 from cert.tset1 tset1 union all select c1, c2 from cert.tset2 
tset2


vs 

select * from ( select c1, c2 from cert.tset1 tset1 union all select c1, c2 
from cert.tset2 tset2 ) T 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3185) allow union set operation without ALL keyword (align to ISO-SQL 2012 specification)

N Campbell created HIVE-3185:


 Summary: allow union set operation without ALL keyword (align to 
ISO-SQL 2012 specification)
 Key: HIVE-3185
 URL: https://issues.apache.org/jira/browse/HIVE-3185
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.8.0
Reporter: N Campbell


Allow a union operation to be specified as qb1 union qb2 in addition to th 
qb1 union all qb2

Common requirement of queries to project two sets where it is not always the 
case you want distinct rows in the set. Failing to do so obviously will result 
in the current exception:
mismatched input 'select' expecting ALL near 'union' in query operator

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3186) support having without a grouping operation (per ISO-SQL 2012)

N Campbell created HIVE-3186:


 Summary: support having without a grouping operation (per ISO-SQL 
2012)
 Key: HIVE-3186
 URL: https://issues.apache.org/jira/browse/HIVE-3186
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.8.0
Reporter: N Campbell


select count(*) from cert.tset1 tset1 having count(*)  2

granted you can manually write a grouped derived table and filter etc but it 
valid to define a having statement (implicit grouped table).

HAVING specified without GROUP BY

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3187) support ISO-2012 timestamp literals

N Campbell created HIVE-3187:


 Summary: support ISO-2012 timestamp literals
 Key: HIVE-3187
 URL: https://issues.apache.org/jira/browse/HIVE-3187
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.8.0
Reporter: N Campbell


Enable the JDBC driver/Hive SQL engine to accept JDBC canonical or ISO-SQL 20xx 
Timestamp literals

ie.
select 1 from cert.tversion tversion where timestamp '1989-01-01 
10:20:30.0'  timestamp '2000-12-31 12:15:30.12300'

instead of
unix_timestamp('.)


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3188) allow cast to use double precision per ISO-SQL 2012

N Campbell created HIVE-3188:


 Summary: allow cast to use double precision per ISO-SQL 2012
 Key: HIVE-3188
 URL: https://issues.apache.org/jira/browse/HIVE-3188
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.8.0
Reporter: N Campbell


While ISO-SQL allows cast('10.3' as double precision) Hive forces you to say 
 cast('10.3' as double ). Accept the additional keyword.



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3189) cast ( string type as bigint) returning null values

N Campbell created HIVE-3189:


 Summary: cast ( string type as bigint) returning null values
 Key: HIVE-3189
 URL: https://issues.apache.org/jira/browse/HIVE-3189
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: N Campbell


select rnum, c1, cast(c1 as bigint) from cert.tsdchar tsdchar where rnum in 
(0,1,2)

create table if not exists CERT.TSDCHAR ( RNUM int , C1 string)
row format sequencefile

rnumc1  _c2
0   -1  null
1   0   null
2   10  null


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3190) allow INTEGER as a type name in a column/cast expression (per ISO-SQL 2012)

N Campbell created HIVE-3190:


 Summary: allow INTEGER as a type name in a column/cast expression 
(per ISO-SQL 2012)
 Key: HIVE-3190
 URL: https://issues.apache.org/jira/browse/HIVE-3190
 Project: Hive
  Issue Type: Improvement
Affects Versions: 0.8.0
Reporter: N Campbell


Just extend the parser to allow INTEGER instead of making folks use INT


select cast('10' as integer) from cert.tversion tversion

FAILED: Parse Error: line 1:20 cannot recognize input near 'integer' ')' 'from' 
in primitive type specification


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3192) between predicate parsing issues

N Campbell created HIVE-3192:


 Summary: between predicate parsing issues
 Key: HIVE-3192
 URL: https://issues.apache.org/jira/browse/HIVE-3192
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.0
Reporter: N Campbell


These SQL statements will not parse.

select tbint.rnum,
case  
when tbint.cbint between 6 and 8  
then 'between'  
else 'other' 
end 
from cert.tbint tbint 
Error: Query returned non-zero code: 11, cause: FAILED: Parse Error: line 3:17 
mismatched input 'between' expecting KW_THEN near 'cbint' in case expression

SQLState:  42000
ErrorCode: 11   

select tbint.rnum 
from cert.tbint tbint 
where tbint.cbint between 6 and 8 
Error: Query returned non-zero code: 11, cause: FAILED: Parse Error: line 3:18 
mismatched input 'between' expecting EOF near 'cbint'

SQLState:  42000
ErrorCode: 11

create table if not exists CERT.TBINT ( RNUM int , CBINT bigint)
stored as sequencefile;

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-3193) cannot query views using qualified table references