[jira] [Created] (HIVE-24694) Early connection close to release server resources during creating

2021-01-27 Thread Zhihua Deng (Jira)
Zhihua Deng created HIVE-24694:
--

 Summary: Early connection close to release server resources during 
creating
 Key: HIVE-24694
 URL: https://issues.apache.org/jira/browse/HIVE-24694
 Project: Hive
  Issue Type: Bug
  Components: HiveServer2
Reporter: Zhihua Deng
Assignee: Zhihua Deng


If an exception is thrown while we try to get the connection from HiveDriver, the 
opened transport or session may be left unclosed. Because the connection returned 
is null, we cannot call the close method to release the server 
resources (threads/connection quota). This makes things worse if the 
user reaches the connection limit: subsequent calls to get a connection 
will fail until HS2 is restarted.
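
The failure mode described above can be illustrated with a minimal sketch. It does not use the real Hive JDBC classes: {{FakeTransport}} and {{SketchConnection}} are hypothetical stand-ins, and the simulated session failure is an assumption made only to show one way a constructor could release what it has already opened before rethrowing.

```java
public class EarlyCloseSketch {
    /** Hypothetical stand-in for a client transport; records whether it was closed. */
    static class FakeTransport implements AutoCloseable {
        boolean closed = false;

        @Override
        public void close() {
            closed = true;
        }
    }

    /** Hypothetical connection whose constructor can fail after the transport is open. */
    static final class SketchConnection implements AutoCloseable {
        private final FakeTransport transport;

        SketchConnection(FakeTransport transport, boolean failOnSessionOpen) {
            this.transport = transport;
            try {
                openSession(failOnSessionOpen);
            } catch (RuntimeException e) {
                // The caller never receives a reference to this object, so it
                // cannot call close(); release what was already opened here.
                close();
                throw e;
            }
        }

        private void openSession(boolean fail) {
            if (fail) {
                throw new IllegalStateException("simulated session failure");
            }
        }

        @Override
        public void close() {
            transport.close();
        }
    }

    public static void main(String[] args) {
        FakeTransport t = new FakeTransport();
        try {
            new SketchConnection(t, true);
        } catch (IllegalStateException expected) {
            System.out.println("transport closed after failure: " + t.closed);
        }
    }
}
```

Closing inside the constructor's catch block is only one possible design; the point is that cleanup has to happen before the exception leaves the constructor, since no connection object is ever returned.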



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-24692) Max number of table hive-metastore can handle

2021-01-27 Thread Izek Greenfield (Jira)
Izek Greenfield created HIVE-24692:
--

 Summary: Max number of table hive-metastore can handle
 Key: HIVE-24692
 URL: https://issues.apache.org/jira/browse/HIVE-24692
 Project: Hive
  Issue Type: Wish
  Components: Hive
Reporter: Izek Greenfield


What is the maximum/recommended number of tables the Hive metastore can handle?





[jira] [Created] (HIVE-24693) Parquet Timestamp Values Read/Write Very Slow

2021-01-27 Thread David Mollitor (Jira)
David Mollitor created HIVE-24693:
-

 Summary: Parquet Timestamp Values Read/Write Very Slow
 Key: HIVE-24693
 URL: https://issues.apache.org/jira/browse/HIVE-24693
 Project: Hive
  Issue Type: Improvement
Reporter: David Mollitor
Assignee: David Mollitor


Parquet {{DataWriteableWriter}} relies on {{NanoTimeUtils}} to convert a 
timestamp object into a binary value.  It does this by calling 
{{toString()}} on the timestamp object and then parsing the String.  This 
particular timestamp does not carry a time zone, so the string is something like:

{{2021-21-03 12:32:23....}}

The parse code tries to parse the string assuming there is a time zone and, if 
there is not, falls back and applies the provided "default time zone".  As was noted in 
[HIVE-24353], if something fails to parse, it is very expensive to try to parse 
again.  So, for each timestamp in the Parquet file, it:

* Builds a string from the time stamp
* Parses it (throws an exception, parses again)

There is no need for this kind of string manipulation/parsing; it should 
just use the epoch millis/seconds/time stored internally in the Timestamp 
object.

{code:java}
  // Converts Timestamp to TimestampTZ.
  public static TimestampTZ convert(Timestamp ts, ZoneId defaultTimeZone) {
    return parse(ts.toString(), defaultTimeZone);
  }
{code}
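
As a rough illustration of the difference, the sketch below contrasts the format-then-parse path with a direct conversion. It uses only {{java.time}} types as stand-ins for Hive's {{Timestamp}} and {{TimestampTZ}}, on the assumption that the timestamp's fields are available as a {{LocalDateTime}}; the class name, method names, and formatter patterns are hypothetical and not taken from Hive.

```java
import java.time.LocalDateTime;
import java.time.ZoneId;
import java.time.ZonedDateTime;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeParseException;

public class TimestampConvertSketch {
    // Pattern that expects a zone ID suffix, e.g. "2021-01-27 12:32:23 UTC".
    private static final DateTimeFormatter WITH_ZONE =
        DateTimeFormatter.ofPattern("uuuu-MM-dd HH:mm:ss VV");
    // Pattern for a timestamp string that carries no zone.
    private static final DateTimeFormatter WITHOUT_ZONE =
        DateTimeFormatter.ofPattern("uuuu-MM-dd HH:mm:ss");

    /** Slow path: build a string, attempt a zoned parse (throws), then parse again. */
    static ZonedDateTime viaString(LocalDateTime ts, ZoneId defaultZone) {
        String s = ts.format(WITHOUT_ZONE);
        try {
            return ZonedDateTime.parse(s, WITH_ZONE);
        } catch (DateTimeParseException e) {
            // No zone in the text: fall back to the default zone.
            return LocalDateTime.parse(s, WITHOUT_ZONE).atZone(defaultZone);
        }
    }

    /** Direct path: attach the default zone to the fields already held. */
    static ZonedDateTime direct(LocalDateTime ts, ZoneId defaultZone) {
        return ts.atZone(defaultZone);
    }

    public static void main(String[] args) {
        LocalDateTime ts = LocalDateTime.of(2021, 1, 27, 12, 32, 23);
        ZoneId zone = ZoneId.of("UTC");
        // Both paths yield the same value; only the direct one avoids the exception.
        System.out.println(viaString(ts, zone).equals(direct(ts, zone)));
    }
}
```

The slow path here drops fractional seconds because of the chosen pattern, so the two methods agree only for whole-second values; that limitation belongs to the sketch, not to Hive.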





[jira] [Created] (HIVE-24691) Ban commons-logging (again)

2021-01-27 Thread Zoltan Matyus (Jira)
Zoltan Matyus created HIVE-24691:


 Summary: Ban commons-logging (again)
 Key: HIVE-24691
 URL: https://issues.apache.org/jira/browse/HIVE-24691
 Project: Hive
  Issue Type: Bug
  Components: Logging
Affects Versions: 4.0.0
Reporter: Zoltan Matyus
Assignee: Zoltan Matyus


The usage of commons-logging was completely removed from Hive once, in 
HIVE-20019. However, new usage has been added since, despite attempts to ban 
it (bannedDependencies). I'm removing all usage again and adding another way to 
ban it (restrictImports).





[jira] [Created] (HIVE-24690) GlobalLimitOptimizer Fails To Identify Some Queries With LIMIT Operator

2021-01-27 Thread Syed Shameerur Rahman (Jira)
Syed Shameerur Rahman created HIVE-24690:


 Summary: GlobalLimitOptimizer Fails To Identify Some Queries With 
LIMIT Operator
 Key: HIVE-24690
 URL: https://issues.apache.org/jira/browse/HIVE-24690
 Project: Hive
  Issue Type: Bug
  Components: Query Planning
Affects Versions: 3.1.0, 2.1.0, 1.1.0
Reporter: Syed Shameerur Rahman
Assignee: Syed Shameerur Rahman


As per 
[https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/optimizer/GlobalLimitOptimizer.java#L88]
 queries like
{code:java}
CREATE TABLE ... AS SELECT col1, col2 FROM tbl LIMIT ..
INSERT OVERWRITE TABLE ... SELECT col1, hash(col2), split(col1) FROM ... LIMIT...
{code}
fall under the qualified list, but after HIVE-9444 they no longer do.

On investigating this issue, it was found that for
{code:java}
CREATE TABLE ... AS SELECT col1, col2 FROM tbl LIMIT 
{code}
query the operator tree looks like *TS -> SEL -> LIM -> RS -> SEL -> LIM -> FS*

Since only one LIMIT operator is allowed as per 
https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/optimizer/GlobalLimitOptimizer.java#L196
, the *GlobalLimitOptimizer* fails to identify such queries.
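
To make the mismatch concrete, here is a small sketch that counts LIMIT operators in the chain quoted above. It assumes the check at the linked line accepts a plan only when exactly one LIMIT operator is present; the list-of-strings representation and the method names are hypothetical and far simpler than the operator tree the real optimizer walks.

```java
import java.util.Arrays;
import java.util.List;

public class LimitCountSketch {
    /** Counts "LIM" entries in a linear operator chain (hypothetical representation). */
    static long countLimits(List<String> chain) {
        return chain.stream().filter("LIM"::equals).count();
    }

    /** Assumed rule: the plan qualifies only when exactly one LIMIT is present. */
    static boolean qualifiesForGlobalLimit(List<String> chain) {
        return countLimits(chain) == 1;
    }

    public static void main(String[] args) {
        // Operator chain reported for the CTAS query in this issue.
        List<String> ctas = Arrays.asList("TS", "SEL", "LIM", "RS", "SEL", "LIM", "FS");
        System.out.println(countLimits(ctas));             // prints 2
        System.out.println(qualifiesForGlobalLimit(ctas)); // prints false
    }
}
```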

*Steps To Reproduce*

{code:java}
set hive.limit.optimize.enable=true;
create table t1 (a int);
create table t2 as select * from t1 LIMIT 10;
{code}








[jira] [Created] (HIVE-24689) Changing file format (from ORC) is not supported for table xxx

2021-01-27 Thread yanhw (Jira)
yanhw created HIVE-24689:


 Summary: Changing file format (from ORC) is not supported for 
table xxx
 Key: HIVE-24689
 URL: https://issues.apache.org/jira/browse/HIVE-24689
 Project: Hive
  Issue Type: Bug
Affects Versions: 3.1.2
Reporter: yanhw


use test;
drop table if exists test_lzo_partition;

alter table test_lzo_partition
 set FILEFORMAT INPUTFORMAT 'com.hadoop.mapred.DeprecatedLzoTextInputFormat'
 OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
 SERDE 'org.openx.data.jsonserde.JsonSerDe' ;

alter table test_lzo_partition add if not exists partition (t=2);
alter table test_lzo_partition PARTITION (t=2)
 set FILEFORMAT INPUTFORMAT 'com.hadoop.mapred.DeprecatedLzoTextInputFormat'
 OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
 SERDE 'org.openx.data.jsonserde.JsonSerDe' ;

It will return:

FAILED: Execution Error, return code 1 from 
org.apache.hadoop.hive.ql.exec.DDLTask. Changing file format (from ORC) is not 
supported for table test.test_lzo_partition





[jira] [Created] (HIVE-24688) Optimise ObjectInspectorUtils.copyToStandardObject

2021-01-27 Thread Jira
László Bodor created HIVE-24688:
---

 Summary: Optimise ObjectInspectorUtils.copyToStandardObject
 Key: HIVE-24688
 URL: https://issues.apache.org/jira/browse/HIVE-24688
 Project: Hive
  Issue Type: Improvement
Reporter: László Bodor





