[jira] [Created] (HIVE-25220) Query with union fails CBO with OOM

2021-06-08 Thread Krisztian Kasa (Jira)
Krisztian Kasa created HIVE-25220:
-

 Summary: Query with union fails CBO with OOM
 Key: HIVE-25220
 URL: https://issues.apache.org/jira/browse/HIVE-25220
 Project: Hive
  Issue Type: Bug
  Components: CBO
Reporter: Krisztian Kasa
Assignee: Krisztian Kasa
 Fix For: 4.0.0






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HIVE-25219) Backward incompatible timestamp serialization in Avro for certain timezones

2021-06-08 Thread Stamatis Zampetakis (Jira)
Stamatis Zampetakis created HIVE-25219:
--

 Summary: Backward incompatible timestamp serialization in Avro for certain timezones
 Key: HIVE-25219
 URL: https://issues.apache.org/jira/browse/HIVE-25219
 Project: Hive
  Issue Type: Bug
  Components: Serializers/Deserializers
Affects Versions: 3.1.0
Reporter: Stamatis Zampetakis
Assignee: Stamatis Zampetakis
 Fix For: 4.0.0


HIVE-12192 and HIVE-20007 changed the way that timestamp computations are 
performed and, to some extent, how timestamps are serialized and deserialized in 
files (Parquet, Avro).

In versions that include HIVE-12192 or HIVE-20007, the serialization in Avro 
files is not backward compatible. In other words, writing timestamps with a 
version of Hive that includes HIVE-12192/HIVE-20007 and reading them with 
a version that includes neither change may lead to different results, 
depending on the default timezone of the system.

Consider the following scenario where the default system timezone is set to 
US/Pacific.

At apache/master commit eedcd82bc2d61861a27205f925ba0ffab9b6bca8
{code:sql}
CREATE EXTERNAL TABLE employee(eid INT,birth timestamp) STORED AS AVRO
 LOCATION '/tmp/hiveexttbl/employee';
INSERT INTO employee VALUES (1, '1880-01-01 00:00:00');
INSERT INTO employee VALUES (2, '1884-01-01 00:00:00');
INSERT INTO employee VALUES (3, '1990-01-01 00:00:00');
SELECT * FROM employee;
{code}
|1|1880-01-01 00:00:00|
|2|1884-01-01 00:00:00|
|3|1990-01-01 00:00:00|

At apache/branch-2.3 commit 324f9faf12d4b91a9359391810cb3312c004d356
{code:sql}
CREATE EXTERNAL TABLE employee(eid INT,birth timestamp) STORED AS AVRO
 LOCATION '/tmp/hiveexttbl/employee';
SELECT * FROM employee;
{code}
|1|1879-12-31 23:52:58|
|2|1884-01-01 00:00:00|
|3|1990-01-01 00:00:00|

The timestamp for {{eid=1}} in branch-2.3 is different from the one in master.
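
The 7 minute 2 second shift matches the gap between the two UTC offsets that the IANA tz database records for US/Pacific (America/Los_Angeles): local mean time, UTC-07:52:58, before 18 November 1883, and UTC-08:00 afterwards. The sketch below is only an illustration of that arithmetic, not Hive code. It assumes, without confirmation from this report, that the writer encodes the 1880 value with the local mean time offset while the reader decodes it with a flat UTC-08:00; {{write_then_read}} is a hypothetical helper.

```python
from datetime import datetime, timedelta, timezone

# IANA tz database: America/Los_Angeles (alias US/Pacific) used local mean time,
# UTC-07:52:58, until 1883-11-18 and UTC-08:00 standard time afterwards.
LMT = timezone(-timedelta(hours=7, minutes=52, seconds=58))
PST = timezone(-timedelta(hours=8))

FMT = "%Y-%m-%d %H:%M:%S"

def write_then_read(local: str, writer_tz: timezone, reader_tz: timezone) -> str:
    """Hypothetical helper: encode a wall-clock string as a UTC instant using
    writer_tz, then decode that instant back to a wall-clock string in reader_tz."""
    wall = datetime.strptime(local, FMT)
    instant = wall.replace(tzinfo=writer_tz).astimezone(timezone.utc)
    return instant.astimezone(reader_tz).strftime(FMT)

# ASSUMPTION: the writer applies LMT for 1880, the reader applies a flat -08:00.
print(write_then_read("1880-01-01 00:00:00", LMT, PST))  # 1879-12-31 23:52:58

# From 1884 on, both sides use -08:00 in winter, so the value round-trips.
print(write_then_read("1884-01-01 00:00:00", PST, PST))  # 1884-01-01 00:00:00
```

Under that assumption, only values before the 1883 switch to standard time are affected, which is consistent with {{eid=2}} and {{eid=3}} reading back unchanged.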





[jira] [Created] (HIVE-25218) Add a replication migration tool for external tables

2021-06-08 Thread Ayush Saxena (Jira)
Ayush Saxena created HIVE-25218:
---

 Summary: Add a replication migration tool for external tables
 Key: HIVE-25218
 URL: https://issues.apache.org/jira/browse/HIVE-25218
 Project: Hive
  Issue Type: Improvement
Reporter: Ayush Saxena
Assignee: Ayush Saxena


Add a tool that can confirm the migration of external tables after replication 
from one cluster to another.





[VOTE] Should we release Hive Storage API 2.7.3-rc0 ?

2021-06-08 Thread Panos Garefalakis
Hello all,

I would like to propose a new storage-api release that includes HIVE-24458
(as discussed in a previous thread).

Shall we release the following artifacts as Hive Storage API 2.7.3?

tar: http://home.apache.org/~pgaref/hive-storage-2.7.3/
tag: https://github.com/apache/hive/releases/tag/storage-release-2.7.3-rc0
jiras: https://issues.apache.org/jira/projects/HIVE/versions/12350287

Cheers,
Panagiotis


[jira] [Created] (HIVE-25217) Move isEligibleForCompaction evaluation under the Initiator thread pool

2021-06-08 Thread Denys Kuzmenko (Jira)
Denys Kuzmenko created HIVE-25217:
-

 Summary: Move isEligibleForCompaction evaluation under the Initiator thread pool
 Key: HIVE-25217
 URL: https://issues.apache.org/jira/browse/HIVE-25217
 Project: Hive
  Issue Type: Bug
Reporter: Denys Kuzmenko


Checking eligibility for more than 1 million distinct table/partition 
combinations can take the Initiator a long time, since all steps are performed 
in the main thread.
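
As a rough illustration of the proposed direction (not the Initiator's actual code), the per-candidate check can be submitted to a thread pool so that the calling thread does not evaluate every candidate itself. The names {{is_eligible}} and {{eligible_candidates}}, the candidate strings, and the pool size are all hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor

def is_eligible(candidate: str) -> bool:
    """Hypothetical stand-in for the per-candidate eligibility check."""
    return not candidate.endswith("/skip")

def eligible_candidates(candidates, workers=4):
    # Run the checks on pool threads instead of the calling thread.
    # map() returns results in the same order as the input.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        flags = list(pool.map(is_eligible, candidates))
    return [c for c, ok in zip(candidates, flags) if ok]

print(eligible_candidates(["db.t1/p=1", "db.t1/p=2/skip", "db.t2"]))
```

A pool helps most when each check waits on I/O, such as a metastore lookup; the pool size would need tuning against the real workload.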





[jira] [Created] (HIVE-25216) Vectorized reading of ORC tables via Iceberg

2021-06-08 Thread Ádám Szita (Jira)
Ádám Szita created HIVE-25216:
-

 Summary: Vectorized reading of ORC tables via Iceberg
 Key: HIVE-25216
 URL: https://issues.apache.org/jira/browse/HIVE-25216
 Project: Hive
  Issue Type: Improvement
Reporter: Ádám Szita
Assignee: Ádám Szita


Now that [https://github.com/apache/iceberg/pull/2613] is resolved, we should 
port it to the Hive codebase to enable vectorized ORC reads on Iceberg-backed 
tables.


