[jira] [Resolved] (HIVE-25763) HIVE IS BAD

2021-12-01 Thread Pravin Pawar (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Pawar resolved HIVE-25763.
-
Release Note: OK
  Resolution: Fixed

> HIVE IS BAD
> ---
>
> Key: HIVE-25763
> URL: https://issues.apache.org/jira/browse/HIVE-25763
> Project: Hive
>  Issue Type: Bug
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Critical
>
> BAD



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (HIVE-25763) HIVE IS BAD

2021-12-01 Thread Pravin Pawar (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452180#comment-17452180
 ] 

Pravin Pawar commented on HIVE-25763:
-

OK

> HIVE IS BAD
> ---
>
> Key: HIVE-25763
> URL: https://issues.apache.org/jira/browse/HIVE-25763
> Project: Hive
>  Issue Type: Bug
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Critical
>
> BAD





[jira] [Work started] (HIVE-25763) HIVE IS BAD

2021-12-01 Thread Pravin Pawar (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-25763 started by Pravin Pawar.
---
> HIVE IS BAD
> ---
>
> Key: HIVE-25763
> URL: https://issues.apache.org/jira/browse/HIVE-25763
> Project: Hive
>  Issue Type: Bug
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Critical
>
> BAD





[jira] [Assigned] (HIVE-25763) HIVE IS BAD

2021-12-01 Thread Pravin Pawar (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Pawar reassigned HIVE-25763:
---

Assignee: Pravin Pawar

> HIVE IS BAD
> ---
>
> Key: HIVE-25763
> URL: https://issues.apache.org/jira/browse/HIVE-25763
> Project: Hive
>  Issue Type: Bug
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Critical
>
> BAD





[jira] [Work logged] (HIVE-25729) ThriftUnionObjectInspector should be notified when fully inited

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25729?focusedWorklogId=689042=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-689042
 ]

ASF GitHub Bot logged work on HIVE-25729:
-

Author: ASF GitHub Bot
Created on: 02/Dec/21 04:38
Start Date: 02/Dec/21 04:38
Worklog Time Spent: 10m 
  Work Description: dengzhhu653 commented on pull request #2808:
URL: https://github.com/apache/hive/pull/2808#issuecomment-984286215


   > Sorry @dengzhhu653 , was on vacation. Can you add some details on why this 
change is necessary? thanks.
   
   Thank you for the feedback. 
[HIVE-11586](https://issues.apache.org/jira/browse/HIVE-11586) solved the 
problem that an uninitialized instance of ReflectionStructObjectInspector could 
be returned when multiple threads try to get it; an NPE would be thrown if we 
referred to some fields of the inspector. To make sure that the 
ReflectionStructObjectInspector is fully inited, [other threads may wait for up 
to 3 
seconds](https://github.com/apache/hive/blob/master/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ObjectInspectorFactory.java#L106-L121).
  The 
[ThriftStructObjectInspector](https://github.com/apache/hive/blob/7b3ecf617a6d46f48a3b6f77e0339fd4ad95a420/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ReflectionStructObjectInspector.java#L165-L178)
 notifies the monitor when it finishes initialization, but 
[ThriftUnionObjectInspector](https://github.com/apache/hive/blob/master/serde/src/java/org/apache/hadoop/hive/serde2/objectinspector/ThriftUnionObjectInspector.java#L95-L104)
 does not do this in its own overridden `init` method.
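
   The wait/notify handshake described in this comment can be sketched in plain 
Java. This is a minimal, self-contained illustration and not Hive's code: 
`Inspector`, `awaitInit`, and `measure` are hypothetical names, and the 
`notifyWaiters` flag exists only to contrast an `init` that notifies the monitor 
with one that does not.

```java
import java.util.concurrent.TimeUnit;

final class InitNotifySketch {
    /** Hypothetical stand-in for an object inspector that is initialized by another thread. */
    static class Inspector {
        private boolean inited = false;

        synchronized boolean isFullyInited() {
            return inited;
        }

        /** Marks the inspector as initialized; wakes waiting threads only if asked to. */
        synchronized void init(boolean notifyWaiters) {
            inited = true;
            if (notifyWaiters) {
                notifyAll();
            }
        }
    }

    /** Blocks until the inspector is inited, re-checking after each timeout; returns elapsed ms. */
    static long awaitInit(Inspector soi, long timeoutMs) throws InterruptedException {
        long start = System.nanoTime();
        synchronized (soi) {
            while (!soi.isFullyInited()) {
                soi.wait(timeoutMs);
            }
        }
        return TimeUnit.NANOSECONDS.toMillis(System.nanoTime() - start);
    }

    /** Initializes the inspector after delayMs on another thread and measures how long a waiter blocks. */
    static long measure(boolean notifyWaiters, long delayMs, long timeoutMs) throws InterruptedException {
        Inspector soi = new Inspector();
        Thread initializer = new Thread(() -> {
            try {
                Thread.sleep(delayMs);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                return;
            }
            soi.init(notifyWaiters);
        });
        initializer.start();
        long waited = awaitInit(soi, timeoutMs);
        initializer.join();
        return waited;
    }
}
```

   With `notifyWaiters` set to true the waiter should return shortly after 
initialization finishes; with false it only notices the change when its current 
`wait` times out.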


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 689042)
Time Spent: 40m  (was: 0.5h)

> ThriftUnionObjectInspector should be notified when fully inited
> ---
>
> Key: HIVE-25729
> URL: https://issues.apache.org/jira/browse/HIVE-25729
> Project: Hive
>  Issue Type: Bug
>  Components: Serializers/Deserializers
>Reporter: Zhihua Deng
>Assignee: Zhihua Deng
>Priority: Minor
>  Labels: pull-request-available
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> For thread safety, a thread obtaining a ReflectionStructObjectInspector 
> instance waits, up to 3 seconds at a time, to ensure the returned 
> ObjectInspector is fully inited:
> {code:java}
> synchronized (soi) {
>   while (!soi.isFullyInited(checkedTypes)) {
>     soi.wait(3000);
>   }
> } {code}
> It seems that ThriftUnionObjectInspector does not notify the waiting threads 
> once it has been inited.





[jira] [Work logged] (HIVE-25729) ThriftUnionObjectInspector should be notified when fully inited

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25729?focusedWorklogId=689033=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-689033
 ]

ASF GitHub Bot logged work on HIVE-25729:
-

Author: ASF GitHub Bot
Created on: 02/Dec/21 03:48
Start Date: 02/Dec/21 03:48
Worklog Time Spent: 10m 
  Work Description: sunchao commented on pull request #2808:
URL: https://github.com/apache/hive/pull/2808#issuecomment-984267079


   Sorry @dengzhhu653 , was on vacation. Can you add some details on why this 
change is necessary? thanks.




Issue Time Tracking
---

Worklog Id: (was: 689033)
Time Spent: 0.5h  (was: 20m)

> ThriftUnionObjectInspector should be notified when fully inited
> ---
>
> Key: HIVE-25729
> URL: https://issues.apache.org/jira/browse/HIVE-25729
> Project: Hive
>  Issue Type: Bug
>  Components: Serializers/Deserializers
>Reporter: Zhihua Deng
>Assignee: Zhihua Deng
>Priority: Minor
>  Labels: pull-request-available
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> For thread safety, a thread obtaining a ReflectionStructObjectInspector 
> instance waits, up to 3 seconds at a time, to ensure the returned 
> ObjectInspector is fully inited:
> {code:java}
> synchronized (soi) {
>   while (!soi.isFullyInited(checkedTypes)) {
>     soi.wait(3000);
>   }
> } {code}
> It seems that ThriftUnionObjectInspector does not notify the waiting threads 
> once it has been inited.





[jira] [Assigned] (HIVE-25762) ok

2021-12-01 Thread Pravin Pawar (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Pawar reassigned HIVE-25762:
---

Assignee: Pravin Pawar

ok

> ok
> --
>
> Key: HIVE-25762
> URL: https://issues.apache.org/jira/browse/HIVE-25762
> Project: Hive
>  Issue Type: Bug
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Major
>
> ok





[jira] [Resolved] (HIVE-25761) unable see hive jobs

2021-12-01 Thread Pravin Pawar (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Pawar resolved HIVE-25761.
-
Release Note: resolved
  Resolution: Fixed

> unable see hive jobs
> 
>
> Key: HIVE-25761
> URL: https://issues.apache.org/jira/browse/HIVE-25761
> Project: Hive
>  Issue Type: Bug
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Critical
>
> not able to log in





[jira] [Commented] (HIVE-25761) unable see hive jobs

2021-12-01 Thread Pravin Pawar (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452141#comment-17452141
 ] 

Pravin Pawar commented on HIVE-25761:
-

RCA - issue resolved

> unable see hive jobs
> 
>
> Key: HIVE-25761
> URL: https://issues.apache.org/jira/browse/HIVE-25761
> Project: Hive
>  Issue Type: Bug
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Critical
>
> not able to log in





[jira] [Work started] (HIVE-25761) unable see hive jobs

2021-12-01 Thread Pravin Pawar (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-25761 started by Pravin Pawar.
---
> unable see hive jobs
> 
>
> Key: HIVE-25761
> URL: https://issues.apache.org/jira/browse/HIVE-25761
> Project: Hive
>  Issue Type: Bug
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Critical
>
> not able to log in





[jira] [Assigned] (HIVE-25761) unable see hive jobs

2021-12-01 Thread Pravin Pawar (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Pawar reassigned HIVE-25761:
---

Assignee: Pravin Pawar

> unable see hive jobs
> 
>
> Key: HIVE-25761
> URL: https://issues.apache.org/jira/browse/HIVE-25761
> Project: Hive
>  Issue Type: Bug
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Critical
>
> not able to log in





[jira] [Work started] (HIVE-25760) impala is not running

2021-12-01 Thread Pravin Pawar (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-25760 started by Pravin Pawar.
---
> impala is not running
> -
>
> Key: HIVE-25760
> URL: https://issues.apache.org/jira/browse/HIVE-25760
> Project: Hive
>  Issue Type: Bug
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Critical
>
> impala is not working properly





[jira] [Resolved] (HIVE-25760) impala is not running

2021-12-01 Thread Pravin Pawar (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Pawar resolved HIVE-25760.
-
Release Note: issue resolved
  Resolution: Fixed

> impala is not running
> -
>
> Key: HIVE-25760
> URL: https://issues.apache.org/jira/browse/HIVE-25760
> Project: Hive
>  Issue Type: Bug
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Critical
>
> impala is not working properly





[jira] [Commented] (HIVE-25760) impala is not running

2021-12-01 Thread Pravin Pawar (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452136#comment-17452136
 ] 

Pravin Pawar commented on HIVE-25760:
-

issue resolved

> impala is not running
> -
>
> Key: HIVE-25760
> URL: https://issues.apache.org/jira/browse/HIVE-25760
> Project: Hive
>  Issue Type: Bug
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Critical
>
> impala is not working properly





[jira] [Assigned] (HIVE-25760) impala is not running

2021-12-01 Thread Pravin Pawar (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Pawar reassigned HIVE-25760:
---

Assignee: Pravin Pawar

> impala is not running
> -
>
> Key: HIVE-25760
> URL: https://issues.apache.org/jira/browse/HIVE-25760
> Project: Hive
>  Issue Type: Bug
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Critical
>
> impala is not working properly





[jira] [Resolved] (HIVE-25759) unable to raise hive jobs

2021-12-01 Thread Pravin Pawar (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Pawar resolved HIVE-25759.
-
Release Note: issue resolved
  Resolution: Fixed

> unable to raise hive jobs
> -
>
> Key: HIVE-25759
> URL: https://issues.apache.org/jira/browse/HIVE-25759
> Project: Hive
>  Issue Type: Bug
>  Components: Avro
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Minor
>
> unable to run job





[jira] [Commented] (HIVE-25759) unable to raise hive jobs

2021-12-01 Thread Pravin Pawar (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452125#comment-17452125
 ] 

Pravin Pawar commented on HIVE-25759:
-

RCA - issue resolved

> unable to raise hive jobs
> -
>
> Key: HIVE-25759
> URL: https://issues.apache.org/jira/browse/HIVE-25759
> Project: Hive
>  Issue Type: Bug
>  Components: Avro
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Minor
>
> unable to run job





[jira] [Work started] (HIVE-25759) unable to raise hive jobs

2021-12-01 Thread Pravin Pawar (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-25759 started by Pravin Pawar.
---
> unable to raise hive jobs
> -
>
> Key: HIVE-25759
> URL: https://issues.apache.org/jira/browse/HIVE-25759
> Project: Hive
>  Issue Type: Bug
>  Components: Avro
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Minor
>
> unable to run job





[jira] [Assigned] (HIVE-25759) unable to raise hive jobs

2021-12-01 Thread Pravin Pawar (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Pawar reassigned HIVE-25759:
---


> unable to raise hive jobs
> -
>
> Key: HIVE-25759
> URL: https://issues.apache.org/jira/browse/HIVE-25759
> Project: Hive
>  Issue Type: Bug
>  Components: Avro
>Reporter: Pravin Pawar
>Assignee: Pravin Pawar
>Priority: Minor
>
> unable to run job





[jira] [Work logged] (HIVE-25652) Add constraints in result of “SHOW CREATE TABLE ”

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25652?focusedWorklogId=688998=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688998
 ]

ASF GitHub Bot logged work on HIVE-25652:
-

Author: ASF GitHub Bot
Created on: 02/Dec/21 01:47
Start Date: 02/Dec/21 01:47
Worklog Time Spent: 10m 
  Work Description: soumyakanti3578 commented on a change in pull request 
#2777:
URL: https://github.com/apache/hive/pull/2777#discussion_r760702646



##
File path: 
ql/src/test/results/clientpositive/llap/show_create_table_db_table.q.out
##
@@ -120,8 +120,8 @@ POSTHOOK: query: SHOW CREATE TABLE tmp_feng.tmp_showcrt4
 POSTHOOK: type: SHOW_CREATETABLE
 POSTHOOK: Input: tmp_feng@tmp_showcrt4
 CREATE TABLE `tmp_feng`.`tmp_showcrt4`(
-  `s1` struct<`p1`:string>, 
-  `s2` struct<`p2`:array, 
array>)
+  `s1` struct<``p1``:string>, 
+  `s2` struct<``p2``:array, 
array>)

Review comment:
   Hey @kasakrisz , what do you think about these changes? These occurred 
due to the change that you requested. I feel these are fine, but please let me 
know if you think otherwise.






Issue Time Tracking
---

Worklog Id: (was: 688998)
Time Spent: 4h 50m  (was: 4h 40m)

> Add constraints in result of “SHOW CREATE TABLE ”
> -
>
> Key: HIVE-25652
> URL: https://issues.apache.org/jira/browse/HIVE-25652
> Project: Hive
>  Issue Type: Improvement
>Reporter: Soumyakanti Das
>Assignee: Soumyakanti Das
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 4h 50m
>  Remaining Estimate: 0h
>
> Currently show create table doesn’t pull any constraint info like not null, 
> defaults, primary key.
> Example:
> Create table
>  
> {code:java}
> CREATE TABLE TEST(
>   col1 varchar(100) NOT NULL COMMENT "comment for column 1",
>   col2 timestamp DEFAULT CURRENT_TIMESTAMP() COMMENT "comment for column 2",
>   col3 decimal,
>   col4 varchar(512) NOT NULL,
>   col5 varchar(100),
>   primary key(col1, col2) disable novalidate)
> ROW FORMAT SERDE
>   'org.apache.hadoop.hive.ql.io.orc.OrcSerde'
> STORED AS INPUTFORMAT
>   'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat'
> OUTPUTFORMAT
>   'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat';
> {code}
> Currently {{SHOW CREATE TABLE TEST}} doesn't show the column constraints.
> {code:java}
> CREATE TABLE `test`(
>   `col1` varchar(100) COMMENT 'comment for column 1', 
>   `col2` timestamp COMMENT 'comment for column 2', 
>   `col3` decimal(10,0), 
>   `col4` varchar(512), 
>   `col5` varchar(100))
> ROW FORMAT SERDE 
>   'org.apache.hadoop.hive.ql.io.orc.OrcSerde' 
> STORED AS INPUTFORMAT 
>   'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat' 
> OUTPUTFORMAT 
>   'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat'
> {code}





[jira] [Work logged] (HIVE-25708) Implement creation of table_diff

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25708?focusedWorklogId=688977=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688977
 ]

ASF GitHub Bot logged work on HIVE-25708:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 23:45
Start Date: 01/Dec/21 23:45
Worklog Time Spent: 10m 
  Work Description: pkumarsinha commented on a change in pull request #2795:
URL: https://github.com/apache/hive/pull/2795#discussion_r760635567



##
File path: 
ql/src/java/org/apache/hadoop/hive/ql/exec/repl/OptimisedBootstrapUtils.java
##
@@ -0,0 +1,290 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.hadoop.hive.ql.exec.repl;
+
+import org.apache.commons.io.IOUtils;
+import org.apache.hadoop.fs.FSDataInputStream;
+import org.apache.hadoop.fs.FileStatus;
+import org.apache.hadoop.fs.FileSystem;
+import org.apache.hadoop.fs.Path;
+import org.apache.hadoop.fs.RemoteIterator;
+import org.apache.hadoop.hive.common.FileUtils;
+import org.apache.hadoop.hive.conf.HiveConf;
+import org.apache.hadoop.hive.metastore.api.Database;
+import org.apache.hadoop.hive.metastore.api.NotificationEvent;
+import 
org.apache.hadoop.hive.metastore.messaging.event.filters.DatabaseAndTableFilter;
+import org.apache.hadoop.hive.metastore.utils.MetaStoreUtils;
+import org.apache.hadoop.hive.ql.exec.repl.util.ReplUtils;
+import org.apache.hadoop.hive.ql.metadata.Hive;
+import org.apache.hadoop.hive.ql.metadata.HiveException;
+import org.apache.hadoop.hive.ql.metadata.Partition;
+import org.apache.hadoop.hive.ql.metadata.Table;
+import org.apache.hadoop.hive.ql.parse.SemanticException;
+import org.apache.hadoop.hive.ql.parse.repl.DumpType;
+import org.apache.hadoop.hive.ql.parse.repl.dump.Utils;
+import org.apache.hadoop.hive.ql.parse.repl.load.DumpMetaData;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+import java.io.IOException;
+import java.nio.charset.Charset;
+import java.util.ArrayList;
+import java.util.Arrays;
+import java.util.HashSet;
+import java.util.List;
+import java.util.stream.Collectors;
+
+import static 
org.apache.hadoop.hive.ql.parse.ReplicationSpec.getLastReplicatedStateFromParameters;
+
+public class OptimisedBootstrapUtils {

Review comment:
   Can you pls add a javadoc comment for the class?

##
File path: 
itests/hive-unit/src/test/java/org/apache/hadoop/hive/ql/parse/TestReplicationOptimisedBootstrap.java
##
@@ -0,0 +1,409 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+package org.apache.hadoop.hive.ql.parse;
+
+import org.apache.hadoop.fs.ContentSummary;
+import org.apache.hadoop.fs.Path;
+import org.apache.hadoop.hdfs.DistributedFileSystem;
+import org.apache.hadoop.hive.conf.HiveConf;
+import org.apache.hadoop.hive.metastore.conf.MetastoreConf;
+import 
org.apache.hadoop.hive.metastore.messaging.json.gzip.GzipJSONMessageEncoder;
+import org.apache.hadoop.hive.ql.metadata.HiveException;
+import org.apache.hadoop.security.UserGroupInformation;
+
+import org.junit.After;
+import org.junit.Before;
+import org.junit.BeforeClass;
+import org.junit.Test;
+
+import java.util.Arrays;
+import java.util.HashMap;
+import java.util.HashSet;
+import java.util.List;
+
+import static org.apache.hadoop.hdfs.protocol.HdfsConstants.QUOTA_RESET;
+import static 

[jira] [Updated] (HIVE-25758) OOM due to recursive application of CBO rules

2021-12-01 Thread Alessandro Solimando (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alessandro Solimando updated HIVE-25758:

Summary: OOM due to recursive application of CBO rules  (was: OOM due to 
recursive application CBO rules)

> OOM due to recursive application of CBO rules
> -
>
> Key: HIVE-25758
> URL: https://issues.apache.org/jira/browse/HIVE-25758
> Project: Hive
>  Issue Type: Bug
>  Components: CBO, Query Planning
>Affects Versions: 4.0.0
>Reporter: Alessandro Solimando
>Priority: Major
>
>  
> Reproducing query is as follows:
> {code:java}
> create table test1 (act_nbr string);
> create table test2 (month int);
> create table test3 (mth int, con_usd double);
> EXPLAIN
>   SELECT c.month,
>          d.con_usd
>   FROM
>     (SELECT cast(regexp_replace(substr(add_months(from_unixtime(unix_timestamp(), '-MM-dd'), -1), 1, 7), '-', '') AS int) AS month
>      FROM test1
>      UNION ALL
>      SELECT month
>      FROM test2
>      WHERE month = 202110) c
>   JOIN test3 d ON c.month = d.mth; {code}
>  
> Different plans are generated during the first CBO steps, the last being:
> {noformat}
> 2021-12-01T08:28:08,598 DEBUG [a18191bb-3a2b-4193-9abf-4e37dd1996bb main] parse.CalcitePlanner: Plan after decorrelation:
> HiveProject(month=[$0], con_usd=[$2])
>   HiveJoin(condition=[=($0, $1)], joinType=[inner], algorithm=[none], cost=[not available])
>     HiveProject(month=[$0])
>       HiveUnion(all=[true])
>         HiveProject(month=[CAST(regexp_replace(substr(add_months(FROM_UNIXTIME(UNIX_TIMESTAMP, _UTF-16LE'-MM-dd':VARCHAR(2147483647) CHARACTER SET "UTF-16LE"), -1), 1, 7), _UTF-16LE'-':VARCHAR(2147483647) CHARACTER SET "UTF-16LE", _UTF-16LE'':VARCHAR(2147483647) CHARACTER SET "UTF-16LE")):INTEGER])
>           HiveTableScan(table=[[default, test1]], table:alias=[test1])
>         HiveProject(month=[$0])
>           HiveFilter(condition=[=($0, CAST(202110):INTEGER)])
>             HiveTableScan(table=[[default, test2]], table:alias=[test2])
>     HiveTableScan(table=[[default, test3]], table:alias=[d]){noformat}
>  
> Then, the HEP planner will keep expanding the filter expression with 
> redundant expressions, such as the following, where the identical CAST 
> expression is present multiple times:
>  
> {noformat}
> rel#118:HiveFilter.HIVE.[].any(input=HepRelVertex#39,condition=IN(CAST(regexp_replace(substr(add_months(FROM_UNIXTIME(UNIX_TIMESTAMP, _UTF-16LE'-MM-dd':VARCHAR(2147483647) CHARACTER SET "UTF-16LE"), -1), 1, 7), _UTF-16LE'-':VARCHAR(2147483647) CHARACTER SET "UTF-16LE", _UTF-16LE'':VARCHAR(2147483647) CHARACTER SET "UTF-16LE")):INTEGER, CAST(regexp_replace(substr(add_months(FROM_UNIXTIME(UNIX_TIMESTAMP, _UTF-16LE'-MM-dd':VARCHAR(2147483647) CHARACTER SET "UTF-16LE"), -1), 1, 7), _UTF-16LE'-':VARCHAR(2147483647) CHARACTER SET "UTF-16LE", _UTF-16LE'':VARCHAR(2147483647) CHARACTER SET "UTF-16LE")):INTEGER, 202110)){noformat}
>  
> The problem seems to come from a bad interaction of at least 
> _HiveFilterProjectTransposeRule_ and 
> {_}HiveJoinPushTransitivePredicatesRule{_}, possibly more.
> Most probably the UNION part can be removed and the reproducer simplified 
> even further.
>  





[jira] [Work logged] (HIVE-25757) Use cached database type to choose metastore backend queries

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25757?focusedWorklogId=688786=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688786
 ]

ASF GitHub Bot logged work on HIVE-25757:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 16:59
Start Date: 01/Dec/21 16:59
Worklog Time Spent: 10m 
  Work Description: yongzhi opened a new pull request #2833:
URL: https://github.com/apache/hive/pull/2833


   
   
   ### What changes were proposed in this pull request?
   
   
   
   ### Why are the changes needed?
   
   
   
   ### Does this PR introduce _any_ user-facing change?
   
   
   
   ### How was this patch tested?
   
   




Issue Time Tracking
---

Worklog Id: (was: 688786)
Remaining Estimate: 0h
Time Spent: 10m

> Use cached database type to choose metastore backend queries
> 
>
> Key: HIVE-25757
> URL: https://issues.apache.org/jira/browse/HIVE-25757
> Project: Hive
>  Issue Type: Bug
>  Components: Metastore
>Affects Versions: 4.0.0
>Reporter: Yongzhi Chen
>Assignee: Yongzhi Chen
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> In HIVE-21075, we use DatabaseProduct.determineDatabaseProduct which can be 
> expensive.
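
One common way to avoid repeating an expensive lookup is to compute it once and 
cache the result. The sketch below assumes a hypothetical `CachedValue` helper 
and a caller-supplied loader; it is not the change made in the pull request and 
does not call any Hive API.

```java
import java.util.function.Supplier;

/** Hypothetical helper: runs an expensive, non-null-returning loader at most once and caches the result. */
final class CachedValue<T> {
    private final Supplier<T> loader;
    private volatile T value; // null means "not computed yet"

    CachedValue(Supplier<T> loader) {
        this.loader = loader;
    }

    T get() {
        T v = value;
        if (v == null) {
            // Double-checked locking: only the first caller pays for the lookup.
            synchronized (this) {
                v = value;
                if (v == null) {
                    v = loader.get();
                    value = v;
                }
            }
        }
        return v;
    }
}
```

A caller would wrap the costly determination in the loader once and then call 
`get()` wherever the database type is needed.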





[jira] [Updated] (HIVE-25757) Use cached database type to choose metastore backend queries

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-25757:
--
Labels: pull-request-available  (was: )

> Use cached database type to choose metastore backend queries
> 
>
> Key: HIVE-25757
> URL: https://issues.apache.org/jira/browse/HIVE-25757
> Project: Hive
>  Issue Type: Bug
>  Components: Metastore
>Affects Versions: 4.0.0
>Reporter: Yongzhi Chen
>Assignee: Yongzhi Chen
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> In HIVE-21075, we use DatabaseProduct.determineDatabaseProduct which can be 
> expensive.





[jira] [Resolved] (HIVE-25609) Preserve XAttrs in normal file copy case.

2021-12-01 Thread Pravin Sinha (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Sinha resolved HIVE-25609.
-
Resolution: Fixed

Committed to master.

Thanks for the patch [~haymant]  !!

> Preserve XAttrs in normal file copy case.
> -
>
> Key: HIVE-25609
> URL: https://issues.apache.org/jira/browse/HIVE-25609
> Project: Hive
>  Issue Type: Improvement
>Reporter: Haymant Mangla
>Assignee: Haymant Mangla
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 40m
>  Remaining Estimate: 0h
>






[jira] [Work logged] (HIVE-25738) NullIf doesn't support complex types

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25738?focusedWorklogId=688772=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688772
 ]

ASF GitHub Bot logged work on HIVE-25738:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 16:26
Start Date: 01/Dec/21 16:26
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk merged pull request #2816:
URL: https://github.com/apache/hive/pull/2816


   




Issue Time Tracking
---

Worklog Id: (was: 688772)
Time Spent: 2.5h  (was: 2h 20m)

> NullIf doesn't support complex types
> 
>
> Key: HIVE-25738
> URL: https://issues.apache.org/jira/browse/HIVE-25738
> Project: Hive
>  Issue Type: Improvement
>Reporter: Zoltan Haindrich
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 2.5h
>  Remaining Estimate: 0h
>
> {code}
> SELECT NULLIF(array(1,2,3),array(1,2,3))
> {code}
> results in:
> {code}
>  java.lang.ClassCastException: 
> org.apache.hadoop.hive.serde2.objectinspector.StandardListObjectInspector 
> cannot be cast to 
> org.apache.hadoop.hive.serde2.objectinspector.PrimitiveObjectInspector
>   at 
> org.apache.hadoop.hive.ql.udf.generic.GenericUDFNullif.evaluate(GenericUDFNullif.java:96)
>   at 
> org.apache.hadoop.hive.ql.udf.generic.GenericUDF.initializeAndFoldConstants(GenericUDF.java:177)
>   at 
> org.apache.hadoop.hive.ql.parse.type.HiveFunctionHelper.getReturnType(HiveFunctionHelper.java:135)
>   at 
> org.apache.hadoop.hive.ql.parse.type.RexNodeExprFactory.createFuncCallExpr(RexNodeExprFactory.java:647)
> [...]
> {code}





[jira] [Resolved] (HIVE-25738) NullIf doesn't support complex types

2021-12-01 Thread Zoltan Haindrich (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zoltan Haindrich resolved HIVE-25738.
-
Fix Version/s: 4.0.0
 Assignee: Zoltan Haindrich
   Resolution: Fixed

merged into master. Thank you [~zabetak] and [~dengzh] for reviewing the 
changes!

> NullIf doesn't support complex types
> 
>
> Key: HIVE-25738
> URL: https://issues.apache.org/jira/browse/HIVE-25738
> Project: Hive
>  Issue Type: Improvement
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 2.5h
>  Remaining Estimate: 0h
>
> {code}
> SELECT NULLIF(array(1,2,3),array(1,2,3))
> {code}
> results in:
> {code}
>  java.lang.ClassCastException: 
> org.apache.hadoop.hive.serde2.objectinspector.StandardListObjectInspector 
> cannot be cast to 
> org.apache.hadoop.hive.serde2.objectinspector.PrimitiveObjectInspector
>   at 
> org.apache.hadoop.hive.ql.udf.generic.GenericUDFNullif.evaluate(GenericUDFNullif.java:96)
>   at 
> org.apache.hadoop.hive.ql.udf.generic.GenericUDF.initializeAndFoldConstants(GenericUDF.java:177)
>   at 
> org.apache.hadoop.hive.ql.parse.type.HiveFunctionHelper.getReturnType(HiveFunctionHelper.java:135)
>   at 
> org.apache.hadoop.hive.ql.parse.type.RexNodeExprFactory.createFuncCallExpr(RexNodeExprFactory.java:647)
> [...]
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Assigned] (HIVE-25757) Use cached database type to choose metastore backend queries

2021-12-01 Thread Yongzhi Chen (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yongzhi Chen reassigned HIVE-25757:
---


> Use cached database type to choose metastore backend queries
> 
>
> Key: HIVE-25757
> URL: https://issues.apache.org/jira/browse/HIVE-25757
> Project: Hive
>  Issue Type: Bug
>  Components: Metastore
>Affects Versions: 4.0.0
>Reporter: Yongzhi Chen
>Assignee: Yongzhi Chen
>Priority: Major
>
> In HIVE-21075, we use DatabaseProduct.determineDatabaseProduct which can be 
> expensive.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Assigned] (HIVE-25756) Fix replication metrics backward compatibility issue.

2021-12-01 Thread Haymant Mangla (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Haymant Mangla reassigned HIVE-25756:
-


> Fix replication metrics backward compatibility issue.
> -
>
> Key: HIVE-25756
> URL: https://issues.apache.org/jira/browse/HIVE-25756
> Project: Hive
>  Issue Type: Bug
>Reporter: Haymant Mangla
>Assignee: Haymant Mangla
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688714&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688714
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:52
Start Date: 01/Dec/21 14:52
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760259374



##
File path: 
standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/txn/CompactionTxnHandler.java
##
@@ -336,9 +336,9 @@ public void markCompacted(CompactionInfo info) throws 
MetaException {
  * By filtering on minOpenTxnWaterMark, we will only cleanup after 
every transaction is committed, that could see
  * the uncompacted deltas. This way the cleaner can clean up 
everything that was made obsolete by this compaction.
  */
-String s = "SELECT \"CQ_ID\", \"CQ_DATABASE\", \"CQ_TABLE\", 
\"CQ_PARTITION\", "
-+ "\"CQ_TYPE\", \"CQ_RUN_AS\", \"CQ_HIGHEST_WRITE_ID\" FROM 
\"COMPACTION_QUEUE\" WHERE \"CQ_STATE\" = '"
-+ READY_FOR_CLEANING + "'";
+String s = "SELECT \"CQ_ID\", \"CQ_DATABASE\", \"CQ_TABLE\", 
\"CQ_PARTITION\", \"CQ_TYPE\", " 
++ "\"CQ_RUN_AS\", \"CQ_HIGHEST_WRITE_ID\", \"CQ_TBLPROPERTIES\" 
FROM \"COMPACTION_QUEUE\" WHERE \"CQ_STATE\" = '"

Review comment:
   It's an existing field, so it should be in the sysdb tables. This part only 
affects the Cleaner.






Issue Time Tracking
---

Worklog Id: (was: 688714)
Time Spent: 11.5h  (was: 11h 20m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 11.5h
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688712&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688712
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:50
Start Date: 01/Dec/21 14:50
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760257691



##
File path: 
standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/txn/TxnHandler.java
##
@@ -1451,11 +1453,11 @@ public void commitTxn(CommitTxnRequest rqst) throws 
NoSuchTxnException, TxnAbort
 OperationType.UPDATE + "," + OperationType.DELETE + ")";
 long tempCommitId = generateTemporaryId();
 
-if (txnType.get() == TxnType.COMPACTION) {
+if (txnType == TxnType.SOFT_DELETE || txnType == TxnType.COMPACTION) {
   acquireTxnLock(stmt, false);
   commitId = getHighWaterMark(stmt);
 
-} else if (txnType.get() != TxnType.READ_ONLY && !isReplayedReplTxn) {
+} else if (txnType != TxnType.READ_ONLY && !isReplayedReplTxn) {

Review comment:
   I need to check whether we send the metadata delete event for replication. 
The data removal, however, is done in the Cleaner.






Issue Time Tracking
---

Worklog Id: (was: 688712)
Time Spent: 11h 20m  (was: 11h 10m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 11h 20m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688710&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688710
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:48
Start Date: 01/Dec/21 14:48
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760255174



##
File path: 
standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/AcidEventListener.java
##
@@ -60,9 +65,30 @@ public void onDropDatabase (DropDatabaseEvent dbEvent) 
throws MetaException {
 
   @Override
   public void onDropTable(DropTableEvent tableEvent)  throws MetaException {
-if (TxnUtils.isTransactionalTable(tableEvent.getTable())) {
+Table table = tableEvent.getTable();
+
+if (TxnUtils.isTransactionalTable(table)) {
   txnHandler = getTxnHandler();
-  txnHandler.cleanupRecords(HiveObjectType.TABLE, null, 
tableEvent.getTable(), null);
+  txnHandler.cleanupRecords(HiveObjectType.TABLE, null, table, null, 
!tableEvent.getDeleteData());
+  
+  if (!tableEvent.getDeleteData()) {
+long currentTxn = 
Optional.ofNullable(tableEvent.getEnvironmentContext())
+  .map(EnvironmentContext::getProperties)
+  .map(prop -> prop.get("txnId"))
+  .map(Long::parseLong)
+  .orElse(0L);
+
+if (currentTxn > 0) {
+  CompactionRequest rqst = new CompactionRequest(table.getDbName(), 
table.getTableName(), CompactionType.MAJOR);
+  rqst.putToProperties("location", table.getSd().getLocation());

Review comment:
   I didn't want to change the compaction queue schema, and it more or less 
fits the definition of the CQ_TBLPROPERTIES column.
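
A minimal sketch of that idea, assuming the queue properties behave like a plain string map: the writer stores the table location under a well-known key and the reader treats it as optional, so no new column is needed. `QueuePropertiesSketch`, `withLocation` and `locationOf` are hypothetical names for illustration, not part of the PR.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

class QueuePropertiesSketch {
  // Writer side: store the table location under the "location" key.
  static Map<String, String> withLocation(String location) {
    Map<String, String> props = new HashMap<>();
    props.put("location", location);
    return props;
  }

  // Reader side: the location is optional, since ordinary requests never set it.
  static Optional<String> locationOf(Map<String, String> props) {
    return Optional.ofNullable(props).map(p -> p.get("location"));
  }
}
```

The trade-off is that the key name becomes an implicit contract between the producer and the Cleaner instead of being enforced by the schema.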






Issue Time Tracking
---

Worklog Id: (was: 688710)
Time Spent: 11h 10m  (was: 11h)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 11h 10m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688707&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688707
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:46
Start Date: 01/Dec/21 14:46
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760253006



##
File path: 
standalone-metastore/metastore-common/src/main/java/org/apache/hadoop/hive/metastore/HiveMetaStoreClient.java
##
@@ -1772,6 +1773,22 @@ public void dropTable(String dbname, String name, 
boolean deleteData,
 dropTable(getDefaultCatalog(conf), dbname, name, deleteData, 
ignoreUnknownTab, ifPurge);
   }
 
+  @Override
+  public void dropTable(Table tbl, boolean deleteData, boolean 
ignoreUnknownTbl, boolean ifPurge) throws TException {
+EnvironmentContext context = null;
+if (ifPurge) {

Review comment:
   Good catch, it's not handled yet, as we don't support purge in the Cleaner. 
However, we could pass the `purge` attribute via the queue tblproperties.






Issue Time Tracking
---

Worklog Id: (was: 688707)
Time Spent: 11h  (was: 10h 50m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 11h
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688703&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688703
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:43
Start Date: 01/Dec/21 14:43
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760251029



##
File path: ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Initiator.java
##
@@ -249,7 +249,7 @@ protected String resolveUserToRunAs(Map<String, String> 
cache, Table t, Partitio
 
 String user = cache.get(fullTableName);
 if (user == null) {
-  user = findUserToRunAs(sd.getLocation(), t);
+  user = TxnUtils.findUserToRunAs(sd.getLocation(), t, conf);

Review comment:
   Nope, the refactor is needed to set the runAs user for `safe-delete` in 
AcidEventListener.dropTable().






Issue Time Tracking
---

Worklog Id: (was: 688703)
Time Spent: 10h 50m  (was: 10h 40m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10h 50m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688701&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688701
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:42
Start Date: 01/Dec/21 14:42
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760249779



##
File path: ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Cleaner.java
##
@@ -318,8 +335,8 @@ private static String idWatermark(CompactionInfo ci) {
   private boolean removeFiles(String location, ValidWriteIdList writeIdList, 
CompactionInfo ci)
   throws IOException, NoSuchObjectException, MetaException {
 Path locPath = new Path(location);
-AcidDirectory dir = AcidUtils.getAcidState(locPath.getFileSystem(conf), 
locPath, conf, writeIdList, Ref.from(
-false), false);
+FileSystem fs = locPath.getFileSystem(conf);

Review comment:
   No, I created a separate cleanup method to handle `safe-delete` and 
extracted the common part.






Issue Time Tracking
---

Worklog Id: (was: 688701)
Time Spent: 10h 40m  (was: 10.5h)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10h 40m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688699&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688699
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:40
Start Date: 01/Dec/21 14:40
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r76024



##
File path: ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Cleaner.java
##
@@ -304,7 +321,7 @@ private ValidReaderWriteIdList 
getValidCleanerWriteIdList(CompactionInfo ci, Tab
   }
 
   private static boolean isDynPartAbort(Table t, CompactionInfo ci) {
-return t.getPartitionKeys() != null && t.getPartitionKeys().size() > 0
+return Optional.ofNullable(t).map(Table::getPartitionKeys).filter(pk -> 
pk.size() > 0).isPresent()

Review comment:
   In the `safe-delete` use case the table object would be null here.
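
The null-safe check in the hunk above can be illustrated in isolation. This is a sketch under assumptions: `Table` here is a hypothetical stand-in with only the accessor that matters, not the metastore class, and `hasPartitionKeys` is an illustrative name.

```java
import java.util.List;
import java.util.Optional;

class NullSafePartitionCheck {
  // Hypothetical stand-in for the metastore Table; only the accessor used below.
  static final class Table {
    private final List<String> partitionKeys;
    Table(List<String> partitionKeys) { this.partitionKeys = partitionKeys; }
    List<String> getPartitionKeys() { return partitionKeys; }
  }

  // True only when the table exists and has at least one partition key.
  // A null table or a null key list both collapse to an empty Optional.
  static boolean hasPartitionKeys(Table t) {
    return Optional.ofNullable(t)
        .map(Table::getPartitionKeys)
        .filter(pk -> !pk.isEmpty())
        .isPresent();
  }
}
```

Compared with the explicit `!= null && size() > 0` form, the Optional chain also tolerates a missing table object, which is the case when the metadata has already been removed.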






Issue Time Tracking
---

Worklog Id: (was: 688699)
Time Spent: 10.5h  (was: 10h 20m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10.5h
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688698&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688698
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:39
Start Date: 01/Dec/21 14:39
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760246130



##
File path: ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Cleaner.java
##
@@ -279,10 +292,14 @@ private void clean(CompactionInfo ci, long minOpenTxnGLB, 
boolean metricsEnabled
   }
 }
   }
+  
+  private interface CleanerTask<T> {
+T execute() throws Exception;
+  }
 
-  private ValidReaderWriteIdList getValidCleanerWriteIdList(CompactionInfo ci, 
Table t, ValidTxnList validTxnList)
+  private ValidReaderWriteIdList getValidCleanerWriteIdList(CompactionInfo ci, 
ValidTxnList validTxnList)

Review comment:
   It's needed for the `safe_delete` use case: we won't be able to resolve 
the table object because the metadata would already be removed. However, we 
could get the full tableName from the ci object.






Issue Time Tracking
---

Worklog Id: (was: 688698)
Time Spent: 10h 20m  (was: 10h 10m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10h 20m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688697&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688697
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:38
Start Date: 01/Dec/21 14:38
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760246130



##
File path: ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Cleaner.java
##
@@ -279,10 +292,14 @@ private void clean(CompactionInfo ci, long minOpenTxnGLB, 
boolean metricsEnabled
   }
 }
   }
+  
+  private interface CleanerTask<T> {
+T execute() throws Exception;
+  }
 
-  private ValidReaderWriteIdList getValidCleanerWriteIdList(CompactionInfo ci, 
Table t, ValidTxnList validTxnList)
+  private ValidReaderWriteIdList getValidCleanerWriteIdList(CompactionInfo ci, 
ValidTxnList validTxnList)

Review comment:
   It's needed for the `safe_delete` use case: we won't be able to resolve 
the table object because the metadata would already be removed. We should get 
the full tableName from the ci object.






Issue Time Tracking
---

Worklog Id: (was: 688697)
Time Spent: 10h 10m  (was: 10h)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10h 10m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688693&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688693
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:34
Start Date: 01/Dec/21 14:34
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760241861



##
File path: ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Cleaner.java
##
@@ -279,10 +292,14 @@ private void clean(CompactionInfo ci, long minOpenTxnGLB, 
boolean metricsEnabled
   }
 }
   }
+  
+  private interface CleanerTask<T> {

Review comment:
   not really, good catch!






Issue Time Tracking
---

Worklog Id: (was: 688693)
Time Spent: 10h  (was: 9h 50m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10h
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688691&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688691
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:31
Start Date: 01/Dec/21 14:31
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760238917



##
File path: ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Cleaner.java
##
@@ -172,22 +174,29 @@ private void clean(CompactionInfo ci, long minOpenTxnGLB, 
boolean metricsEnabled
   if (metricsEnabled) {
 perfLogger.perfLogBegin(CLASS_NAME, cleanerMetric);
   }
-  Table t = resolveTable(ci);
-  if (t == null) {
-// The table was dropped before we got around to cleaning it.
-LOG.info("Unable to find table " + ci.getFullTableName() + ", assuming 
it was dropped." +
+  String location = 
Optional.ofNullable(ci.properties).map(StringableMap::new)

Review comment:
   We only support this feature for ACID tables with a non-customized 
location:
   
   if (!TableType.VIRTUAL_VIEW.toString().equals(tbl.getTableType())) {
     if (tbl.getSd().getLocation() == null || tbl.getSd().getLocation().isEmpty()) {
       tblPath = wh.getDefaultTablePath(db, getPathWithSuffix(tbl.getTableName(), tbl), isExternal(tbl));
     }
   
   String getPathWithSuffix(String path, Table tbl) {
     return path + (tbl.isSetTxnId() ?
         VISIBILITY_PREFIX + String.format(DELTA_DIGITS, tbl.getTxnId()) : "");
   }
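
To make the suffix concrete, here is a self-contained sketch of the same path construction. The constant values ("_v" and "%07d") are assumptions made for illustration and are not confirmed by this thread, and `txnId > 0` stands in for `tbl.isSetTxnId()`; `PathSuffixSketch` is a hypothetical name.

```java
class PathSuffixSketch {
  // Assumed values, for illustration only.
  static final String VISIBILITY_PREFIX = "_v";
  static final String DELTA_DIGITS = "%07d";

  // Appends a zero-padded transaction id to the default table path.
  // A txnId of 0 means "not set", so the path is returned unchanged.
  static String getPathWithSuffix(String path, long txnId) {
    return txnId > 0
        ? path + VISIBILITY_PREFIX + String.format(DELTA_DIGITS, txnId)
        : path;
  }
}
```

Because the suffix is only appended when the default location is computed, a table created with an explicit location never gets it, which is why the feature is limited to non-customized locations.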
   






Issue Time Tracking
---

Worklog Id: (was: 688691)
Time Spent: 9h 50m  (was: 9h 40m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 9h 50m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688688&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688688
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:26
Start Date: 01/Dec/21 14:26
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760233519



##
File path: ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java
##
@@ -1334,6 +1344,20 @@ public void dropTable(String tableName, boolean ifPurge) 
throws HiveException {
 dropTable(names[0], names[1], true, true, ifPurge);
   }
 
+  public void dropTable(Table table, boolean ifPurge) throws HiveException {
+boolean tableWithSuffix = (HiveConf.getBoolVar(conf, 
ConfVars.HIVE_ACID_CREATE_TABLE_USE_SUFFIX)
+|| HiveConf.getBoolVar(conf, 
ConfVars.HIVE_ACID_LOCKLESS_READS_ENABLED))
+  && AcidUtils.isTransactionalTable(table)
+  && table.getSd().getLocation().matches(PATH_WITH_VERSION_ID);
+
+if (tableWithSuffix) {
+  long txnId = Optional.ofNullable(SessionState.get())
+.map(ss -> ss.getTxnMgr().getCurrentTxnId()).orElse(0L);
+  table.getTTable().setTxnId(txnId);
+}

Review comment:
   We could have old tables that were created without this feature being 
enabled, and we should still support regular drops for them.
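
A rough sketch of that decision, assuming a versioned location is any path ending in "_v" followed by digits. The real PATH_WITH_VERSION_ID pattern is not shown in this thread, so the regex below is a guess; `DropModeSketch` and `useNonBlockingDrop` are hypothetical names.

```java
import java.util.regex.Pattern;

class DropModeSketch {
  // Assumed shape of a versioned location: any path ending in "_v" plus digits.
  static final Pattern PATH_WITH_VERSION_ID = Pattern.compile(".*_v\\d+$");

  // The non-blocking drop applies only when the feature is on, the table is
  // transactional, and its location carries the version suffix. Everything
  // else, including tables created before the feature, takes the regular drop.
  static boolean useNonBlockingDrop(boolean featureEnabled, boolean transactional, String location) {
    return featureEnabled
        && transactional
        && location != null
        && PATH_WITH_VERSION_ID.matcher(location).matches();
  }
}
```

Checking the location rather than only the configuration is what keeps old tables on the regular drop path even after the feature is switched on.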






Issue Time Tracking
---

Worklog Id: (was: 688688)
Time Spent: 9h 40m  (was: 9.5h)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 9h 40m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688684&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688684
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:21
Start Date: 01/Dec/21 14:21
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760229198



##
File path: ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java
##
@@ -1273,14 +1276,21 @@ public void createTable(Table tbl, boolean ifNotExists,
   principalPrivs.setRolePrivileges(grants.getRoleGrants());
   tTbl.setPrivileges(principalPrivs);
 }
+boolean createTableUseSuffix = (HiveConf.getBoolVar(conf, 
ConfVars.HIVE_ACID_CREATE_TABLE_USE_SUFFIX)
+|| HiveConf.getBoolVar(conf, 
ConfVars.HIVE_ACID_LOCKLESS_READS_ENABLED))
+  && AcidUtils.isTransactionalTable(tbl);
+
+if (createTableUseSuffix) {
+  tTbl.setTxnId(ss.getTxnMgr().getCurrentTxnId());

Review comment:
   I am relying on txnId as a flag that shows whether the non-blocking drop 
is supported. If it is not set, we won't create a versioned table.




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 688684)
Time Spent: 9.5h  (was: 9h 20m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 9.5h
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688673=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688673
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:09
Start Date: 01/Dec/21 14:09
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760218118



##
File path: ql/src/java/org/apache/hadoop/hive/ql/ddl/table/drop/DropTableAnalyzer.java
##
@@ -51,7 +55,14 @@ public void analyzeInternal(ASTNode root) throws SemanticException {
 Table table = getTable(tableName, throwException);
 if (table != null) {
   inputs.add(new ReadEntity(table));
-  outputs.add(new WriteEntity(table, WriteEntity.WriteType.DDL_EXCLUSIVE));
+
+  boolean tableWithSuffix = (HiveConf.getBoolVar(conf, ConfVars.HIVE_ACID_CREATE_TABLE_USE_SUFFIX)
+  || HiveConf.getBoolVar(conf, ConfVars.HIVE_ACID_LOCKLESS_READS_ENABLED))
+&& AcidUtils.isTransactionalTable(table)
+&& table.getSd().getLocation().matches(PATH_WITH_VERSION_ID);
+
+  outputs.add(new WriteEntity(table, 

Review comment:
   It's already logged by DbTxnManager when calling 
AcidUtils.makeLockComponents.






Issue Time Tracking
---

Worklog Id: (was: 688673)
Time Spent: 9h 20m  (was: 9h 10m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 9h 20m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688670=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688670
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:06
Start Date: 01/Dec/21 14:06
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760214887



##
File path: ql/src/java/org/apache/hadoop/hive/ql/io/AcidUtils.java
##
@@ -3109,10 +3109,16 @@ public static TxnType getTxnType(Configuration conf, ASTNode tree) {
 if (tree.getFirstChildWithType(HiveParser.TOK_ALTERTABLE_COMPACT) != null){
   return TxnType.COMPACTION;
 }
+// check if soft delete
+if (tree.getToken().getType() == HiveParser.TOK_DROPTABLE 
+  && (HiveConf.getBoolVar(conf, ConfVars.HIVE_ACID_CREATE_TABLE_USE_SUFFIX)
+|| HiveConf.getBoolVar(conf, ConfVars.HIVE_ACID_LOCKLESS_READS_ENABLED))){
+  return TxnType.SOFT_DELETE;

Review comment:
   I don't think so, as we should still be able to drop tables created 
without that option (not versioned).






Issue Time Tracking
---

Worklog Id: (was: 688670)
Time Spent: 9h 10m  (was: 9h)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 9h 10m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688669=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688669
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 14:04
Start Date: 01/Dec/21 14:04
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760213666



##
File path: ql/src/java/org/apache/hadoop/hive/ql/ddl/table/drop/DropTableOperation.java
##
@@ -109,7 +109,7 @@ public int execute() throws HiveException {
 }
 
 // TODO: API w/catalog name
-context.getDb().dropTable(desc.getTableName(), desc.isPurge());
+context.getDb().dropTable(table, desc.isPurge());

Review comment:
   I need to pass extra info like txnId through the tbl object






Issue Time Tracking
---

Worklog Id: (was: 688669)
Time Spent: 9h  (was: 8h 50m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 9h
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-21075) Metastore: Drop partition performance downgrade with Postgres DB

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21075?focusedWorklogId=688661=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688661
 ]

ASF GitHub Bot logged work on HIVE-21075:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 13:51
Start Date: 01/Dec/21 13:51
Worklog Time Spent: 10m 
  Work Description: yongzhi merged pull request #2826:
URL: https://github.com/apache/hive/pull/2826


   




Issue Time Tracking
---

Worklog Id: (was: 688661)
Time Spent: 7h 50m  (was: 7h 40m)

> Metastore: Drop partition performance downgrade with Postgres DB
> 
>
> Key: HIVE-21075
> URL: https://issues.apache.org/jira/browse/HIVE-21075
> Project: Hive
>  Issue Type: Bug
>  Components: Metastore
>Affects Versions: 3.0.0
>Reporter: Yongzhi Chen
>Assignee: Oleksiy Sayankin
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-21075.2.patch
>
>  Time Spent: 7h 50m
>  Remaining Estimate: 0h
>
> To work around the performance issue caused by Oracle not supporting the 
> LIMIT clause, HIVE-9447 makes every backend DB run select count(1) 
> from SDS where SDS.CD_ID=? to check whether the specific CD_ID is referenced in 
> the SDS table before dropping a partition. This select count(1) statement does 
> not scale well in Postgres, and there is no index on the CD_ID column in the SDS table.
> For an SDS table with 1.5 million rows, select count(1) takes 700ms on average 
> without an index, versus 10-20ms with an index. But the statement used before 
> HIVE-9447 (SELECT * FROM "SDS" "A0" WHERE "A0"."CD_ID" = $1 limit 1) takes 
> less than 10ms.
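
The gap between the two statements comes down to counting every match versus stopping at the first one. The sketch below is only an in-memory analogy of an unindexed scan, written for this note: the class, the `Result` type and both method names are hypothetical, and a real database planner (or an index on CD_ID) changes the actual cost.

```java
public class ExistenceCheckSketch {

    // Hypothetical result holder: whether the id is referenced and how many rows were looked at.
    public static final class Result {
        public final boolean referenced;
        public final int rowsExamined;

        Result(boolean referenced, int rowsExamined) {
            this.referenced = referenced;
            this.rowsExamined = rowsExamined;
        }
    }

    // Analogue of "select count(1) ... where CD_ID = ?": every row is examined.
    public static Result viaCount(long[] cdIds, long target) {
        int examined = 0;
        int matches = 0;
        for (long id : cdIds) {
            examined++;
            if (id == target) {
                matches++;
            }
        }
        return new Result(matches > 0, examined);
    }

    // Analogue of "select ... where CD_ID = ? limit 1": stop at the first match.
    public static Result viaLimitOne(long[] cdIds, long target) {
        int examined = 0;
        for (long id : cdIds) {
            examined++;
            if (id == target) {
                return new Result(true, examined);
            }
        }
        return new Result(false, examined);
    }

    public static void main(String[] args) {
        long[] cdIds = {7, 42, 42, 9, 13};
        System.out.println(viaCount(cdIds, 42).rowsExamined);
        System.out.println(viaLimitOne(cdIds, 42).rowsExamined);
    }
}
```

With the sample array, the counting variant examines all 5 rows while the early-exit variant stops after 2; both report that the id is referenced.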





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688643=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688643
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 13:13
Start Date: 01/Dec/21 13:13
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760170333



##
File path: standalone-metastore/metastore-common/src/gen/thrift/gen-cpp/hive_metastore_types.cpp
##
@@ -615,16 +615,18 @@ int _kTxnTypeValues[] = {
   TxnType::REPL_CREATED,
   TxnType::READ_ONLY,
   TxnType::COMPACTION,
-  TxnType::MATER_VIEW_REBUILD
+  TxnType::MATER_VIEW_REBUILD,
+  TxnType::SOFT_DELETE

Review comment:
   However, an old HMS won't support lockless drop either, so we 
should be good.






Issue Time Tracking
---

Worklog Id: (was: 688643)
Time Spent: 8h 50m  (was: 8h 40m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 8h 50m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688642=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688642
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 13:13
Start Date: 01/Dec/21 13:13
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760170333



##
File path: standalone-metastore/metastore-common/src/gen/thrift/gen-cpp/hive_metastore_types.cpp
##
@@ -615,16 +615,18 @@ int _kTxnTypeValues[] = {
   TxnType::REPL_CREATED,
   TxnType::READ_ONLY,
   TxnType::COMPACTION,
-  TxnType::MATER_VIEW_REBUILD
+  TxnType::MATER_VIEW_REBUILD,
+  TxnType::SOFT_DELETE

Review comment:
   However, an old HMS won't support lockless reads, so we should be 
good.






Issue Time Tracking
---

Worklog Id: (was: 688642)
Time Spent: 8h 40m  (was: 8.5h)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 8h 40m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688640=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688640
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 13:11
Start Date: 01/Dec/21 13:11
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760168937



##
File path: standalone-metastore/metastore-common/src/gen/thrift/gen-cpp/hive_metastore_types.cpp
##
@@ -615,16 +615,18 @@ int _kTxnTypeValues[] = {
   TxnType::REPL_CREATED,
   TxnType::READ_ONLY,
   TxnType::COMPACTION,
-  TxnType::MATER_VIEW_REBUILD
+  TxnType::MATER_VIEW_REBUILD,
+  TxnType::SOFT_DELETE

Review comment:
   Old versions of HMS would treat new values as null (the default is null). 
In that case we would never enter the if branch for SOFT_DELETE, so we would 
not populate the highest write id required by the cleaner and might clean up 
some deltas prematurely; see HIVE-24291.
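
   The compatibility concern can be illustrated with a small, self-contained 
sketch. It assumes that the Thrift-generated Java enum offers a 
`findByValue(int)` lookup that returns null for a wire value it does not know. 
The `OldTxnType` enum, its numeric values, the SOFT_DELETE wire value and the 
`describe` helper are hypothetical stand-ins written for this example, not 
Hive code.

```java
public class TxnTypeCompatSketch {

    // Hypothetical stand-in for TxnType as an OLD metastore would know it:
    // SOFT_DELETE is missing. The numeric values are assumed for illustration.
    enum OldTxnType {
        DEFAULT(0), REPL_CREATED(1), READ_ONLY(2), COMPACTION(3), MATER_VIEW_REBUILD(4);

        private final int value;

        OldTxnType(int value) {
            this.value = value;
        }

        // Same lookup style as assumed above: unknown values map to null.
        static OldTxnType findByValue(int value) {
            for (OldTxnType t : values()) {
                if (t.value == value) {
                    return t;
                }
            }
            return null;
        }
    }

    // Assumed wire value that a newer client would send for SOFT_DELETE.
    public static final int SOFT_DELETE_WIRE_VALUE = 5;

    // Returns the name the old server resolves, or "null" when the value is unknown.
    public static String describe(int wireValue) {
        OldTxnType resolved = OldTxnType.findByValue(wireValue);
        return resolved == null ? "null" : resolved.name();
    }

    public static void main(String[] args) {
        System.out.println(describe(3));
        System.out.println(describe(SOFT_DELETE_WIRE_VALUE));
    }
}
```

   Running the sketch prints COMPACTION and then null: the old side resolves 
the unknown value to null, so any branch keyed on SOFT_DELETE is never taken 
there.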






Issue Time Tracking
---

Worklog Id: (was: 688640)
Time Spent: 8.5h  (was: 8h 20m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 8.5h
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688633=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688633
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 12:55
Start Date: 01/Dec/21 12:55
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760156827



##
File path: itests/hive-unit/src/test/java/org/apache/hadoop/hive/ql/TestAcidOnTez.java
##
@@ -18,6 +18,7 @@
 
 package org.apache.hadoop.hive.ql;
 
+import static org.apache.hadoop.hive.ql.TxnCommandsBaseForTests.*;

Review comment:
   The import is required after the tests refactor; the previous visibility was 
package-private.
   PS: moved the tests refactor to https://github.com/apache/hive/pull/2831






Issue Time Tracking
---

Worklog Id: (was: 688633)
Time Spent: 8h 10m  (was: 8h)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 8h 10m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688634=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688634
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 12:55
Start Date: 01/Dec/21 12:55
Worklog Time Spent: 10m 
  Work Description: deniskuzZ commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760157455



##
File path: ql/src/test/org/apache/hadoop/hive/ql/TxnCommandsBaseForTests.java
##
@@ -202,8 +208,29 @@ public int compare(int[] row1, int[] row2) {
   return 0;
 }
   }
-  protected String makeValuesClause(int[][] rows) {
-return TestTxnCommands2.makeValuesClause(rows);
+  public static String makeValuesClause(int[][] rows) {

Review comment:
   Moved the test refactor to: https://github.com/apache/hive/pull/2831






Issue Time Tracking
---

Worklog Id: (was: 688634)
Time Spent: 8h 20m  (was: 8h 10m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 8h 20m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Updated] (HIVE-25754) Fix column projection for union all queries with multiple aliases

2021-12-01 Thread Marton Bod (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marton Bod updated HIVE-25754:
--
Description: 
Given two tables:
{code:java}
create table source1 (dt string, d1 int, d2 int) stored as orc;
create table source2 (dt string, d1 int, d2 int) stored as orc;
insert into source1 values ('20211107', 1, 2);
insert into source2 values ('20211108', 11, 22);{code}
If you run this query with UNION ALL, the {{key}} column will be missing from 
the output:
{code:java}
select explode(map('D219', D219
,'D220', D220)) as (key, value) from (
   select '20211107' as date_key
,1 as D219
,2 as D220
) t
union all
select explode(map('D221', D221
,'D222', D222)) as (key, value)
from (
  select '20211107' as date_key
,1 as D221
,2 as D222
) t{code}
Result:
{code:java}
1
2
11
22{code}
 Correct result should be:
{code:java}
D219 1
D220 2
D221 11
D222 22{code}

  was:
Given two tables:

 
{code:java}
create table source1 (dt string, d1 int, d2 int) stored as orc;
create table source2 (dt string, d1 int, d2 int) stored as orc;
insert into source1 values ('20211107', 1, 2);
insert into source2 values ('20211108', 11, 22);{code}

If you run this query with UNION ALL, the {{key}} column will be missing from 
the output:

 

 
{code:java}
select explode(map('D219', D219
,'D220', D220)) as (key, value) from (
   select '20211107' as date_key
,1 as D219
,2 as D220
) t
union all
select explode(map('D221', D221
,'D222', D222)) as (key, value)
from (
  select '20211107' as date_key
,1 as D221
,2 as D222
) t{code}

Result:

 
{code:java}
1
2
11
22{code}
 
Correct result should be:
{code:java}
D219 1
D220 2
D221 11
D222 22{code}


> Fix column projection for union all queries with multiple aliases
> -
>
> Key: HIVE-25754
> URL: https://issues.apache.org/jira/browse/HIVE-25754
> Project: Hive
>  Issue Type: Bug
>Reporter: Marton Bod
>Assignee: Marton Bod
>Priority: Major
>
> Given two tables:
> {code:java}
> create table source1 (dt string, d1 int, d2 int) stored as orc;
> create table source2 (dt string, d1 int, d2 int) stored as orc;
> insert into source1 values ('20211107', 1, 2);
> insert into source2 values ('20211108', 11, 22);{code}
> If you run this query with UNION ALL, the {{key}} column will be missing from 
> the output:
> {code:java}
> select explode(map('D219', D219
> ,'D220', D220)) as (key, value) from (
>    select '20211107' as date_key
> ,1 as D219
> ,2 as D220
> ) t
> union all
> select explode(map('D221', D221
> ,'D222', D222)) as (key, value)
> from (
>   select '20211107' as date_key
> ,1 as D221
> ,2 as D222
> ) t{code}
> Result:
> {code:java}
> 1
> 2
> 11
> 22{code}
>  Correct result should be:
> {code:java}
> D219 1
> D220 2
> D221 11
> D222 22{code}





[jira] [Work logged] (HIVE-25755) TestTxnCommands classes refactor

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25755?focusedWorklogId=688625=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688625
 ]

ASF GitHub Bot logged work on HIVE-25755:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 12:37
Start Date: 01/Dec/21 12:37
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2831:
URL: https://github.com/apache/hive/pull/2831#discussion_r760143965



##
File path: ql/src/test/org/apache/hadoop/hive/ql/TestTxnCommands2.java
##
@@ -767,7 +709,7 @@ public void testNonAcidToAcidConversion3() throws Exception {
 // 4. Update the existing row, and insert another row to newly-converted ACID table
 runStatementOnDriver("update " + Table.NONACIDORCTBL + " set b=3 where a=1");
 runStatementOnDriver("insert into " + Table.NONACIDORCTBL + "(a,b) values(3,4)");
-status = fs.listStatus(new Path(TEST_WAREHOUSE_DIR + "/" +
+status = fs.listStatus(new Path(getWarehouseDir() + "/" +

Review comment:
   Question: Would it be worth creating a method for this?
   `listStatus(String tableName)`
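
   A minimal sketch of what such a helper might look like. To stay runnable 
without Hadoop on the classpath it uses `java.nio.file` as a stand-in for the 
Hadoop `FileSystem` used by the test, and the class name, constructor and 
return type are hypothetical choices made for this example rather than the 
PR's code.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class WarehouseListingSketch {

    private final Path warehouseDir;

    public WarehouseListingSketch(Path warehouseDir) {
        this.warehouseDir = warehouseDir;
    }

    // Hypothetical helper: returns the sorted entry names directly under
    // <warehouseDir>/<tableName>, so callers no longer build the path by hand.
    public List<String> listStatus(String tableName) throws IOException {
        Path tableDir = warehouseDir.resolve(tableName);
        try (Stream<Path> entries = Files.list(tableDir)) {
            return entries
                .map(p -> p.getFileName().toString())
                .sorted()
                .collect(Collectors.toList());
        }
    }

    public static void main(String[] args) throws IOException {
        // Build a throwaway warehouse with one table directory and two files.
        Path warehouse = Files.createTempDirectory("warehouse");
        Path table = Files.createDirectory(warehouse.resolve("nonacidorctbl"));
        Files.createFile(table.resolve("000001_0"));
        Files.createFile(table.resolve("000000_0"));
        System.out.println(new WarehouseListingSketch(warehouse).listStatus("nonacidorctbl"));
    }
}
```

   The point of the design is that each test passes only the table name, and 
the warehouse directory lookup and path concatenation live in one place.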






Issue Time Tracking
---

Worklog Id: (was: 688625)
Time Spent: 1h  (was: 50m)

> TestTxnCommands classes refactor
> 
>
> Key: HIVE-25755
> URL: https://issues.apache.org/jira/browse/HIVE-25755
> Project: Hive
>  Issue Type: Test
>Reporter: Denys Kuzmenko
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 1h
>  Remaining Estimate: 0h
>






[jira] [Work logged] (HIVE-25755) TestTxnCommands classes refactor

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25755?focusedWorklogId=688624=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688624
 ]

ASF GitHub Bot logged work on HIVE-25755:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 12:36
Start Date: 01/Dec/21 12:36
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2831:
URL: https://github.com/apache/hive/pull/2831#discussion_r760143118



##
File path: ql/src/test/org/apache/hadoop/hive/ql/TestTxnCommands2.java
##
@@ -1172,7 +1114,7 @@ private int getFileSystemCacheSize() throws Exception {
   this.total = total;
 }
   }
-  private static CompactionsByState countCompacts(TxnStore txnHandler) throws MetaException {
+  private CompactionsByState countCompacts(TxnStore txnHandler) throws MetaException {

Review comment:
   nit: new line before






Issue Time Tracking
---

Worklog Id: (was: 688624)
Time Spent: 50m  (was: 40m)

> TestTxnCommands classes refactor
> 
>
> Key: HIVE-25755
> URL: https://issues.apache.org/jira/browse/HIVE-25755
> Project: Hive
>  Issue Type: Test
>Reporter: Denys Kuzmenko
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 50m
>  Remaining Estimate: 0h
>






[jira] [Work logged] (HIVE-25755) TestTxnCommands classes refactor

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25755?focusedWorklogId=688623=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688623
 ]

ASF GitHub Bot logged work on HIVE-25755:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 12:35
Start Date: 01/Dec/21 12:35
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2831:
URL: https://github.com/apache/hive/pull/2831#discussion_r760142963



##
File path: ql/src/test/org/apache/hadoop/hive/ql/TestTxnCommands2.java
##
@@ -1010,7 +952,7 @@ public void testInsertOverwriteWithSelfJoin() throws Exception {
 Assert.assertEquals("Insert overwrite partition failed", stringifyValues(updatedData), rs2);
 //insert overwrite not supported for ACID tables
   }
-  private static void checkCompactionState(CompactionsByState expected, CompactionsByState actual) {
+  private void checkCompactionState(CompactionsByState expected, CompactionsByState actual) {

Review comment:
   nit: new line before






Issue Time Tracking
---

Worklog Id: (was: 688623)
Time Spent: 40m  (was: 0.5h)

> TestTxnCommands classes refactor
> 
>
> Key: HIVE-25755
> URL: https://issues.apache.org/jira/browse/HIVE-25755
> Project: Hive
>  Issue Type: Test
>Reporter: Denys Kuzmenko
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 40m
>  Remaining Estimate: 0h
>






[jira] [Work logged] (HIVE-25755) TestTxnCommands classes refactor

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25755?focusedWorklogId=688622=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688622
 ]

ASF GitHub Bot logged work on HIVE-25755:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 12:34
Start Date: 01/Dec/21 12:34
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2831:
URL: https://github.com/apache/hive/pull/2831#discussion_r760141833



##
File path: ql/src/test/org/apache/hadoop/hive/ql/TestTxnCommands.java
##
@@ -1397,7 +1396,7 @@ public void testNonAcidToAcidConversion01() throws Exception {
 BucketCodec.V1.encode(new AcidOutputFormat.Options(hiveConf).bucket(1)));
 
 //run Compaction
-runStatementOnDriver("alter table "+ TestTxnCommands2.Table.NONACIDORCTBL +" compact 'major'");
+runStatementOnDriver("alter table "+ Table.NONACIDORCTBL +" compact 'major'");

Review comment:
   nit: Maybe OCD but put spaces around the `+` signs please






Issue Time Tracking
---

Worklog Id: (was: 688622)
Time Spent: 0.5h  (was: 20m)

> TestTxnCommands classes refactor
> 
>
> Key: HIVE-25755
> URL: https://issues.apache.org/jira/browse/HIVE-25755
> Project: Hive
>  Issue Type: Test
>Reporter: Denys Kuzmenko
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>






[jira] [Work logged] (HIVE-25755) TestTxnCommands classes refactor

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25755?focusedWorklogId=688621=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688621
 ]

ASF GitHub Bot logged work on HIVE-25755:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 12:33
Start Date: 01/Dec/21 12:33
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2831:
URL: https://github.com/apache/hive/pull/2831#discussion_r760141497



##
File path: ql/src/test/org/apache/hadoop/hive/ql/TestTxnCommands.java
##
@@ -1352,7 +1351,7 @@ public void run() {
   }
 }, 5000);
 long start = System.currentTimeMillis();
-runStatementOnDriver("alter table "+ TestTxnCommands2.Table.ACIDTBL +" compact 'major' AND WAIT");
+runStatementOnDriver("alter table "+ Table.ACIDTBL +" compact 'major' AND WAIT");

Review comment:
   nit: Maybe OCD but put spaces around the `+` signs please






Issue Time Tracking
---

Worklog Id: (was: 688621)
Time Spent: 20m  (was: 10m)

> TestTxnCommands classes refactor
> 
>
> Key: HIVE-25755
> URL: https://issues.apache.org/jira/browse/HIVE-25755
> Project: Hive
>  Issue Type: Test
>Reporter: Denys Kuzmenko
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 20m
>  Remaining Estimate: 0h
>






[jira] [Work logged] (HIVE-25755) TestTxnCommands classes refactor

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25755?focusedWorklogId=688618=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688618
 ]

ASF GitHub Bot logged work on HIVE-25755:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 12:32
Start Date: 01/Dec/21 12:32
Worklog Time Spent: 10m 
  Work Description: deniskuzZ opened a new pull request #2831:
URL: https://github.com/apache/hive/pull/2831


   
   
   ### What changes were proposed in this pull request?
   
   
   
   ### Why are the changes needed?
   
   
   
   ### Does this PR introduce _any_ user-facing change?
   
   
   
   ### How was this patch tested?
   
   




Issue Time Tracking
---

Worklog Id: (was: 688618)
Remaining Estimate: 0h
Time Spent: 10m

> TestTxnCommands classes refactor
> 
>
> Key: HIVE-25755
> URL: https://issues.apache.org/jira/browse/HIVE-25755
> Project: Hive
>  Issue Type: Test
>Reporter: Denys Kuzmenko
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>






[jira] [Updated] (HIVE-25755) TestTxnCommands classes refactor

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-25755:
--
Labels: pull-request-available  (was: )

> TestTxnCommands classes refactor
> 
>
> Key: HIVE-25755
> URL: https://issues.apache.org/jira/browse/HIVE-25755
> Project: Hive
>  Issue Type: Test
>Reporter: Denys Kuzmenko
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>






[jira] [Assigned] (HIVE-25755) TestTxnCommands classes refactor

2021-12-01 Thread Denys Kuzmenko (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Denys Kuzmenko reassigned HIVE-25755:
-

Assignee: Denys Kuzmenko

> TestTxnCommands classes refactor
> 
>
> Key: HIVE-25755
> URL: https://issues.apache.org/jira/browse/HIVE-25755
> Project: Hive
>  Issue Type: Test
>Reporter: Denys Kuzmenko
>Assignee: Denys Kuzmenko
>Priority: Major
>






[jira] [Commented] (HIVE-25754) Fix column projection for union all queries with multiple aliases

2021-12-01 Thread Marton Bod (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17451733#comment-17451733
 ] 

Marton Bod commented on HIVE-25754:
---

PR: [https://github.com/apache/hive/pull/2822]

 

> Fix column projection for union all queries with multiple aliases
> -
>
> Key: HIVE-25754
> URL: https://issues.apache.org/jira/browse/HIVE-25754
> Project: Hive
>  Issue Type: Bug
>Reporter: Marton Bod
>Assignee: Marton Bod
>Priority: Major
>
> Given two tables:
>  
> {code:java}
> create table source1 (dt string, d1 int, d2 int) stored as orc;
> create table source2 (dt string, d1 int, d2 int) stored as orc;
> insert into source1 values ('20211107', 1, 2);
> insert into source2 values ('20211108', 11, 22);{code}
> If you run this query with UNION ALL, the {{key}} column will be missing from 
> the output:
>  
>  
> {code:java}
> select explode(map('D219', D219
> ,'D220', D220)) as (key, value) from (
>    select '20211107' as date_key
> ,1 as D219
> ,2 as D220
> ) t
> union all
> select explode(map('D221', D221
> ,'D222', D222)) as (key, value)
> from (
>   select '20211107' as date_key
> ,1 as D221
> ,2 as D222
> ) t{code}
> Result:
>  
> {code:java}
> 1
> 2
> 11
> 22{code}
>  
> Correct result should be:
> {code:java}
> D219 1
> D220 2
> D221 11
> D222 22{code}
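
The expected output above can be restated as a small plain-Java sketch. It uses no Hive APIs: ExplodeUnionAllSketch, explode and unionAll are hypothetical names chosen for illustration, and the values are taken from the "Correct result" listing in the description. It only shows the semantics the reporter expects, namely that each branch of the UNION ALL contributes both the key and the value column.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class ExplodeUnionAllSketch {

  /** Emits one "key value" row per map entry, mimicking explode(map(...)) as (key, value). */
  static List<String> explode(Map<String, Integer> map) {
    List<String> rows = new ArrayList<>();
    for (Map.Entry<String, Integer> entry : map.entrySet()) {
      rows.add(entry.getKey() + " " + entry.getValue());
    }
    return rows;
  }

  /** Concatenates both inputs without de-duplication, like UNION ALL. */
  static List<String> unionAll(List<String> left, List<String> right) {
    List<String> rows = new ArrayList<>(left);
    rows.addAll(right);
    return rows;
  }

  /** Builds the rows the description lists as the correct result. */
  static List<String> expectedRows() {
    // LinkedHashMap keeps insertion order, so the rows come out in a stable order.
    Map<String, Integer> first = new LinkedHashMap<>();
    first.put("D219", 1);
    first.put("D220", 2);
    Map<String, Integer> second = new LinkedHashMap<>();
    second.put("D221", 11);
    second.put("D222", 22);
    return unionAll(explode(first), explode(second));
  }

  public static void main(String[] args) {
    expectedRows().forEach(System.out::println);
  }
}
```

In the buggy output reported above, only the value half of each row survives; the sketch keeps both halves in every branch.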





[jira] [Updated] (HIVE-25754) Fix column projection for union all queries with multiple aliases

2021-12-01 Thread Marton Bod (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marton Bod updated HIVE-25754:
--
Description: 
Given two tables:

 
{code:java}
create table source1 (dt string, d1 int, d2 int) stored as orc;
create table source2 (dt string, d1 int, d2 int) stored as orc;
insert into source1 values ('20211107', 1, 2);
insert into source2 values ('20211108', 11, 22);{code}

If you run this query with UNION ALL, the {{key}} column will be missing from 
the output:

 

 
{code:java}
select explode(map('D219', D219
,'D220', D220)) as (key, value) from (
   select '20211107' as date_key
,1 as D219
,2 as D220
) t
union all
select explode(map('D221', D221
,'D222', D222)) as (key, value)
from (
  select '20211107' as date_key
,1 as D221
,2 as D222
) t{code}

Result:

 
{code:java}
1
2
11
22{code}
 
Correct result should be:
{code:java}
D219 1
D220 2
D221 11
D222 22{code}

  was:
Given two tables:

{{create table source1 (dt string, d1 int, d2 int) stored as orc;
create table source2 (dt string, d1 int, d2 int) stored as orc;
insert into source1 values ('20211107', 1, 2);
insert into source2 values ('20211108', 11, 22);}}
If you run this query with UNION ALL, the {{key}} column will be missing from 
the output:

{{select explode(map('D219', D219
,'D220', D220)) as (key, value) from (}}
   {{select '20211107' as date_key
,1 as D219
,2 as D220
) t}}
{{union all}}
{{select explode(map('D221', D221
,'D222', D222)) as (key, value)
from (}}
  {{select '20211107' as date_key
,1 as D221
,2 as D222
) t}}
Result:

{{1}}
{{2}}
{{11}}
{{22}}
 
Correct result should be:

{{D219  1}}
{{D220  2}}
{{D221  11}}
{{D222  22}}


> Fix column projection for union all queries with multiple aliases
> -
>
> Key: HIVE-25754
> URL: https://issues.apache.org/jira/browse/HIVE-25754
> Project: Hive
>  Issue Type: Bug
>Reporter: Marton Bod
>Assignee: Marton Bod
>Priority: Major
>
> Given two tables:
>  
> {code:java}
> create table source1 (dt string, d1 int, d2 int) stored as orc;
> create table source2 (dt string, d1 int, d2 int) stored as orc;
> insert into source1 values ('20211107', 1, 2);
> insert into source2 values ('20211108', 11, 22);{code}
> If you run this query with UNION ALL, the {{key}} column will be missing from 
> the output:
>  
>  
> {code:java}
> select explode(map('D219', D219
> ,'D220', D220)) as (key, value) from (
>    select '20211107' as date_key
> ,1 as D219
> ,2 as D220
> ) t
> union all
> select explode(map('D221', D221
> ,'D222', D222)) as (key, value)
> from (
>   select '20211107' as date_key
> ,1 as D221
> ,2 as D222
> ) t{code}
> Result:
>  
> {code:java}
> 1
> 2
> 11
> 22{code}
>  
> Correct result should be:
> {code:java}
> D219 1
> D220 2
> D221 11
> D222 22{code}





[jira] [Assigned] (HIVE-25754) Fix column projection for union all queries with multiple aliases

2021-12-01 Thread Marton Bod (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marton Bod reassigned HIVE-25754:
-


> Fix column projection for union all queries with multiple aliases
> -
>
> Key: HIVE-25754
> URL: https://issues.apache.org/jira/browse/HIVE-25754
> Project: Hive
>  Issue Type: Bug
>Reporter: Marton Bod
>Assignee: Marton Bod
>Priority: Major
>
> Given two tables:
> {{create table source1 (dt string, d1 int, d2 int) stored as orc;
> create table source2 (dt string, d1 int, d2 int) stored as orc;
> insert into source1 values ('20211107', 1, 2);
> insert into source2 values ('20211108', 11, 22);}}
> If you run this query with UNION ALL, the {{key}} column will be missing from 
> the output:
> {{select explode(map('D219', D219
> ,'D220', D220)) as (key, value) from (}}
>    {{select '20211107' as date_key
> ,1 as D219
> ,2 as D220
> ) t}}
> {{union all}}
> {{select explode(map('D221', D221
> ,'D222', D222)) as (key, value)
> from (}}
>   {{select '20211107' as date_key
> ,1 as D221
> ,2 as D222
> ) t}}
> Result:
> {{1}}
> {{2}}
> {{11}}
> {{22}}
>  
> Correct result should be:
> {{D219 1}}
> {{D220 2}}
> {{D221 11}}
> {{D222 22}}





[jira] [Work logged] (HIVE-25740) Handle race condition between compaction txn abort/commit and heartbeater

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25740?focusedWorklogId=688576=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688576
 ]

ASF GitHub Bot logged work on HIVE-25740:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 10:45
Start Date: 01/Dec/21 10:45
Worklog Time Spent: 10m 
  Work Description: marton-bod commented on a change in pull request #2817:
URL: https://github.com/apache/hive/pull/2817#discussion_r760063403



##
File path: ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Worker.java
##
@@ -233,39 +234,36 @@ void gatherStats() {
 private final CompactionTxn compactionTxn;
 private final String tableName;
 private final HiveConf conf;
-private final long txnTimeout;
+private final AtomicBoolean shouldLogError;
 
 public CompactionHeartbeater(CompactionTxn compactionTxn, String 
tableName, HiveConf conf) {
   this.tableName = Objects.requireNonNull(tableName);
   this.compactionTxn = Objects.requireNonNull(compactionTxn);
   this.conf = Objects.requireNonNull(conf);
-
-  this.txnTimeout = MetastoreConf.getTimeVar(conf, 
MetastoreConf.ConfVars.TXN_TIMEOUT, TimeUnit.MILLISECONDS);
+  this.shouldLogError = new AtomicBoolean(true);
 
   setDaemon(true);
   setPriority(MIN_PRIORITY);
   setName("CompactionHeartbeater-" + compactionTxn.getTxnId());
 }
 
+public void shouldLogError(boolean shouldLogError) {
+  this.shouldLogError.set(shouldLogError);
+}
+
 @Override
 public void run() {
   LOG.debug("Heartbeating compaction transaction id {} for table: {}", 
compactionTxn, tableName);
-
   IMetaStoreClient msc = null;
   try {
 // Create a metastore client for each thread since it is not thread 
safe
 msc = HiveMetaStoreUtils.getHiveMetastoreClient(conf);

Review comment:
   That makes sense, and I was debating that internally as well. However, 
it would complicate the cleanup of the thread quite a bit. Since the thread 
runs infrequently (every 2.5 minutes or so), I think the cost of client creation 
should be pretty low and not worth the extra complexity of storing it in a 
field and worrying about cleanup in the various scenarios. Let me know if that 
makes sense.






Issue Time Tracking
---

Worklog Id: (was: 688576)
Time Spent: 2h  (was: 1h 50m)

> Handle race condition between compaction txn abort/commit and heartbeater
> -
>
> Key: HIVE-25740
> URL: https://issues.apache.org/jira/browse/HIVE-25740
> Project: Hive
>  Issue Type: Bug
>Reporter: Marton Bod
>Assignee: Marton Bod
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 2h
>  Remaining Estimate: 0h
>
> The issue is the following: once the compaction worker finishes, 
> commitTxn/abortTxn is invoked first, and the heartbeater thread is only 
> interrupted after that. This can lead to a race condition where the txn has 
> already been deleted from the backend DB via commit/abort, but the 
> concurrently running heartbeater thread still attempts to send a last 
> heartbeat. The txn id is then not found in the DB, leading to 
> {{NoSuchTxnException}}.
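
The ordering problem described above can be sketched in plain Java. This is an illustrative sketch only and not the change made in pull request #2817: FakeTxnStore, Heartbeater and commitAfterStoppingHeartbeater are hypothetical names, and the in-memory set merely stands in for the backend DB. It shows one possible way to avoid the race, assuming the heartbeater can be interrupted and joined before the txn is committed.

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

class HeartbeatOrderingSketch {

  /** Hypothetical in-memory stand-in for the backend DB's open transactions. */
  static final class FakeTxnStore {
    private final Set<Long> openTxns = ConcurrentHashMap.newKeySet();
    final AtomicInteger failedHeartbeats = new AtomicInteger();

    void open(long txnId) { openTxns.add(txnId); }

    void commit(long txnId) { openTxns.remove(txnId); }

    /** Counts a failure when the txn is gone (where NoSuchTxnException would surface). */
    void heartbeat(long txnId) {
      if (!openTxns.contains(txnId)) {
        failedHeartbeats.incrementAndGet();
      }
    }
  }

  /** Hypothetical heartbeater: beats until it is interrupted. */
  static final class Heartbeater extends Thread {
    private final FakeTxnStore store;
    private final long txnId;

    Heartbeater(FakeTxnStore store, long txnId) {
      this.store = store;
      this.txnId = txnId;
      setDaemon(true);
    }

    @Override
    public void run() {
      try {
        while (!isInterrupted()) {
          store.heartbeat(txnId);
          Thread.sleep(5);
        }
      } catch (InterruptedException e) {
        // Interrupted while sleeping: treat as a normal shutdown request.
      }
    }
  }

  /** Stops the heartbeater first, then commits; returns the number of failed heartbeats. */
  static int commitAfterStoppingHeartbeater() {
    FakeTxnStore store = new FakeTxnStore();
    long txnId = 42L;
    store.open(txnId);
    Heartbeater heartbeater = new Heartbeater(store, txnId);
    heartbeater.start();
    try {
      Thread.sleep(20);          // pretend the compaction is running
      heartbeater.interrupt();   // 1. ask the heartbeater to stop
      heartbeater.join();        // 2. wait until it has really finished
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new IllegalStateException(e);
    }
    store.commit(txnId);         // 3. only now remove the txn
    return store.failedHeartbeats.get();
  }

  public static void main(String[] args) {
    System.out.println("failed heartbeats: " + commitAfterStoppingHeartbeater());
  }
}
```

Because the commit happens only after join() returns, no heartbeat in this sketch can observe a missing txn. Swapping steps 1 and 2 with step 3 reproduces the window the issue describes.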





[jira] [Work logged] (HIVE-25740) Handle race condition between compaction txn abort/commit and heartbeater

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25740?focusedWorklogId=688575=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688575
 ]

ASF GitHub Bot logged work on HIVE-25740:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 10:43
Start Date: 01/Dec/21 10:43
Worklog Time Spent: 10m 
  Work Description: marton-bod commented on a change in pull request #2817:
URL: https://github.com/apache/hive/pull/2817#discussion_r760061381



##
File path: ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Worker.java
##
@@ -751,11 +748,28 @@ void wasSuccessful() {
   if (status == TxnStatus.UNKNOWN) {
 return;

Review comment:
   Good idea. I've refactored the `close()` method accordingly






Issue Time Tracking
---

Worklog Id: (was: 688575)
Time Spent: 1h 50m  (was: 1h 40m)

> Handle race condition between compaction txn abort/commit and heartbeater
> -
>
> Key: HIVE-25740
> URL: https://issues.apache.org/jira/browse/HIVE-25740
> Project: Hive
>  Issue Type: Bug
>Reporter: Marton Bod
>Assignee: Marton Bod
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
> The issue is the following: once the compaction worker finishes, 
> commitTxn/abortTxn is invoked first, and the heartbeater thread is only 
> interrupted after that. This can lead to a race condition where the txn has 
> already been deleted from the backend DB via commit/abort, but the 
> concurrently running heartbeater thread still attempts to send a last 
> heartbeat. The txn id is then not found in the DB, leading to 
> {{NoSuchTxnException}}.





[jira] [Work logged] (HIVE-25740) Handle race condition between compaction txn abort/commit and heartbeater

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25740?focusedWorklogId=688574=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688574
 ]

ASF GitHub Bot logged work on HIVE-25740:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 10:42
Start Date: 01/Dec/21 10:42
Worklog Time Spent: 10m 
  Work Description: marton-bod commented on a change in pull request #2817:
URL: https://github.com/apache/hive/pull/2817#discussion_r760061124



##
File path: ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Worker.java
##
@@ -233,39 +234,36 @@ void gatherStats() {
 private final CompactionTxn compactionTxn;
 private final String tableName;
 private final HiveConf conf;
-private final long txnTimeout;
+private final AtomicBoolean shouldLogError;
 
 public CompactionHeartbeater(CompactionTxn compactionTxn, String 
tableName, HiveConf conf) {
   this.tableName = Objects.requireNonNull(tableName);
   this.compactionTxn = Objects.requireNonNull(compactionTxn);
   this.conf = Objects.requireNonNull(conf);
-
-  this.txnTimeout = MetastoreConf.getTimeVar(conf, 
MetastoreConf.ConfVars.TXN_TIMEOUT, TimeUnit.MILLISECONDS);
+  this.shouldLogError = new AtomicBoolean(true);
 
   setDaemon(true);
   setPriority(MIN_PRIORITY);
   setName("CompactionHeartbeater-" + compactionTxn.getTxnId());
 }
 
+public void shouldLogError(boolean shouldLogError) {
+  this.shouldLogError.set(shouldLogError);
+}
+
 @Override
 public void run() {
   LOG.debug("Heartbeating compaction transaction id {} for table: {}", 
compactionTxn, tableName);

Review comment:
   Yep, makes sense. I've removed one of the logs.






Issue Time Tracking
---

Worklog Id: (was: 688574)
Time Spent: 1h 40m  (was: 1.5h)

> Handle race condition between compaction txn abort/commit and heartbeater
> -
>
> Key: HIVE-25740
> URL: https://issues.apache.org/jira/browse/HIVE-25740
> Project: Hive
>  Issue Type: Bug
>Reporter: Marton Bod
>Assignee: Marton Bod
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
> The issue is the following: once the compaction worker finishes, 
> commitTxn/abortTxn is invoked first, and the heartbeater thread is only 
> interrupted after that. This can lead to a race condition where the txn has 
> already been deleted from the backend DB via commit/abort, but the 
> concurrently running heartbeater thread still attempts to send a last 
> heartbeat. The txn id is then not found in the DB, leading to 
> {{NoSuchTxnException}}.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688566=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688566
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 10:28
Start Date: 01/Dec/21 10:28
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760050115



##
File path: 
standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/txn/CompactionTxnHandler.java
##
@@ -336,9 +336,9 @@ public void markCompacted(CompactionInfo info) throws 
MetaException {
  * By filtering on minOpenTxnWaterMark, we will only cleanup after 
every transaction is committed, that could see
  * the uncompacted deltas. This way the cleaner can clean up 
everything that was made obsolete by this compaction.
  */
-String s = "SELECT \"CQ_ID\", \"CQ_DATABASE\", \"CQ_TABLE\", 
\"CQ_PARTITION\", "
-+ "\"CQ_TYPE\", \"CQ_RUN_AS\", \"CQ_HIGHEST_WRITE_ID\" FROM 
\"COMPACTION_QUEUE\" WHERE \"CQ_STATE\" = '"
-+ READY_FOR_CLEANING + "'";
+String s = "SELECT \"CQ_ID\", \"CQ_DATABASE\", \"CQ_TABLE\", 
\"CQ_PARTITION\", \"CQ_TYPE\", " 
++ "\"CQ_RUN_AS\", \"CQ_HIGHEST_WRITE_ID\", \"CQ_TBLPROPERTIES\" 
FROM \"COMPACTION_QUEUE\" WHERE \"CQ_STATE\" = '"

Review comment:
   Is this an already existing field?
   Do we show this in the sysdb tables?
   What happens with this compaction if an old VW picks this up?






Issue Time Tracking
---

Worklog Id: (was: 688566)
Time Spent: 8h  (was: 7h 50m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 8h
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688564=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688564
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 10:26
Start Date: 01/Dec/21 10:26
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760047913



##
File path: 
standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/txn/TxnHandler.java
##
@@ -1451,11 +1453,11 @@ public void commitTxn(CommitTxnRequest rqst) throws 
NoSuchTxnException, TxnAbort
 OperationType.UPDATE + "," + OperationType.DELETE + ")";
 long tempCommitId = generateTemporaryId();
 
-if (txnType.get() == TxnType.COMPACTION) {
+if (txnType == TxnType.SOFT_DELETE || txnType == TxnType.COMPACTION) {
   acquireTxnLock(stmt, false);
   commitId = getHighWaterMark(stmt);
 
-} else if (txnType.get() != TxnType.READ_ONLY && !isReplayedReplTxn) {
+} else if (txnType != TxnType.READ_ONLY && !isReplayedReplTxn) {

Review comment:
   What will happen wrt replication of these soft deletes? Do they know 
about the changes we propose?






Issue Time Tracking
---

Worklog Id: (was: 688564)
Time Spent: 7h 50m  (was: 7h 40m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 7h 50m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688561=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688561
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 10:23
Start Date: 01/Dec/21 10:23
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760045866



##
File path: 
standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/AcidEventListener.java
##
@@ -60,9 +65,30 @@ public void onDropDatabase (DropDatabaseEvent dbEvent) 
throws MetaException {
 
   @Override
   public void onDropTable(DropTableEvent tableEvent)  throws MetaException {
-if (TxnUtils.isTransactionalTable(tableEvent.getTable())) {
+Table table = tableEvent.getTable();
+
+if (TxnUtils.isTransactionalTable(table)) {
   txnHandler = getTxnHandler();
-  txnHandler.cleanupRecords(HiveObjectType.TABLE, null, 
tableEvent.getTable(), null);
+  txnHandler.cleanupRecords(HiveObjectType.TABLE, null, table, null, 
!tableEvent.getDeleteData());
+  
+  if (!tableEvent.getDeleteData()) {
+long currentTxn = 
Optional.ofNullable(tableEvent.getEnvironmentContext())
+  .map(EnvironmentContext::getProperties)
+  .map(prop -> prop.get("txnId"))
+  .map(Long::parseLong)
+  .orElse(0L);
+
+if (currentTxn > 0) {
+  CompactionRequest rqst = new CompactionRequest(table.getDbName(), 
table.getTableName(), CompactionType.MAJOR);
+  rqst.putToProperties("location", table.getSd().getLocation());

Review comment:
   Why properties, and not an optional field?
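
For readers following the hunk above: the Optional chain yields 0 whenever the context, its properties, or the "txnId" entry is missing. The standalone sketch below illustrates that behaviour. It is an assumption-laden illustration, not Hive code: Context is a hypothetical stand-in for EnvironmentContext that exposes only a nullable properties map, and currentTxn is a hypothetical helper name.

```java
import java.util.Collections;
import java.util.Map;
import java.util.Optional;

class TxnIdLookupSketch {

  /** Hypothetical stand-in for EnvironmentContext: only a nullable properties map. */
  static final class Context {
    private final Map<String, String> properties;

    Context(Map<String, String> properties) {
      this.properties = properties;
    }

    Map<String, String> getProperties() {
      return properties;
    }
  }

  /** Mirrors the chain in the hunk: any missing link collapses to the default 0. */
  static long currentTxn(Context context) {
    return Optional.ofNullable(context)
        .map(Context::getProperties)       // a null map becomes an empty Optional
        .map(props -> props.get("txnId"))  // a missing key becomes an empty Optional
        .map(Long::parseLong)              // throws NumberFormatException on malformed input
        .orElse(0L);
  }

  public static void main(String[] args) {
    System.out.println(currentTxn(null));
    System.out.println(currentTxn(new Context(null)));
    System.out.println(currentTxn(new Context(Collections.emptyMap())));
    System.out.println(currentTxn(new Context(Collections.singletonMap("txnId", "17"))));
  }
}
```

Note that a present but non-numeric "txnId" value is not defaulted: Long.parseLong would throw NumberFormatException in that case.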
   






Issue Time Tracking
---

Worklog Id: (was: 688561)
Time Spent: 7h 40m  (was: 7.5h)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 7h 40m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688560=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688560
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 10:21
Start Date: 01/Dec/21 10:21
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760043808



##
File path: 
standalone-metastore/metastore-common/src/main/java/org/apache/hadoop/hive/metastore/HiveMetaStoreClient.java
##
@@ -1772,6 +1773,22 @@ public void dropTable(String dbname, String name, 
boolean deleteData,
 dropTable(getDefaultCatalog(conf), dbname, name, deleteData, 
ignoreUnknownTab, ifPurge);
   }
 
+  @Override
+  public void dropTable(Table tbl, boolean deleteData, boolean 
ignoreUnknownTbl, boolean ifPurge) throws TException {
+EnvironmentContext context = null;
+if (ifPurge) {

Review comment:
   How will we handle purge with soft delete?






Issue Time Tracking
---

Worklog Id: (was: 688560)
Time Spent: 7.5h  (was: 7h 20m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 7.5h
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688554=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688554
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 10:13
Start Date: 01/Dec/21 10:13
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760037581



##
File path: ql/src/test/org/apache/hadoop/hive/ql/TxnCommandsBaseForTests.java
##
@@ -202,8 +208,29 @@ public int compare(int[] row1, int[] row2) {
   return 0;
 }
   }
-  protected String makeValuesClause(int[][] rows) {
-return TestTxnCommands2.makeValuesClause(rows);
+  public static String makeValuesClause(int[][] rows) {

Review comment:
   nit: empty line






Issue Time Tracking
---

Worklog Id: (was: 688554)
Time Spent: 7h 10m  (was: 7h)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 7h 10m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688558=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688558
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 10:20
Start Date: 01/Dec/21 10:20
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760042920



##
File path: 
standalone-metastore/metastore-common/src/gen/thrift/gen-cpp/hive_metastore_types.cpp
##
@@ -615,16 +615,18 @@ int _kTxnTypeValues[] = {
   TxnType::REPL_CREATED,
   TxnType::READ_ONLY,
   TxnType::COMPACTION,
-  TxnType::MATER_VIEW_REBUILD
+  TxnType::MATER_VIEW_REBUILD,
+  TxnType::SOFT_DELETE

Review comment:
   Will this cause any compatibility issues?






Issue Time Tracking
---

Worklog Id: (was: 688558)
Time Spent: 7h 20m  (was: 7h 10m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 7h 20m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Commented] (HIVE-14261) Support set/unset partition parameters

2021-12-01 Thread xiepengjie (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-14261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17451692#comment-17451692
 ] 

xiepengjie commented on HIVE-14261:
---

Yes, you're right. This PR is just about enriching the syntax of HS2. As another 
example, we want to have a lifecycle management system for partitions, so 
we need to set different parameters for different partitions.

> Support set/unset partition parameters
> --
>
> Key: HIVE-14261
> URL: https://issues.apache.org/jira/browse/HIVE-14261
> Project: Hive
>  Issue Type: New Feature
>Reporter: Pengcheng Xiong
>Assignee: Pengcheng Xiong
>Priority: Major
> Attachments: HIVE-14261.01.patch
>
>






[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688552=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688552
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 10:08
Start Date: 01/Dec/21 10:08
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760033191



##
File path: ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Cleaner.java
##
@@ -279,10 +292,14 @@ private void clean(CompactionInfo ci, long minOpenTxnGLB, 
boolean metricsEnabled
   }
 }
   }
+  
+  private interface CleanerTask {

Review comment:
   Why do we need this specific interface instead of `Callable`?






Issue Time Tracking
---

Worklog Id: (was: 688552)
Time Spent: 7h  (was: 6h 50m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 7h
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688549&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688549
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 10:05
Start Date: 01/Dec/21 10:05
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760030923



##
File path: ql/src/java/org/apache/hadoop/hive/ql/txn/compactor/Cleaner.java
##
@@ -172,22 +174,29 @@ private void clean(CompactionInfo ci, long minOpenTxnGLB, boolean metricsEnabled
   if (metricsEnabled) {
 perfLogger.perfLogBegin(CLASS_NAME, cleanerMetric);
   }
-  Table t = resolveTable(ci);
-  if (t == null) {
-// The table was dropped before we got around to cleaning it.
-LOG.info("Unable to find table " + ci.getFullTableName() + ", assuming it was dropped." +
+  String location = Optional.ofNullable(ci.properties).map(StringableMap::new)

Review comment:
   How does this behave with non-standard locations and with
   transformer-generated locations?






Issue Time Tracking
---

Worklog Id: (was: 688549)
Time Spent: 6h 50m  (was: 6h 40m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 6h 50m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688546&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688546
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 10:01
Start Date: 01/Dec/21 10:01
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760027429



##
File path: ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java
##
@@ -1334,6 +1344,20 @@ public void dropTable(String tableName, boolean ifPurge) throws HiveException {
 dropTable(names[0], names[1], true, true, ifPurge);
   }
 
+  public void dropTable(Table table, boolean ifPurge) throws HiveException {
+boolean tableWithSuffix = (HiveConf.getBoolVar(conf, ConfVars.HIVE_ACID_CREATE_TABLE_USE_SUFFIX)
+|| HiveConf.getBoolVar(conf, ConfVars.HIVE_ACID_LOCKLESS_READS_ENABLED))
+  && AcidUtils.isTransactionalTable(table)
+  && table.getSd().getLocation().matches(PATH_WITH_VERSION_ID);
+
+if (tableWithSuffix) {
+  long txnId = Optional.ofNullable(SessionState.get())
+.map(ss -> ss.getTxnMgr().getCurrentTxnId()).orElse(0L);
+  table.getTTable().setTxnId(txnId);
+}

Review comment:
   If the location does not match PATH_WITH_VERSION_ID, we might end up in a
   situation where we are trying to do a soft delete but cannot. Do we need
   to handle this?
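
   To make the concern concrete, here is a minimal sketch of the decision
   in the diff above, using only the JDK. The regex is an assumption (the
   real PATH_WITH_VERSION_ID constant may be defined differently), and the
   class, enum, and method names are hypothetical. In this sketch a table
   whose location lacks the suffix quietly falls back to a regular drop,
   which is the case the question is about.

```java
import java.util.Optional;
import java.util.regex.Pattern;

class SoftDeleteSketch {
  // ASSUMPTION: a versioned location ends in "_v" followed by digits.
  // This is illustrative only, not Hive's PATH_WITH_VERSION_ID.
  static final Pattern VERSIONED_PATH = Pattern.compile(".*_v\\d+");

  enum DropMode { SOFT_DELETE, REGULAR_DROP }

  // Soft delete is chosen only when all three conditions hold, as in the diff.
  static DropMode chooseDropMode(boolean featureEnabled, boolean transactional, String location) {
    boolean versioned = location != null && VERSIONED_PATH.matcher(location).matches();
    return (featureEnabled && transactional && versioned)
        ? DropMode.SOFT_DELETE
        : DropMode.REGULAR_DROP;
  }

  // Same shape as Optional.ofNullable(...).orElse(0L) in the diff:
  // a missing session yields transaction id 0.
  static long txnIdOrZero(Long sessionTxnId) {
    return Optional.ofNullable(sessionTxnId).orElse(0L);
  }

  public static void main(String[] args) {
    System.out.println(chooseDropMode(true, true, "/warehouse/db/tbl_v0000001"));
    System.out.println(chooseDropMode(true, true, "/warehouse/db/tbl"));
    System.out.println(txnIdOrZero(null));
  }
}
```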






Issue Time Tracking
---

Worklog Id: (was: 688546)
Time Spent: 6h 40m  (was: 6.5h)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 6h 40m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Commented] (HIVE-14261) Support set/unset partition parameters

2021-12-01 Thread Stamatis Zampetakis (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-14261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17451682#comment-17451682
 ] 

Stamatis Zampetakis commented on HIVE-14261:


[~xiepengjie] You mentioned previously that many projects use HMS exclusively 
and therefore rely on the Java snippet above. However, the change you propose 
here enriches the HS2 syntax so that the properties can be changed via HiveQL. 
How will this help projects that use only HMS? Did I misunderstand something?

> Support set/unset partition parameters
> --
>
> Key: HIVE-14261
> URL: https://issues.apache.org/jira/browse/HIVE-14261
> Project: Hive
>  Issue Type: New Feature
>Reporter: Pengcheng Xiong
>Assignee: Pengcheng Xiong
>Priority: Major
> Attachments: HIVE-14261.01.patch
>
>






[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688545&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688545
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 09:59
Start Date: 01/Dec/21 09:59
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760026327



##
File path: ql/src/java/org/apache/hadoop/hive/ql/metadata/Hive.java
##
@@ -1273,14 +1276,21 @@ public void createTable(Table tbl, boolean ifNotExists,
   principalPrivs.setRolePrivileges(grants.getRoleGrants());
   tTbl.setPrivileges(principalPrivs);
 }
+boolean createTableUseSuffix = (HiveConf.getBoolVar(conf, ConfVars.HIVE_ACID_CREATE_TABLE_USE_SUFFIX)
+|| HiveConf.getBoolVar(conf, ConfVars.HIVE_ACID_LOCKLESS_READS_ENABLED))
+  && AcidUtils.isTransactionalTable(tbl);
+
+if (createTableUseSuffix) {
+  tTbl.setTxnId(ss.getTxnMgr().getCurrentTxnId());

Review comment:
   Shall we just push this transactionId for every transactional table?






Issue Time Tracking
---

Worklog Id: (was: 688545)
Time Spent: 6.5h  (was: 6h 20m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 6.5h
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688544&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688544
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 09:58
Start Date: 01/Dec/21 09:58
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760025563



##
File path: ql/src/java/org/apache/hadoop/hive/ql/io/AcidUtils.java
##
@@ -3109,10 +3109,16 @@ public static TxnType getTxnType(Configuration conf, ASTNode tree) {
 if (tree.getFirstChildWithType(HiveParser.TOK_ALTERTABLE_COMPACT) != null){
   return TxnType.COMPACTION;
 }
+// check if soft delete
+if (tree.getToken().getType() == HiveParser.TOK_DROPTABLE 
+  && (HiveConf.getBoolVar(conf, ConfVars.HIVE_ACID_CREATE_TABLE_USE_SUFFIX)
+|| HiveConf.getBoolVar(conf, ConfVars.HIVE_ACID_LOCKLESS_READS_ENABLED))){
+  return TxnType.SOFT_DELETE;

Review comment:
   Shall we throw an exception if we try to drop a table but the location 
is not PATH_WITH_VERSION_ID?
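
   A fail-fast variant of that idea could look roughly like the sketch
   below. Everything in it is an assumption made for illustration: the
   regex stands in for PATH_WITH_VERSION_ID, the string results stand in
   for `TxnType` values, and `IllegalStateException` stands in for
   whatever exception Hive would actually throw.

```java
class DropGuardSketch {
  // ASSUMPTION: illustrative pattern only, not Hive's PATH_WITH_VERSION_ID.
  static final String VERSIONED_PATH_REGEX = ".*_v\\d+";

  // Returns the transaction type name for a DROP TABLE, or throws when a
  // soft delete is requested for a location without a version suffix.
  static String txnTypeForDrop(boolean softDeleteEnabled, String location) {
    if (!softDeleteEnabled) {
      return "DEFAULT";
    }
    if (location == null || !location.matches(VERSIONED_PATH_REGEX)) {
      throw new IllegalStateException(
          "Soft delete requested, but the location has no version suffix: " + location);
    }
    return "SOFT_DELETE";
  }

  public static void main(String[] args) {
    System.out.println(txnTypeForDrop(true, "/warehouse/db/tbl_v0000001"));
    System.out.println(txnTypeForDrop(false, "/warehouse/db/tbl"));
  }
}
```

   The alternative is to fall back to a regular drop without an error,
   which keeps older tables droppable but hides the mismatch.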






Issue Time Tracking
---

Worklog Id: (was: 688544)
Time Spent: 6h 20m  (was: 6h 10m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 6h 20m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688542&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688542
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 09:57
Start Date: 01/Dec/21 09:57
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760024012



##
File path: ql/src/java/org/apache/hadoop/hive/ql/ddl/table/drop/DropTableOperation.java
##
@@ -109,7 +109,7 @@ public int execute() throws HiveException {
 }
 
 // TODO: API w/catalog name
-context.getDb().dropTable(desc.getTableName(), desc.isPurge());
+context.getDb().dropTable(table, desc.isPurge());

Review comment:
   nit: Why is this change needed?






Issue Time Tracking
---

Worklog Id: (was: 688542)
Time Spent: 6h 10m  (was: 6h)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 6h 10m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688541&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688541
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 09:56
Start Date: 01/Dec/21 09:56
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760023405



##
File path: ql/src/java/org/apache/hadoop/hive/ql/ddl/table/drop/DropTableAnalyzer.java
##
@@ -51,7 +55,14 @@ public void analyzeInternal(ASTNode root) throws SemanticException {
 Table table = getTable(tableName, throwException);
 if (table != null) {
   inputs.add(new ReadEntity(table));
-  outputs.add(new WriteEntity(table, WriteEntity.WriteType.DDL_EXCLUSIVE));
+
+  boolean tableWithSuffix = (HiveConf.getBoolVar(conf, ConfVars.HIVE_ACID_CREATE_TABLE_USE_SUFFIX)
+  || HiveConf.getBoolVar(conf, ConfVars.HIVE_ACID_LOCKLESS_READS_ENABLED))
+&& AcidUtils.isTransactionalTable(table)
+&& table.getSd().getLocation().matches(PATH_WITH_VERSION_ID);
+
+  outputs.add(new WriteEntity(table, 

Review comment:
   It would be good to see this in the log, at least at debug level.






Issue Time Tracking
---

Worklog Id: (was: 688541)
Time Spent: 6h  (was: 5h 50m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 6h
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688540&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688540
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 09:54
Start Date: 01/Dec/21 09:54
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760021737



##
File path: itests/hive-unit/src/test/java/org/apache/hadoop/hive/ql/TestAcidOnTez.java
##
@@ -18,6 +18,7 @@
 
 package org.apache.hadoop.hive.ql;
 
+import static org.apache.hadoop.hive.ql.TxnCommandsBaseForTests.*;

Review comment:
   Is there any change in this file other than the import? If not, I would 
   revert this file.






Issue Time Tracking
---

Worklog Id: (was: 688540)
Time Spent: 5h 50m  (was: 5h 40m)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 5h 50m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-24445) Non blocking DROP TABLE implementation

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24445?focusedWorklogId=688538&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688538
 ]

ASF GitHub Bot logged work on HIVE-24445:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 09:53
Start Date: 01/Dec/21 09:53
Worklog Time Spent: 10m 
  Work Description: pvary commented on a change in pull request #2772:
URL: https://github.com/apache/hive/pull/2772#discussion_r760021077



##
File path: itests/hive-unit/src/test/java/org/apache/hadoop/hive/ql/TestAcidOnTez.java
##
@@ -18,6 +18,7 @@
 
 package org.apache.hadoop.hive.ql;
 
+import static org.apache.hadoop.hive.ql.TxnCommandsBaseForTests.*;

Review comment:
   nit: do not use wildcard imports






Issue Time Tracking
---

Worklog Id: (was: 688538)
Time Spent: 5h 40m  (was: 5.5h)

> Non blocking DROP TABLE implementation
> --
>
> Key: HIVE-24445
> URL: https://issues.apache.org/jira/browse/HIVE-24445
> Project: Hive
>  Issue Type: New Feature
>  Components: Hive
>Reporter: Zoltan Chovan
>Assignee: Denys Kuzmenko
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 5h 40m
>  Remaining Estimate: 0h
>
> Implement a way to execute drop table operations in a way that doesn't have 
> to wait for currently running read operations to be finished.





[jira] [Work logged] (HIVE-25609) Preserve XAttrs in normal file copy case.

2021-12-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25609?focusedWorklogId=688514&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-688514
 ]

ASF GitHub Bot logged work on HIVE-25609:
-

Author: ASF GitHub Bot
Created on: 01/Dec/21 09:14
Start Date: 01/Dec/21 09:14
Worklog Time Spent: 10m 
  Work Description: pkumarsinha merged pull request #2793:
URL: https://github.com/apache/hive/pull/2793


   




Issue Time Tracking
---

Worklog Id: (was: 688514)
Time Spent: 40m  (was: 0.5h)

> Preserve XAttrs in normal file copy case.
> -
>
> Key: HIVE-25609
> URL: https://issues.apache.org/jira/browse/HIVE-25609
> Project: Hive
>  Issue Type: Improvement
>Reporter: Haymant Mangla
>Assignee: Haymant Mangla
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 40m
>  Remaining Estimate: 0h
>



