[jira] [Updated] (HIVE-23603) transformDatabase() should work with changes from HIVE-22995

2020-06-09 Thread Naveen Gangam (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Naveen Gangam updated HIVE-23603:
-
Status: Patch Available  (was: Open)

> transformDatabase() should work with changes from HIVE-22995
> 
>
> Key: HIVE-23603
> URL: https://issues.apache.org/jira/browse/HIVE-23603
> Project: Hive
>  Issue Type: Sub-task
>  Components: Hive
>Reporter: Naveen Gangam
>Assignee: Naveen Gangam
>Priority: Major
> Fix For: 4.0.0
>
> Attachments: HIVE-23603.patch
>
>
> The translation layer alters the locationUri on Database based on the 
> capabilities of the client. Now that we have separate locations for managed 
> and external for database, the implementation should be adjusted to work with 
> both locations. locationUri could already be external location.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23603) transformDatabase() should work with changes from HIVE-22995

2020-06-09 Thread Naveen Gangam (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Naveen Gangam updated HIVE-23603:
-
Attachment: HIVE-23603.patch

> transformDatabase() should work with changes from HIVE-22995
> 
>
> Key: HIVE-23603
> URL: https://issues.apache.org/jira/browse/HIVE-23603
> Project: Hive
>  Issue Type: Sub-task
>  Components: Hive
>Reporter: Naveen Gangam
>Assignee: Naveen Gangam
>Priority: Major
> Fix For: 4.0.0
>
> Attachments: HIVE-23603.patch
>
>
> The translation layer alters the locationUri on Database based on the 
> capabilities of the client. Now that we have separate locations for managed 
> and external for database, the implementation should be adjusted to work with 
> both locations. locationUri could already be external location.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23659) Add Retry for Ranger Replication

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23659?focusedWorklogId=443568=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443568
 ]

ASF GitHub Bot logged work on HIVE-23659:
-

Author: ASF GitHub Bot
Created on: 10/Jun/20 05:35
Start Date: 10/Jun/20 05:35
Worklog Time Spent: 10m 
  Work Description: pkumarsinha commented on a change in pull request #1086:
URL: https://github.com/apache/hive/pull/1086#discussion_r437861515



##
File path: 
standalone-metastore/metastore-common/src/test/java/org/apache/hadoop/hive/metastore/utils/RetryTest.java
##
@@ -50,8 +56,49 @@ public Void execute() {
 };
 try {
   retriable.run();
+  Assert.fail();
 } catch (Exception e) {
   Assert.assertEquals(RuntimeException.class, e.getClass());
 }
   }
+
+  @Test
+  public void testRetryFailureWithDelay() {
+Retry retriable = new Retry(NullPointerException.class) {
+  @Override
+  public Void execute() {
+throw new RuntimeException();
+  }
+};
+try {
+  retriable.runWithDelay();
+  Assert.fail();
+} catch (Exception e) {
+  Assert.assertEquals(RuntimeException.class, e.getClass());

Review comment:
   Add one more assertion for the time taken being more than total time for 
all three attempts( i.e 180 sec)





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443568)
Time Spent: 20m  (was: 10m)

> Add Retry for Ranger Replication
> 
>
> Key: HIVE-23659
> URL: https://issues.apache.org/jira/browse/HIVE-23659
> Project: Hive
>  Issue Type: Task
>Reporter: Aasha Medhi
>Assignee: Aasha Medhi
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23659.01.patch, HIVE-23659.02.patch, 
> HIVE-23659.03.patch
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-13482) str_to_map function delimiters are regex

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-13482?focusedWorklogId=443557=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443557
 ]

ASF GitHub Bot logged work on HIVE-13482:
-

Author: ASF GitHub Bot
Created on: 10/Jun/20 04:21
Start Date: 10/Jun/20 04:21
Worklog Time Spent: 10m 
  Work Description: MichaelChirico commented on pull request #553:
URL: https://github.com/apache/hive/pull/553#issuecomment-641711222


   @kgyrtkirk thanks, I updated the string in that test, as well as updating to 
master.
   
   Is there a reason to be testing exact docstrings in the test? I wouldn't 
have expected a test to fail because of a few words changing in the docs.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443557)
Time Spent: 3.5h  (was: 3h 20m)

> str_to_map function delimiters are regex
> 
>
> Key: HIVE-13482
> URL: https://issues.apache.org/jira/browse/HIVE-13482
> Project: Hive
>  Issue Type: Improvement
>  Components: UDF
>Affects Versions: 1.0.0
>Reporter: Janick Bernet
>Assignee: Catherine K Anderson
>Priority: Minor
>  Labels: pull-request-available
> Attachments: HIVE-13482.patch, HIVE-13482.patch, HIVE.13482.01.patch
>
>  Time Spent: 3.5h
>  Remaining Estimate: 0h
>
> The two delimiters passed to the 'str_to_map' function are both interpreted 
> as regular expressions, which means that using the pipe ('|') as a delimiter 
> will lead to very unexpected results.
> This behaviour is the same for the closely related 'split' function, however 
> that is clearly documented in the function description (as per 
> https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF). 
> Either the documentation for 'str_to_map' should be updated to reflect that 
> the delimiters are both regular expressions, too, or the implementation 
> should be changed to not interpret them as regexes.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23659) Add Retry for Ranger Replication

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-23659:
--
Labels: pull-request-available  (was: )

> Add Retry for Ranger Replication
> 
>
> Key: HIVE-23659
> URL: https://issues.apache.org/jira/browse/HIVE-23659
> Project: Hive
>  Issue Type: Task
>Reporter: Aasha Medhi
>Assignee: Aasha Medhi
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23659.01.patch, HIVE-23659.02.patch, 
> HIVE-23659.03.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23659) Add Retry for Ranger Replication

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23659?focusedWorklogId=443542=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443542
 ]

ASF GitHub Bot logged work on HIVE-23659:
-

Author: ASF GitHub Bot
Created on: 10/Jun/20 02:57
Start Date: 10/Jun/20 02:57
Worklog Time Spent: 10m 
  Work Description: aasha opened a new pull request #1086:
URL: https://github.com/apache/hive/pull/1086


   ## NOTICE
   
   Please create an issue in ASF JIRA before opening a pull request,
   and you need to set the title of the pull request which starts with
   the corresponding JIRA issue number. (e.g. HIVE-X: Fix a typo in YYY)
   For more details, please see 
https://cwiki.apache.org/confluence/display/Hive/HowToContribute
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443542)
Remaining Estimate: 0h
Time Spent: 10m

> Add Retry for Ranger Replication
> 
>
> Key: HIVE-23659
> URL: https://issues.apache.org/jira/browse/HIVE-23659
> Project: Hive
>  Issue Type: Task
>Reporter: Aasha Medhi
>Assignee: Aasha Medhi
>Priority: Major
> Attachments: HIVE-23659.01.patch, HIVE-23659.02.patch, 
> HIVE-23659.03.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23597) VectorizedOrcAcidRowBatchReader::ColumnizedDeleteEventRegistry reads delete delta directories multiple times

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23597?focusedWorklogId=443532=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443532
 ]

ASF GitHub Bot logged work on HIVE-23597:
-

Author: ASF GitHub Bot
Created on: 10/Jun/20 02:11
Start Date: 10/Jun/20 02:11
Worklog Time Spent: 10m 
  Work Description: rbalamohan commented on a change in pull request #1081:
URL: https://github.com/apache/hive/pull/1081#discussion_r437820538



##
File path: 
ql/src/java/org/apache/hadoop/hive/ql/io/orc/VectorizedOrcAcidRowBatchReader.java
##
@@ -1561,24 +1572,22 @@ public int compareTo(CompressedOwid other) {
   try {
 final Path[] deleteDeltaDirs = getDeleteDeltaDirsFromSplit(orcSplit);
 if (deleteDeltaDirs.length > 0) {
+  FileSystem fs = orcSplit.getPath().getFileSystem(conf);
+  AcidOutputFormat.Options orcSplitMinMaxWriteIds =
+  AcidUtils.parseBaseOrDeltaBucketFilename(orcSplit.getPath(), 
conf);
   int totalDeleteEventCount = 0;
   for (Path deleteDeltaDir : deleteDeltaDirs) {
-FileSystem fs = deleteDeltaDir.getFileSystem(conf);
+if (!isQualifiedDeleteDeltaForSplit(orcSplitMinMaxWriteIds, 
deleteDeltaDir)) {
+  continue;
+}
 Path[] deleteDeltaFiles = 
OrcRawRecordMerger.getDeltaFiles(deleteDeltaDir, bucket,
 new OrcRawRecordMerger.Options().isCompacting(false), null);
 for (Path deleteDeltaFile : deleteDeltaFiles) {
-  // NOTE: Calling last flush length below is more for 
future-proofing when we have
-  // streaming deletes. But currently we don't support streaming 
deletes, and this can
-  // be removed if this becomes a performance issue.
-  long length = OrcAcidUtils.getLastFlushLength(fs, 
deleteDeltaFile);
+  // NOTE: When streaming deletes are supported, consider using 
OrcAcidUtils.getLastFlushLength(fs, deleteDeltaFile)
   // NOTE: A check for existence of deleteDeltaFile is required 
because we may not have
   // deletes for the bucket being taken into consideration for 
this split processing.
-  if (length != -1 && fs.exists(deleteDeltaFile)) {
-/**
- * todo: we have OrcSplit.orcTail so we should be able to get 
stats from there
- */
-Reader deleteDeltaReader = 
OrcFile.createReader(deleteDeltaFile,
-OrcFile.readerOptions(conf).maxLength(length));
+  if (fs.exists(deleteDeltaFile)) {

Review comment:
   Yes, also runtime can be further reduced if DeleteReader values are in 
memory as well.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443532)
Time Spent: 40m  (was: 0.5h)

> VectorizedOrcAcidRowBatchReader::ColumnizedDeleteEventRegistry reads delete 
> delta directories multiple times
> 
>
> Key: HIVE-23597
> URL: https://issues.apache.org/jira/browse/HIVE-23597
> Project: Hive
>  Issue Type: Improvement
>Reporter: Rajesh Balamohan
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> [https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/orc/VectorizedOrcAcidRowBatchReader.java#L1562]
> {code:java}
> try {
> final Path[] deleteDeltaDirs = getDeleteDeltaDirsFromSplit(orcSplit);
> if (deleteDeltaDirs.length > 0) {
>   int totalDeleteEventCount = 0;
>   for (Path deleteDeltaDir : deleteDeltaDirs) {
> {code}
>  
> Consider a directory layout like the following. This was created by having 
> simple set of "insert --> update --> select" queries.
>  
> {noformat}
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/base_001
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/base_002
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delete_delta_003_003_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delete_delta_004_004_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delete_delta_005_005_
> 

[jira] [Work logged] (HIVE-20291) Allow HiveStreamingConnection to receive a WriteId

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20291?focusedWorklogId=443491=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443491
 ]

ASF GitHub Bot logged work on HIVE-20291:
-

Author: ASF GitHub Bot
Created on: 10/Jun/20 00:24
Start Date: 10/Jun/20 00:24
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #406:
URL: https://github.com/apache/hive/pull/406#issuecomment-641648333


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443491)
Remaining Estimate: 0h
Time Spent: 10m

> Allow HiveStreamingConnection to receive a WriteId
> --
>
> Key: HIVE-20291
> URL: https://issues.apache.org/jira/browse/HIVE-20291
> Project: Hive
>  Issue Type: Improvement
>Reporter: Jaume M
>Assignee: Jaume M
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-20291.1.patch, HIVE-20291.10.patch, 
> HIVE-20291.11.patch, HIVE-20291.2.patch, HIVE-20291.3.patch, 
> HIVE-20291.4.patch, HIVE-20291.5.patch, HIVE-20291.6.patch, 
> HIVE-20291.7.patch, HIVE-20291.8.patch, HIVE-20291.9.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> If the writeId is received externally it won't need to open connections to 
> the metastore. It won't be able to the commit in this case as well so it must 
> be done by the entity passing the writeId.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23162) Remove swapping logic to merge joins in AST converter

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23162?focusedWorklogId=443489=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443489
 ]

ASF GitHub Bot logged work on HIVE-23162:
-

Author: ASF GitHub Bot
Created on: 10/Jun/20 00:23
Start Date: 10/Jun/20 00:23
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #978:
URL: https://github.com/apache/hive/pull/978#issuecomment-641648148


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443489)
Time Spent: 20m  (was: 10m)

> Remove swapping logic to merge joins in AST converter
> -
>
> Key: HIVE-23162
> URL: https://issues.apache.org/jira/browse/HIVE-23162
> Project: Hive
>  Issue Type: Bug
>  Components: CBO
>Reporter: Jesus Camacho Rodriguez
>Assignee: Jesus Camacho Rodriguez
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23162.01.patch, HIVE-23162.02.patch, 
> HIVE-23162.03.patch
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> In ASTConverter, there is some logic to invert join inputs so the logic to 
> merge joins in SemanticAnalyzer kicks in.
> https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/optimizer/calcite/translator/ASTConverter.java#L407
> There is a bug because inputs are swapped but the schema is not.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23266) Remove QueryWrapper from ObjectStore

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23266?focusedWorklogId=443479=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443479
 ]

ASF GitHub Bot logged work on HIVE-23266:
-

Author: ASF GitHub Bot
Created on: 10/Jun/20 00:01
Start Date: 10/Jun/20 00:01
Worklog Time Spent: 10m 
  Work Description: dengzhhu653 commented on a change in pull request #1078:
URL: https://github.com/apache/hive/pull/1078#discussion_r437786895



##
File path: 
standalone-metastore/metastore-server/src/test/java/org/apache/hadoop/hive/metastore/VerifyingObjectStore.java
##
@@ -93,9 +93,11 @@ public boolean getPartitionsByExpr(String catName, String 
dbName, String tblName
   @Override
   public List getPartitions(
   String catName, String dbName, String tableName, int maxParts) throws 
MetaException, NoSuchObjectException {
+openTransaction();
 List sqlResults = getPartitionsInternal(catName, dbName, 
tableName, maxParts, true, false);
 List ormResults = getPartitionsInternal(catName, dbName, 
tableName, maxParts, false, true);
 verifyLists(sqlResults, ormResults, Partition.class);
+commitTransaction();

Review comment:
   Got it.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443479)
Time Spent: 1h 10m  (was: 1h)

> Remove QueryWrapper from ObjectStore
> 
>
> Key: HIVE-23266
> URL: https://issues.apache.org/jira/browse/HIVE-23266
> Project: Hive
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23266.1.patch, HIVE-23266.10.patch, 
> HIVE-23266.11.patch, HIVE-23266.2.patch, HIVE-23266.2.patch, 
> HIVE-23266.3.patch, HIVE-23266.4.patch, HIVE-23266.5.patch, 
> HIVE-23266.6.patch, HIVE-23266.7.patch, HIVE-23266.8.patch, HIVE-23266.9.patch
>
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> There is currently a utility called {{QueryWrapper}} that makes a normal 
> {{Query}} auto-closable.  However, {{Query}} is now in fact already 
> auto-closing, so there is no need for this class.  In trying to remove it, I 
> realized that this wrapper was being passed around in pretty convoluted ways 
> and also it was sometimes being created in a {{try-with-resources}} block but 
> then never actually used in any way.
> Remove the {{QueryWrapper}} from the class and simplify some of the DB 
> interactions.
> https://github.com/apache/hive/blob/master/standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/ObjectStore.java#L178



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23266) Remove QueryWrapper from ObjectStore

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23266?focusedWorklogId=443480=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443480
 ]

ASF GitHub Bot logged work on HIVE-23266:
-

Author: ASF GitHub Bot
Created on: 10/Jun/20 00:01
Start Date: 10/Jun/20 00:01
Worklog Time Spent: 10m 
  Work Description: dengzhhu653 commented on a change in pull request #1078:
URL: https://github.com/apache/hive/pull/1078#discussion_r437786895



##
File path: 
standalone-metastore/metastore-server/src/test/java/org/apache/hadoop/hive/metastore/VerifyingObjectStore.java
##
@@ -93,9 +93,11 @@ public boolean getPartitionsByExpr(String catName, String 
dbName, String tblName
   @Override
   public List getPartitions(
   String catName, String dbName, String tableName, int maxParts) throws 
MetaException, NoSuchObjectException {
+openTransaction();
 List sqlResults = getPartitionsInternal(catName, dbName, 
tableName, maxParts, true, false);
 List ormResults = getPartitionsInternal(catName, dbName, 
tableName, maxParts, false, true);
 verifyLists(sqlResults, ormResults, Partition.class);
+commitTransaction();

Review comment:
   OK, got it.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443480)
Time Spent: 1h 20m  (was: 1h 10m)

> Remove QueryWrapper from ObjectStore
> 
>
> Key: HIVE-23266
> URL: https://issues.apache.org/jira/browse/HIVE-23266
> Project: Hive
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23266.1.patch, HIVE-23266.10.patch, 
> HIVE-23266.11.patch, HIVE-23266.2.patch, HIVE-23266.2.patch, 
> HIVE-23266.3.patch, HIVE-23266.4.patch, HIVE-23266.5.patch, 
> HIVE-23266.6.patch, HIVE-23266.7.patch, HIVE-23266.8.patch, HIVE-23266.9.patch
>
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> There is currently a utility called {{QueryWrapper}} that makes a normal 
> {{Query}} auto-closable.  However, {{Query}} is now in fact already 
> auto-closing, so there is no need for this class.  In trying to remove it, I 
> realized that this wrapper was being passed around in pretty convoluted ways 
> and also it was sometimes being created in a {{try-with-resources}} block but 
> then never actually used in any way.
> Remove the {{QueryWrapper}} from the class and simplify some of the DB 
> interactions.
> https://github.com/apache/hive/blob/master/standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/ObjectStore.java#L178



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23266) Remove QueryWrapper from ObjectStore

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23266?focusedWorklogId=443478=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443478
 ]

ASF GitHub Bot logged work on HIVE-23266:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 23:59
Start Date: 09/Jun/20 23:59
Worklog Time Spent: 10m 
  Work Description: dengzhhu653 commented on a change in pull request #1078:
URL: https://github.com/apache/hive/pull/1078#discussion_r437786344



##
File path: 
standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/ObjectStore.java
##
@@ -7109,22 +7121,37 @@ protected String describeResult() {
   }
 
   @Override
-  public List listPrincipalDBGrantsAll(
-  String principalName, PrincipalType principalType) {
-try (QueryWrapper queryWrapper = new QueryWrapper()) {
-  return convertDB(listPrincipalAllDBGrant(principalName, principalType, 
queryWrapper));
+  public List listPrincipalDBGrantsAll(String 
principalName, PrincipalType principalType) {
+List results = Collections.emptyList();
+try {
+  openTransaction();
+  results = convertDB(listPrincipalAllDBGrant(principalName, 
principalType));
+  commitTransaction();
+} catch (Exception e) {
+  throw new RuntimeException(e);
+} finally {
+  rollbackAndCleanup(true, null);
 }
+return results;
   }
 
   @Override
   public List listDBGrantsAll(String catName, String 
dbName) {
-return listDBGrantsAll(catName, dbName, null);
+List results = Collections.emptyList();
+try {
+  openTransaction();
+  results = listDBGrantsAll(catName, dbName, null);
+  commitTransaction();
+} catch (Exception e) {
+  throw new RuntimeException(e);
+} finally {
+  rollbackAndCleanup(true, null);

Review comment:
   Sorry for the delay, my concern is that set success to true on 
```rollbackAndCleanup``` here may make the transaction unable to rollback. 





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443478)
Time Spent: 1h  (was: 50m)

> Remove QueryWrapper from ObjectStore
> 
>
> Key: HIVE-23266
> URL: https://issues.apache.org/jira/browse/HIVE-23266
> Project: Hive
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23266.1.patch, HIVE-23266.10.patch, 
> HIVE-23266.11.patch, HIVE-23266.2.patch, HIVE-23266.2.patch, 
> HIVE-23266.3.patch, HIVE-23266.4.patch, HIVE-23266.5.patch, 
> HIVE-23266.6.patch, HIVE-23266.7.patch, HIVE-23266.8.patch, HIVE-23266.9.patch
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> There is currently a utility called {{QueryWrapper}} that makes a normal 
> {{Query}} auto-closable.  However, {{Query}} is now in fact already 
> auto-closing, so there is no need for this class.  In trying to remove it, I 
> realized that this wrapper was being passed around in pretty convoluted ways 
> and also it was sometimes being created in a {{try-with-resources}} block but 
> then never actually used in any way.
> Remove the {{QueryWrapper}} from the class and simplify some of the DB 
> interactions.
> https://github.com/apache/hive/blob/master/standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/ObjectStore.java#L178



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23266) Remove QueryWrapper from ObjectStore

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23266?focusedWorklogId=443474=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443474
 ]

ASF GitHub Bot logged work on HIVE-23266:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 23:48
Start Date: 09/Jun/20 23:48
Worklog Time Spent: 10m 
  Work Description: dengzhhu653 commented on a change in pull request #1078:
URL: https://github.com/apache/hive/pull/1078#discussion_r437783154



##
File path: 
standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/ObjectStore.java
##
@@ -7109,22 +7121,37 @@ protected String describeResult() {
   }
 
   @Override
-  public List listPrincipalDBGrantsAll(
-  String principalName, PrincipalType principalType) {
-try (QueryWrapper queryWrapper = new QueryWrapper()) {
-  return convertDB(listPrincipalAllDBGrant(principalName, principalType, 
queryWrapper));
+  public List listPrincipalDBGrantsAll(String 
principalName, PrincipalType principalType) {
+List results = Collections.emptyList();
+try {
+  openTransaction();
+  results = convertDB(listPrincipalAllDBGrant(principalName, 
principalType));
+  commitTransaction();
+} catch (Exception e) {
+  throw new RuntimeException(e);
+} finally {
+  rollbackAndCleanup(true, null);

Review comment:
   
https://github.com/apache/hive/blob/871ee8009380e1bab160b58dc378a7f668c64584/standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/ObjectStore.java#L11729-L11739
   I'm wondering that the opened transaction may be unable to be committed when 
exception happens on ```listPrincipalAllDBGrant```  or rolled back as we set 
the ```success``` to true  on the method ```rollbackAndCleanup```. Could you 
explain more about why set ```success``` to true here?  Thanks @belugabehr





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443474)
Time Spent: 50m  (was: 40m)

> Remove QueryWrapper from ObjectStore
> 
>
> Key: HIVE-23266
> URL: https://issues.apache.org/jira/browse/HIVE-23266
> Project: Hive
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23266.1.patch, HIVE-23266.10.patch, 
> HIVE-23266.11.patch, HIVE-23266.2.patch, HIVE-23266.2.patch, 
> HIVE-23266.3.patch, HIVE-23266.4.patch, HIVE-23266.5.patch, 
> HIVE-23266.6.patch, HIVE-23266.7.patch, HIVE-23266.8.patch, HIVE-23266.9.patch
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> There is currently a utility called {{QueryWrapper}} that makes a normal 
> {{Query}} auto-closable.  However, {{Query}} is now in fact already 
> auto-closing, so there is no need for this class.  In trying to remove it, I 
> realized that this wrapper was being passed around in pretty convoluted ways 
> and also it was sometimes being created in a {{try-with-resources}} block but 
> then never actually used in any way.
> Remove the {{QueryWrapper}} from the class and simplify some of the DB 
> interactions.
> https://github.com/apache/hive/blob/master/standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/ObjectStore.java#L178



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (HIVE-23568) special_character_in_tabnames_1.q is unstable

2020-06-09 Thread Stamatis Zampetakis (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stamatis Zampetakis reassigned HIVE-23568:
--

Assignee: Stamatis Zampetakis

> special_character_in_tabnames_1.q is unstable
> -
>
> Key: HIVE-23568
> URL: https://issues.apache.org/jira/browse/HIVE-23568
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Zoltan Haindrich
>Assignee: Stamatis Zampetakis
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (HIVE-23568) special_character_in_tabnames_1.q is unstable

2020-06-09 Thread Stamatis Zampetakis (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-23568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129881#comment-17129881
 ] 

Stamatis Zampetakis commented on HIVE-23568:


I will take this one. Random pick hope it does not turn out too tricky :D

> special_character_in_tabnames_1.q is unstable
> -
>
> Key: HIVE-23568
> URL: https://issues.apache.org/jira/browse/HIVE-23568
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Zoltan Haindrich
>Assignee: Stamatis Zampetakis
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23551) Acid: Update queries should treat dirCache as read-only in AcidUtils

2020-06-09 Thread Rajesh Balamohan (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rajesh Balamohan updated HIVE-23551:

Fix Version/s: 4.0.0
 Assignee: Rajesh Balamohan
   Resolution: Fixed
   Status: Resolved  (was: Patch Available)

Thanks [~pvary], [~ashutoshc]. Committed to master.

> Acid: Update queries should treat dirCache as read-only in AcidUtils
> 
>
> Key: HIVE-23551
> URL: https://issues.apache.org/jira/browse/HIVE-23551
> Project: Hive
>  Issue Type: Bug
>  Components: Hive
>Reporter: Rajesh Balamohan
>Assignee: Rajesh Balamohan
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-23551.1.patch, HIVE-23551.2.patch, 
> HIVE-23551.3.patch, HIVE-23551.4.patch, HIVE-23551.5.patch, HIVE-23551.6.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Update statements create delta folders at the end of the execution. When 
> {{insert overwrite}} followed by {{update}} is executed, it does not get any 
> open txns and ends up caching the {{base}} folder. However, the delta folder 
> which gets created at the end of the statement never makes it to the cache. 
> This creates wrong results.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23561) FIX Arrow Decimal serialization for native VectorRowBatches

2020-06-09 Thread Ashutosh Chauhan (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-23561:

Fix Version/s: 4.0.0
   Resolution: Fixed
   Status: Resolved  (was: Patch Available)

Pushed to master. Thanks, [~pgaref]

> FIX Arrow Decimal serialization for native VectorRowBatches
> ---
>
> Key: HIVE-23561
> URL: https://issues.apache.org/jira/browse/HIVE-23561
> Project: Hive
>  Issue Type: Bug
>Reporter: Panagiotis Garefalakis
>Assignee: Panagiotis Garefalakis
>Priority: Major
> Fix For: 4.0.0
>
> Attachments: HIVE-23561.01.patch
>
>
> Arrow Serializer does not properly handle Decimal primitive values when 
> selected array is used.
> In more detail, decimalValueSetter should be setting the value at 
> *arrowIndex[i]* as the value at *hiveIndex[j]*, however currently its using 
> the _same_ index!
> https://github.com/apache/hive/blob/eac25e711ea750bc52f41da7ed3c32bfe36d4f67/ql/src/java/org/apache/hadoop/hive/ql/io/arrow/Serializer.java#L926
> This works fine for cases where i == j (selected is not used) but returns 
> wrong decimal row values when i != j.
> This ticket fixes this inconsistency and adds tests with selected indexes for 
> all supported types



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (HIVE-19549) Enable TestAcidOnTez#testCtasTezUnion

2020-06-09 Thread Jesus Camacho Rodriguez (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-19549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jesus Camacho Rodriguez reassigned HIVE-19549:
--

Assignee: (was: Eugene Koifman)

> Enable TestAcidOnTez#testCtasTezUnion
> -
>
> Key: HIVE-19549
> URL: https://issues.apache.org/jira/browse/HIVE-19549
> Project: Hive
>  Issue Type: Test
>  Components: Test
>Affects Versions: 3.1.0
>Reporter: Jesus Camacho Rodriguez
>Priority: Critical
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23539) Optimize data copy during repl load operation for HDFS based staging location

2020-06-09 Thread Pravin Sinha (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Sinha updated HIVE-23539:

Attachment: HIVE-23539.01.patch

> Optimize data copy during repl load operation for HDFS based staging location
> -
>
> Key: HIVE-23539
> URL: https://issues.apache.org/jira/browse/HIVE-23539
> Project: Hive
>  Issue Type: Improvement
>Reporter: Pravin Sinha
>Assignee: Pravin Sinha
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23539.01.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23539) Optimize data copy during repl load operation for HDFS based staging location

2020-06-09 Thread Pravin Sinha (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Sinha updated HIVE-23539:

Attachment: (was: HIVE-23539.01.patch)

> Optimize data copy during repl load operation for HDFS based staging location
> -
>
> Key: HIVE-23539
> URL: https://issues.apache.org/jira/browse/HIVE-23539
> Project: Hive
>  Issue Type: Improvement
>Reporter: Pravin Sinha
>Assignee: Pravin Sinha
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23539.01.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (HIVE-23659) Add Retry for Ranger Replication

2020-06-09 Thread Pravin Sinha (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-23659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129703#comment-17129703
 ] 

Pravin Sinha commented on HIVE-23659:
-

Can you please check the link for pull request? Couldn't locate it.

> Add Retry for Ranger Replication
> 
>
> Key: HIVE-23659
> URL: https://issues.apache.org/jira/browse/HIVE-23659
> Project: Hive
>  Issue Type: Task
>Reporter: Aasha Medhi
>Assignee: Aasha Medhi
>Priority: Major
> Attachments: HIVE-23659.01.patch, HIVE-23659.02.patch, 
> HIVE-23659.03.patch
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23539) Optimize data copy during repl load operation for HDFS based staging location

2020-06-09 Thread Pravin Sinha (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Sinha updated HIVE-23539:

Attachment: HIVE-23539.01.patch

> Optimize data copy during repl load operation for HDFS based staging location
> -
>
> Key: HIVE-23539
> URL: https://issues.apache.org/jira/browse/HIVE-23539
> Project: Hive
>  Issue Type: Improvement
>Reporter: Pravin Sinha
>Assignee: Pravin Sinha
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23539.01.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23539) Optimize data copy during repl load operation for HDFS based staging location

2020-06-09 Thread Pravin Sinha (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Sinha updated HIVE-23539:

Attachment: (was: HIVE-23539.01.patch)

> Optimize data copy during repl load operation for HDFS based staging location
> -
>
> Key: HIVE-23539
> URL: https://issues.apache.org/jira/browse/HIVE-23539
> Project: Hive
>  Issue Type: Improvement
>Reporter: Pravin Sinha
>Assignee: Pravin Sinha
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23539.01.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (HIVE-23395) TestScheduledReplicationScenarios#testAcidTablesReplLoadBootstrapIncr is unstable

2020-06-09 Thread Pravin Sinha (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Sinha reassigned HIVE-23395:
---

Assignee: Pravin Sinha

> TestScheduledReplicationScenarios#testAcidTablesReplLoadBootstrapIncr is 
> unstable
> -
>
> Key: HIVE-23395
> URL: https://issues.apache.org/jira/browse/HIVE-23395
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Zoltan Haindrich
>Assignee: Pravin Sinha
>Priority: Major
>
> sometimes it fails with below exception
> {code}
> Error Message
> expected:<4> but was:<3>
> Stacktrace
> java.lang.AssertionError: expected:<4> but was:<3>
>   at org.junit.Assert.fail(Assert.java:88)
>   at org.junit.Assert.failNotEquals(Assert.java:743)
>   at org.junit.Assert.assertEquals(Assert.java:118)
>   at org.junit.Assert.assertEquals(Assert.java:555)
>   at org.junit.Assert.assertEquals(Assert.java:542)
>   at 
> org.apache.hadoop.hive.ql.parse.WarehouseInstance.verifyResults(WarehouseInstance.java:395)
>   at 
> org.apache.hadoop.hive.ql.parse.TestScheduledReplicationScenarios.testAcidTablesReplLoadBootstrapIncr(TestScheduledReplicationScenarios.java:145)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:498)
>   at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
>   at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>   at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
>   at 
> org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:55)
>   at org.junit.rules.RunRules.evaluate(RunRules.java:20)
>   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
>   at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
>   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
>   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
>   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
>   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
>   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
>   at 
> org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
>   at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
>   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:273)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:238)
>   at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:379)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:340)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.execute(ForkedBooter.java:125)
>   at 
> org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:413)
> Standard Output
> Formatting using clusterid: testClusterID
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (HIVE-22687) Query hangs indefinitely if LLAP daemon registers after the query is submitted

2020-06-09 Thread Attila Magyar (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-22687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129639#comment-17129639
 ] 

Attila Magyar commented on HIVE-22687:
--

I reproduced this issue by putting a sleep between the worker + slot node 
creation and submitting a query in between those two events.

After I applied the patch I was no longer able to reproduce it, so this seems 
to be a viable fix to me.

cc: [~ashutoshc] [~prasanth_j]

> Query hangs indefinitely if LLAP daemon registers after the query is submitted
> --
>
> Key: HIVE-22687
> URL: https://issues.apache.org/jira/browse/HIVE-22687
> Project: Hive
>  Issue Type: Bug
>  Components: llap
>Affects Versions: 3.1.0
>Reporter: Himanshu Mishra
>Assignee: Himanshu Mishra
>Priority: Major
> Attachments: HIVE-22687.01.patch, HIVE-22687.02.patch
>
>
> If a query is submitted and no LLAP daemon is running, it waits for 1 minute 
> and times out with error {{SERVICE_UNAVAILABLE}}.
> While waiting, if a new LLAP Daemon starts, then the timeout is cancelled, 
> and the tasks do not get scheduled as well. As a result, the query hangs 
> indefinitely.
> This is due to the race condition where LLAP Daemon first registers the LLAP 
> instance at {{.../workers/worker-}}, and afterwards registers 
> {{.../workers/slot-}}. In the gap between two, Tez AM gets notified of 
> worker zk node and while processing it checks if slot zk node is present, if 
> not it rejects the LLAP Daemon. Error in Tez AM is:
> {code:java}
> [INFO] [LlapScheduler] |impl.LlapZookeeperRegistryImpl|: Unknown slot for 
> 8ebfdc45-0382-4757-9416-52898885af90{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (HIVE-23601) Hive Statement Clear Statement Handle on Error

2020-06-09 Thread David Mollitor (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-23601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129625#comment-17129625
 ] 

David Mollitor commented on HIVE-23601:
---

[~ngangam] Can you please review my PR?  Thanks!

> Hive Statement Clear Statement Handle on Error
> --
>
> Key: HIVE-23601
> URL: https://issues.apache.org/jira/browse/HIVE-23601
> Project: Hive
>  Issue Type: Bug
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Minor
>  Labels: pull-request-available
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> {code:java}
>   private void closeStatementIfNeeded() throws SQLException {
> try {
>   if (stmtHandle != null) {
> TCloseOperationReq closeReq = new TCloseOperationReq(stmtHandle);
> TCloseOperationResp closeResp = client.CloseOperation(closeReq);
> Utils.verifySuccessWithInfo(closeResp.getStatus());
> stmtHandle = null;
>   }
> } catch (SQLException e) {
>   throw e;
> } catch (Exception e) {
>   throw new SQLException("Failed to close statement", "08S01", e);
> }
>   }
>  void closeClientOperation() throws SQLException {
> closeStatementIfNeeded();
> isQueryClosed = true;
> stmtHandle = null;
>   }
> {code}
> {{verifySuccessWithInfo}} throws an {{Exception}} if it finds an error code 
> and therefore leapfrogs over setting the statement handle to null (twice).  
> Probably not what is intended since the original author(s) are tried twice to 
> null it out.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23601) Hive Statement Clear Statement Handle on Error

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23601?focusedWorklogId=443281=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443281
 ]

ASF GitHub Bot logged work on HIVE-23601:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:59
Start Date: 09/Jun/20 16:59
Worklog Time Spent: 10m 
  Work Description: belugabehr commented on pull request #1054:
URL: https://github.com/apache/hive/pull/1054#issuecomment-641446810


   @nrg4878 Can you review?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443281)
Time Spent: 20m  (was: 10m)

> Hive Statement Clear Statement Handle on Error
> --
>
> Key: HIVE-23601
> URL: https://issues.apache.org/jira/browse/HIVE-23601
> Project: Hive
>  Issue Type: Bug
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Minor
>  Labels: pull-request-available
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> {code:java}
>   private void closeStatementIfNeeded() throws SQLException {
> try {
>   if (stmtHandle != null) {
> TCloseOperationReq closeReq = new TCloseOperationReq(stmtHandle);
> TCloseOperationResp closeResp = client.CloseOperation(closeReq);
> Utils.verifySuccessWithInfo(closeResp.getStatus());
> stmtHandle = null;
>   }
> } catch (SQLException e) {
>   throw e;
> } catch (Exception e) {
>   throw new SQLException("Failed to close statement", "08S01", e);
> }
>   }
>  void closeClientOperation() throws SQLException {
> closeStatementIfNeeded();
> isQueryClosed = true;
> stmtHandle = null;
>   }
> {code}
> {{verifySuccessWithInfo}} throws an {{Exception}} if it finds an error code 
> and therefore leapfrogs over setting the statement handle to null (twice).  
> Probably not what is intended since the original author(s) are tried twice to 
> null it out.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-13482) str_to_map function delimiters are regex

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-13482?focusedWorklogId=443276=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443276
 ]

ASF GitHub Bot logged work on HIVE-13482:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:52
Start Date: 09/Jun/20 16:52
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk commented on pull request #553:
URL: https://github.com/apache/hive/pull/553#issuecomment-641438796


   you will need to also update 1 test to get this in:
   ```
   mvn install -Dtest=TestMiniLlapCliDriver#testCliDriver[str_to_map] 
-Dtest.output.overwrite -Pitests
   ```



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443276)
Time Spent: 3h 20m  (was: 3h 10m)

> str_to_map function delimiters are regex
> 
>
> Key: HIVE-13482
> URL: https://issues.apache.org/jira/browse/HIVE-13482
> Project: Hive
>  Issue Type: Improvement
>  Components: UDF
>Affects Versions: 1.0.0
>Reporter: Janick Bernet
>Assignee: Catherine K Anderson
>Priority: Minor
>  Labels: pull-request-available
> Attachments: HIVE-13482.patch, HIVE-13482.patch, HIVE.13482.01.patch
>
>  Time Spent: 3h 20m
>  Remaining Estimate: 0h
>
> The two delimiters passed to the 'str_to_map' function are both interpreted 
> as regular expressions, which means that using the pipe ('|') as a delimiter 
> will lead to very unexpected results.
> This behaviour is the same for the closely related 'split' function, however 
> that is clearly documented in the function description (as per 
> https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF). 
> Either the documentation for 'str_to_map' should be updated to reflect that 
> the delimiters are both regular expressions, too, or the implementation 
> should be changed to not interpret them as regexes.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (HIVE-23668) Clean up Task for Hive Metrics

2020-06-09 Thread Aasha Medhi (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aasha Medhi reassigned HIVE-23668:
--


> Clean up Task for Hive Metrics
> --
>
> Key: HIVE-23668
> URL: https://issues.apache.org/jira/browse/HIVE-23668
> Project: Hive
>  Issue Type: Task
>Reporter: Aasha Medhi
>Assignee: Aasha Medhi
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23539) Optimize data copy during repl load operation for HDFS based staging location

2020-06-09 Thread Pravin Sinha (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Sinha updated HIVE-23539:

Status: Patch Available  (was: Open)

> Optimize data copy during repl load operation for HDFS based staging location
> -
>
> Key: HIVE-23539
> URL: https://issues.apache.org/jira/browse/HIVE-23539
> Project: Hive
>  Issue Type: Improvement
>Reporter: Pravin Sinha
>Assignee: Pravin Sinha
>Priority: Major
> Attachments: HIVE-23539.01.patch
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23539) Optimize data copy during repl load operation for HDFS based staging location

2020-06-09 Thread Pravin Sinha (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pravin Sinha updated HIVE-23539:

Attachment: HIVE-23539.01.patch

> Optimize data copy during repl load operation for HDFS based staging location
> -
>
> Key: HIVE-23539
> URL: https://issues.apache.org/jira/browse/HIVE-23539
> Project: Hive
>  Issue Type: Improvement
>Reporter: Pravin Sinha
>Assignee: Pravin Sinha
>Priority: Major
> Attachments: HIVE-23539.01.patch
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23539) Optimize data copy during repl load operation for HDFS based staging location

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23539?focusedWorklogId=443269=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443269
 ]

ASF GitHub Bot logged work on HIVE-23539:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:48
Start Date: 09/Jun/20 16:48
Worklog Time Spent: 10m 
  Work Description: pkumarsinha opened a new pull request #1084:
URL: https://github.com/apache/hive/pull/1084


   …ased staging location
   
   ## NOTICE
   
   Please create an issue in ASF JIRA before opening a pull request,
   and you need to set the title of the pull request which starts with
   the corresponding JIRA issue number. (e.g. HIVE-X: Fix a typo in YYY)
   For more details, please see 
https://cwiki.apache.org/confluence/display/Hive/HowToContribute
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443269)
Remaining Estimate: 0h
Time Spent: 10m

> Optimize data copy during repl load operation for HDFS based staging location
> -
>
> Key: HIVE-23539
> URL: https://issues.apache.org/jira/browse/HIVE-23539
> Project: Hive
>  Issue Type: Improvement
>Reporter: Pravin Sinha
>Assignee: Pravin Sinha
>Priority: Major
> Attachments: HIVE-23539.01.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23539) Optimize data copy during repl load operation for HDFS based staging location

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-23539:
--
Labels: pull-request-available  (was: )

> Optimize data copy during repl load operation for HDFS based staging location
> -
>
> Key: HIVE-23539
> URL: https://issues.apache.org/jira/browse/HIVE-23539
> Project: Hive
>  Issue Type: Improvement
>Reporter: Pravin Sinha
>Assignee: Pravin Sinha
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23539.01.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23074) SchemaTool sql script execution errors when updating the metadata's schema

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23074?focusedWorklogId=443254=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443254
 ]

ASF GitHub Bot logged work on HIVE-23074:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:47
Start Date: 09/Jun/20 16:47
Worklog Time Spent: 10m 
  Work Description: John1Tang commented on pull request #967:
URL: https://github.com/apache/hive/pull/967#issuecomment-641412921


   > I'm looking at the corresponding MySQL scripts and they do not include `IF 
NOT EXISTS` statement. If the tool is failing on a simple upgrade, that's a 
bigger problem than just ignoring some table and indexes. It shouldn't include 
anything that is already present. Need more context of the problem.
   
   Actually, this is how it all began: I was going to do Hive metadata 
migration from 3.0.0 to 3.1.2 and when I executed the "bin/schematool -dbType 
postgres -upgradeSchemaFrom 3.0.0", the new Hive 3.1.2 was not able to start. 
Then I just found that the previous Hive 3.0.0 remained an older schema, in 
other words,  the previous maintainer didn't upgrade schema from 2.3.0.  So 
when I did "bin/schematool -dbType postgres -upgradeSchemaFrom 2.3.0"  it 
reported all those "exists"/"not exists" error, so i would suggest to keep the 
idempotence of those scripts, we can use  IF NOT EXISTS to avoid those error.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443254)
Remaining Estimate: 0h  (was: 10m)
Time Spent: 1h  (was: 50m)

> SchemaTool sql script execution errors when updating the metadata's schema
> --
>
> Key: HIVE-23074
> URL: https://issues.apache.org/jira/browse/HIVE-23074
> Project: Hive
>  Issue Type: Bug
>  Components: SQL
>Affects Versions: 3.1.2
> Environment: running machine: centos7.2 
> metadata db: PostgreSQL 11.3 on x86_64-pc-linux-gnu
> hive version: upgrade from version 3.0.0 to 3.1.2
>Reporter: John1Tang
>Assignee: John1Tang
>Priority: Major
>  Labels: pull-request-available
> Fix For: 3.1.2
>
>   Original Estimate: 1h
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> SchemaTool sql script executed with conflicts on indices and columns and 
> missed " for avoiding keywords when updating the metadata's schema
> {code:java}
> bin/schematool -dbType postgres -upgradeSchemaFrom 3.0.0{code}
> went like this:
> {code:java}
> ALTER TABLE "GLOBAL_PRIVS" ADD COLUMN "AUTHORIZER" character varying(128) 
> DEFAULT NULL::character varying
> Error: ERROR: column "AUTHORIZER" of relation "GLOBAL_PRIVS" already exists 
> (state=42701,code=0){code}
> {code:java}
> ALTER TABLE COMPLETED_TXN_COMPONENTS ADD COLUMN IF NOT EXISTS 
> CTC_UPDATE_DELETE char(1) NULL
> Error: ERROR: relation "completed_txn_components" does not exist 
> (state=42P01,code=0)
> {code}
> I've already come up with a solution and created a pull request for this 
> issue.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21313) Use faster function to point to instead of copy immutable byte arrays

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21313?focusedWorklogId=443260=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443260
 ]

ASF GitHub Bot logged work on HIVE-21313:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:47
Start Date: 09/Jun/20 16:47
Worklog Time Spent: 10m 
  Work Description: belugabehr commented on a change in pull request #548:
URL: https://github.com/apache/hive/pull/548#discussion_r437556723



##
File path: 
ql/src/java/org/apache/hadoop/hive/ql/exec/vector/VectorAssignRow.java
##
@@ -466,11 +466,11 @@ private void assignRowColumn(
 if (object instanceof String) {
   String string = (String) object;
   byte[] bytes = string.getBytes();

Review comment:
   StringBytes should include the CharSet of UTF8

##
File path: 
ql/src/java/org/apache/hadoop/hive/ql/exec/vector/VectorAssignRow.java
##
@@ -778,7 +778,7 @@ private void assignConvertRowColumn(ColumnVector 
columnVector, int batchIndex,
   text = new Text();
 }
 text.set(string);
-((BytesColumnVector) columnVector).setVal(
+((BytesColumnVector) columnVector).setRef(

Review comment:
   Text here is used as a re-usable buffer (if no buffer is passed into the 
method, a new one is created.  I'm honestly not sure why the String is being 
Wrapped in a Text, at a quick glance, that seems like it can be removed, 
however, for the purposes of this PR, this is no immutable and therefore is not 
a candidate for `setRef`.

##
File path: 
ql/src/java/org/apache/hadoop/hive/ql/exec/vector/VectorAssignRow.java
##
@@ -778,7 +778,7 @@ private void assignConvertRowColumn(ColumnVector 
columnVector, int batchIndex,
   text = new Text();
 }
 text.set(string);
-((BytesColumnVector) columnVector).setVal(
+((BytesColumnVector) columnVector).setRef(

Review comment:
   Text here is used as a re-usable buffer (if no buffer is passed into the 
method, a new one is created.  I'm honestly not sure why the String is being 
Wrapped in a Text, at a quick glance, that seems like it can be removed, 
however, for the purposes of this PR, this is not immutable and therefore is 
not a candidate for `setRef`.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443260)
Time Spent: 40m  (was: 0.5h)

> Use faster function to point to instead of copy immutable byte arrays
> -
>
> Key: HIVE-21313
> URL: https://issues.apache.org/jira/browse/HIVE-21313
> Project: Hive
>  Issue Type: Improvement
>Affects Versions: All Versions
>Reporter: ZhangXin
>Assignee: ZhangXin
>Priority: Minor
>  Labels: pull-request-available
> Fix For: All Versions
>
> Attachments: HIVE-21313.patch, HIVE-21313.patch
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> In file ql/src/java/org/apache/hadoop/hive/ql/exec/vector/VectorAssignRow.java
> We may find code like this:
> ```
> Text text = (Text) convertTargetWritable;
>  if (text == null)
> {     text = new Text(); }
> text.set(string);
>  ((BytesColumnVector) columnVector).setVal(
>      batchIndex, text.getBytes(), 0, text.getLength());
> ```
>  
> Using `setVal` method can copy the bytes array generated by 
> `text.getBytes()`. This is totally unnecessary at all. Since the bytes array 
> is immutable, we can just use `setRef` method to point to the specific  byte 
> array, which will also lower the memory usage.
>  
> Pull request on Github:  https://github.com/apache/hive/pull/548
>  
>  
>  
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20771) LazyBinarySerDe fails on empty structs.

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20771?focusedWorklogId=443249=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443249
 ]

ASF GitHub Bot logged work on HIVE-20771:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:46
Start Date: 09/Jun/20 16:46
Worklog Time Spent: 10m 
  Work Description: belugabehr commented on pull request #450:
URL: https://github.com/apache/hive/pull/450#issuecomment-641405770


   @jcamachor Can you take a look at this?  Does it make sense to enable this?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443249)
Time Spent: 20m  (was: 10m)

> LazyBinarySerDe fails on empty structs.
> ---
>
> Key: HIVE-20771
> URL: https://issues.apache.org/jira/browse/HIVE-20771
> Project: Hive
>  Issue Type: Bug
>  Components: Serializers/Deserializers
>Affects Versions: 1.2.2, 2.3.2, 3.1.0
>Reporter: Clemens Valiente
>Assignee: Clemens Valiente
>Priority: Minor
>  Labels: pull-request-available
> Attachments: HIVE-20771.patch
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> {code:java}
> CREATE TABLE cvaliente.structtest AS
> SELECT named_struct();
> SHOW CREATE TABLE cvaliente.structtest;
> SELECT * FROM cvaliente.structtest ORDER BY rand();
> {code}
> The resulting schema is:
> {code:sql}
> CREATE TABLE `cvaliente.structtest`(
>   `_c0` struct<>)
> ROW FORMAT SERDE 
>   'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' 
> STORED AS INPUTFORMAT 
>   'org.apache.hadoop.mapred.TextInputFormat' 
> OUTPUTFORMAT 
>   'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
> LOCATION
>   'hdfs://nameservice1/user/cvaliente/cvaliente/structtest2'
> TBLPROPERTIES (
>   'COLUMN_STATS_ACCURATE'='true', 
>   'numFiles'='1',   
>   'numRows'='1', 
>   'rawDataSize'='0', 
>   'totalSize'='1',  
>   'transient_lastDdlTime'='1539781607');
> {code}
> Between the MAP and REDUCE phase hive serializes to LazyBinaryStruct and when 
> trying to read the same object back the {{SELECT}} query above fails:
> {code}
> 2018-10-17 14:32:02,298 [FATAL] [TezChild] |tez.ReduceRecordSource|: 
> org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while 
> processing row (tag=0) 
> {"key":{"reducesinkkey0":0.13508293503238622},"value":{"_col0":{}}}
>   at 
> org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource$GroupIterator.next(ReduceRecordSource.java:338)
>   at 
> org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource.pushRecord(ReduceRecordSource.java:259)
>   at 
> org.apache.hadoop.hive.ql.exec.tez.ReduceRecordProcessor.run(ReduceRecordProcessor.java:169)
>   at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:164)
>   at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:139)
>   at 
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370)
>   at 
> org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73)
>   at 
> org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61)
>   at java.security.AccessController.doPrivileged(Native Method)
>   at javax.security.auth.Subject.doAs(Subject.java:422)
>   at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1917)
>   at 
> org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61)
>   at 
> org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37)
>   at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36)
>   at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>   at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>   at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>   at java.lang.Thread.run(Thread.java:748)
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Error evaluating 
> VALUE._col0
>   at 
> org.apache.hadoop.hive.ql.exec.SelectOperator.processOp(SelectOperator.java:82)
>   at 
> org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource$GroupIterator.next(ReduceRecordSource.java:329)
>   ... 17 more
> Caused by: java.lang.RuntimeException: length should be positive!
>   at 
> 

[jira] [Work logged] (HIVE-23026) support add a yarn application name for tez on hiveserver2

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23026?focusedWorklogId=443250=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443250
 ]

ASF GitHub Bot logged work on HIVE-23026:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:46
Start Date: 09/Jun/20 16:46
Worklog Time Spent: 10m 
  Work Description: xiejiajun commented on a change in pull request #1080:
URL: https://github.com/apache/hive/pull/1080#discussion_r437552140



##
File path: common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
##
@@ -1500,6 +1500,9 @@ private static void populateLlapDaemonVarsSet(Set 
llapDaemonVarsSetLocal
 HIVEQUERYNAME ("hive.query.name", null,
 "This named is used by Tez to set the dag name. This name in turn will 
appear on \n" +
 "the Tez UI representing the work that was done."),
+HIVETEZJOBNAME("tez.job.name", null,

Review comment:
   We can use 'set tez.job.name=;' or '--hiveconf 
ez.job.name='  to use this function to customize Yarn 
application name when we run a sql
   

##
File path: ql/src/java/org/apache/hadoop/hive/ql/exec/tez/TezSessionState.java
##
@@ -317,7 +317,8 @@ protected void openInternal(final HiveConf conf, 
Collection additionalFi
 
 setupSessionAcls(tezConfig, conf);
 
-final TezClient session = TezClient.newBuilder("HIVE-" + sessionId, 
tezConfig)
+String tezJobName = HiveConf.getVar(conf, ConfVars.HIVETEZJOBNAME, 
sessionId);
+final TezClient session = TezClient.newBuilder(String.format("HIVE-%s", 
tezJobName), tezConfig)

Review comment:
   Thanks for the hint, I have updated the code according to your suggestion





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443250)
Time Spent: 2h 40m  (was: 2.5h)

> support add a yarn application name for tez on hiveserver2
> --
>
> Key: HIVE-23026
> URL: https://issues.apache.org/jira/browse/HIVE-23026
> Project: Hive
>  Issue Type: Improvement
>  Components: Tez
>Affects Versions: 2.0.0
>Reporter: Jake Xie
>Priority: Major
>  Labels: pull-request-available
> Fix For: 2.0.0, 3.0.0
>
>  Time Spent: 2h 40m
>  Remaining Estimate: 0h
>
> Currently tez on hiveServer2 cannot specify yarn application name, which is 
> not very convenient for locating the problem SQL, so i added a configuration 
> item to support setting tez job name



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23628) Add Default Message for Github PRs

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23628?focusedWorklogId=443243=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443243
 ]

ASF GitHub Bot logged work on HIVE-23628:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:46
Start Date: 09/Jun/20 16:46
Worklog Time Spent: 10m 
  Work Description: belugabehr merged pull request #1068:
URL: https://github.com/apache/hive/pull/1068


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443243)
Time Spent: 1h  (was: 50m)

> Add Default Message for Github PRs
> --
>
> Key: HIVE-23628
> URL: https://issues.apache.org/jira/browse/HIVE-23628
> Project: Hive
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21721) nvl function fail with NullPointerException if the two paramtype are different

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21721?focusedWorklogId=443229=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443229
 ]

ASF GitHub Bot logged work on HIVE-21721:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:44
Start Date: 09/Jun/20 16:44
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #623:
URL: https://github.com/apache/hive/pull/623#issuecomment-641143476


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443229)
Time Spent: 0.5h  (was: 20m)

> nvl function fail with NullPointerException if the two paramtype are different
> --
>
> Key: HIVE-21721
> URL: https://issues.apache.org/jira/browse/HIVE-21721
> Project: Hive
>  Issue Type: Bug
>  Components: Parser
>Affects Versions: 1.1.0
>Reporter: philipse
>Assignee: philipse
>Priority: Blocker
>  Labels: pull-request-available
> Attachments: image-2019-05-11-10-41-05-168.png, 
> image-2019-05-12-23-47-49-401.png
>
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Hi all
> when i user nvl and case when fucntion, it behaviors like the following.which 
> is weird to me, and makes me headache everytime. i need to check where the 
> nullpointexception throws out when it hadppens,So can the reasons more 
> friendly to users,
> !image-2019-05-11-10-41-05-168.png!
> {code:java}
> select nvl(cast('2019-05-10 11:11:11,111' as timestamp),'2019-05-10 
> 11:11:11,111');
> Error: Error while compiling statement: FAILED: NullPointerException null 
> (state=42000,code=4)
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20771) LazyBinarySerDe fails on empty structs.

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20771?focusedWorklogId=443232=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443232
 ]

ASF GitHub Bot logged work on HIVE-20771:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:44
Start Date: 09/Jun/20 16:44
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #450:
URL: https://github.com/apache/hive/pull/450#issuecomment-641144314


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443232)
Remaining Estimate: 0h
Time Spent: 10m

> LazyBinarySerDe fails on empty structs.
> ---
>
> Key: HIVE-20771
> URL: https://issues.apache.org/jira/browse/HIVE-20771
> Project: Hive
>  Issue Type: Bug
>  Components: Serializers/Deserializers
>Affects Versions: 1.2.2, 2.3.2, 3.1.0
>Reporter: Clemens Valiente
>Assignee: Clemens Valiente
>Priority: Minor
>  Labels: pull-request-available
> Attachments: HIVE-20771.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> {code:java}
> CREATE TABLE cvaliente.structtest AS
> SELECT named_struct();
> SHOW CREATE TABLE cvaliente.structtest;
> SELECT * FROM cvaliente.structtest ORDER BY rand();
> {code}
> The resulting schema is:
> {code:sql}
> CREATE TABLE `cvaliente.structtest`(
>   `_c0` struct<>)
> ROW FORMAT SERDE 
>   'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' 
> STORED AS INPUTFORMAT 
>   'org.apache.hadoop.mapred.TextInputFormat' 
> OUTPUTFORMAT 
>   'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
> LOCATION
>   'hdfs://nameservice1/user/cvaliente/cvaliente/structtest2'
> TBLPROPERTIES (
>   'COLUMN_STATS_ACCURATE'='true', 
>   'numFiles'='1',   
>   'numRows'='1', 
>   'rawDataSize'='0', 
>   'totalSize'='1',  
>   'transient_lastDdlTime'='1539781607');
> {code}
> Between the MAP and REDUCE phase hive serializes to LazyBinaryStruct and when 
> trying to read the same object back the {{SELECT}} query above fails:
> {code}
> 2018-10-17 14:32:02,298 [FATAL] [TezChild] |tez.ReduceRecordSource|: 
> org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while 
> processing row (tag=0) 
> {"key":{"reducesinkkey0":0.13508293503238622},"value":{"_col0":{}}}
>   at 
> org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource$GroupIterator.next(ReduceRecordSource.java:338)
>   at 
> org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource.pushRecord(ReduceRecordSource.java:259)
>   at 
> org.apache.hadoop.hive.ql.exec.tez.ReduceRecordProcessor.run(ReduceRecordProcessor.java:169)
>   at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:164)
>   at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:139)
>   at 
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370)
>   at 
> org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73)
>   at 
> org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61)
>   at java.security.AccessController.doPrivileged(Native Method)
>   at javax.security.auth.Subject.doAs(Subject.java:422)
>   at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1917)
>   at 
> org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61)
>   at 
> org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37)
>   at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36)
>   at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>   at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>   at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>   at java.lang.Thread.run(Thread.java:748)
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Error evaluating 
> VALUE._col0
>   at 
> org.apache.hadoop.hive.ql.exec.SelectOperator.processOp(SelectOperator.java:82)
>   at 
> 

[jira] [Work logged] (HIVE-21790) Bump Java to 1.8

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21790?focusedWorklogId=443230=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443230
 ]

ASF GitHub Bot logged work on HIVE-21790:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:44
Start Date: 09/Jun/20 16:44
Worklog Time Spent: 10m 
  Work Description: belugabehr opened a new pull request #1076:
URL: https://github.com/apache/hive/pull/1076


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443230)
Time Spent: 1h  (was: 50m)

> Bump Java to 1.8
> 
>
> Key: HIVE-21790
> URL: https://issues.apache.org/jira/browse/HIVE-21790
> Project: Hive
>  Issue Type: Improvement
>Affects Versions: 2.3.5
>Reporter: Fokko Driesprong
>Assignee: Fokko Driesprong
>Priority: Major
>  Labels: pull-request-available
> Attachments: 0001-HIVE-21790-Update-to-Java-1.8.patch
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> We're using Hive for reading Parquet files, but we would like to move from 
> gzip to zstandard compression. Currently, the Parquet support of Hive is old 
> because we can't upgrade since Parquet is Java 1.8+. Therefore it is a good 
> idea to upgrade Hive as well.
> GA support of Java 1.7 is also almost over: 
> https://www.oracle.com/technetwork/java/java-se-support-roadmap.html



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23026) support add a yarn application name for tez on hiveserver2

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23026?focusedWorklogId=443224=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443224
 ]

ASF GitHub Bot logged work on HIVE-23026:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:43
Start Date: 09/Jun/20 16:43
Worklog Time Spent: 10m 
  Work Description: xiejiajun opened a new pull request #1083:
URL: https://github.com/apache/hive/pull/1083


   ### What is this PR for?
   - add a configuration item to support setting tez job name
   
   ### What type of PR is it?
   - feature
   
   ### What is the Jira issue?
   - https://issues.apache.org/jira/browse/HIVE-23026



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443224)
Time Spent: 2.5h  (was: 2h 20m)

> support add a yarn application name for tez on hiveserver2
> --
>
> Key: HIVE-23026
> URL: https://issues.apache.org/jira/browse/HIVE-23026
> Project: Hive
>  Issue Type: Improvement
>  Components: Tez
>Affects Versions: 2.0.0
>Reporter: Jake Xie
>Priority: Major
>  Labels: pull-request-available
> Fix For: 2.0.0, 3.0.0
>
>  Time Spent: 2.5h
>  Remaining Estimate: 0h
>
> Currently tez on hiveServer2 cannot specify yarn application name, which is 
> not very convenient for locating the problem SQL, so i added a configuration 
> item to support setting tez job name



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21072) NPE when running partitioned CTAS statements

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21072?focusedWorklogId=443225=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443225
 ]

ASF GitHub Bot logged work on HIVE-21072:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:43
Start Date: 09/Jun/20 16:43
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #510:
URL: https://github.com/apache/hive/pull/510#issuecomment-641144069


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443225)
Time Spent: 0.5h  (was: 20m)

> NPE when running partitioned CTAS statements
> 
>
> Key: HIVE-21072
> URL: https://issues.apache.org/jira/browse/HIVE-21072
> Project: Hive
>  Issue Type: Bug
>Reporter: L. C. Hsieh
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> HIVE-20241 adds support of partitioned CTAS statements:
> {code:sql}
> CREATE TABLE partition_ctas_1 PARTITIONED BY (key) AS
> SELECT value, key FROM src where key > 200 and key < 300;{code}
>  
> However, I've tried this feature by checking out latest branch-3, and 
> encountered NPE:
> {code:java}
> hive> CREATE TABLE t PARTITIONED BY (part) AS SELECT 1 as id, "a" as part;
> FAILED: NullPointerException null
> {code}
> I also ran the query test partition_ctas.q. The test passes when using 
> TestMiniLlapLocalCliDriver, but when I go to test it with TestCliDriver 
> manually, it also throws NullPointerException:
> {code:java}
> 2018-12-25T05:58:22,221 ERROR [a96009a7-3dda-4d95-9536-e2e16d976856 main] 
> ql.Driver: FAILED: NullPointerException null
> java.lang.NullPointerException
> at 
> org.apache.hadoop.hive.ql.optimizer.GenMapRedUtils.usePartitionColumns(GenMapRedUtils.java:2103)
> at 
> org.apache.hadoop.hive.ql.optimizer.GenMapRedUtils.createMRWorkForMergingFiles(GenMapRedUtils.java:1323)
> at 
> org.apache.hadoop.hive.ql.optimizer.GenMRFileSink1.process(GenMRFileSink1.java:113)
> at 
> org.apache.hadoop.hive.ql.lib.DefaultRuleDispatcher.dispatch(DefaultRuleDispatcher.java:90)
> at 
> org.apache.hadoop.hive.ql.lib.DefaultGraphWalker.dispatchAndReturn(DefaultGraphWalker.java:105)
> at 
> org.apache.hadoop.hive.ql.parse.GenMapRedWalker.walk(GenMapRedWalker.java:54)
> at 
> org.apache.hadoop.hive.ql.parse.GenMapRedWalker.walk(GenMapRedWalker.java:65)
> at 
> org.apache.hadoop.hive.ql.parse.GenMapRedWalker.walk(GenMapRedWalker.java:65)
> at 
> org.apache.hadoop.hive.ql.parse.GenMapRedWalker.walk(GenMapRedWalker.java:65)
> at 
> org.apache.hadoop.hive.ql.lib.DefaultGraphWalker.startWalking(DefaultGraphWalker.java:120)
> at 
> org.apache.hadoop.hive.ql.parse.MapReduceCompiler.generateTaskTree(MapReduceCompiler.java:323)
> at 
> org.apache.hadoop.hive.ql.parse.TaskCompiler.compile(TaskCompiler.java:244)
> at 
> org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(SemanticAnalyzer.java:12503)
> at 
> org.apache.hadoop.hive.ql.parse.CalcitePlanner.analyzeInternal(CalcitePlanner.java:357)
> at 
> org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:285)
> at 
> org.apache.hadoop.hive.ql.parse.ExplainSemanticAnalyzer.analyzeInternal(ExplainSemanticAnalyzer.java:166)
> at 
> org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:285)
> at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:664)
> at org.apache.hadoop.hive.ql.Driver.compileInternal(Driver.java:1854)
> at org.apache.hadoop.hive.ql.Driver.compileAndRespond(Driver.java:1801)
> at org.apache.hadoop.hive.ql.Driver.compileAndRespond(Driver.java:1796)
> at 
> org.apache.hadoop.hive.ql.reexec.ReExecDriver.compileAndRespond(ReExecDriver.java:126)
> at 
> org.apache.hadoop.hive.ql.reexec.ReExecDriver.run(ReExecDriver.java:214)
> at 
> org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:239)
> at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:188)
> at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:402)
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23617) Fix FindBug issues in storage-api

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23617?focusedWorklogId=443209=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443209
 ]

ASF GitHub Bot logged work on HIVE-23617:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:42
Start Date: 09/Jun/20 16:42
Worklog Time Spent: 10m 
  Work Description: pgaref commented on a change in pull request #1063:
URL: https://github.com/apache/hive/pull/1063#discussion_r436654787



##
File path: 
storage-api/src/java/org/apache/hadoop/hive/common/io/encoded/EncodedColumnBatch.java
##
@@ -78,13 +79,21 @@ public void setIndexBaseOffset(int indexBaseOffset) {
 
 @Override
 public String toString() {
-  String bufStr = "";
+  StringBuilder sb = new StringBuilder();
   if (cacheBuffers != null) {
-for (MemoryBuffer mb : cacheBuffers) {
-  bufStr += mb.getClass().getSimpleName() + " with " + 
mb.getByteBufferRaw().remaining() + " bytes, ";
+Iterator iter = cacheBuffers.iterator();
+while (iter.hasNext()) {
+  MemoryBuffer mb = iter.next();
+  sb.append(mb.getClass().getSimpleName());
+  sb.append(" with ");
+  sb.append(mb.getByteBufferRaw().remaining());
+  sb.append(" bytes");
+  if (iter.hasNext()) {
+sb.append(", ");

Review comment:
   Tests passed so this change should be safe.

##
File path: 
storage-api/src/java/org/apache/hadoop/hive/common/io/DiskRangeList.java
##
@@ -228,6 +228,16 @@ public long getTotalLength() {
 return result;
   }
 
+  @Override
+  public int hashCode() {
+return super.hashCode();
+  }
+
+  @Override
+  public boolean equals(Object other) {
+return super.equals(other);

Review comment:
   Addressed as part of 0e40c954e3aa7af7e932b1b26091012707b80fe2
   As discussed, let's make sure we update such methods across packages.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443209)
Time Spent: 1h 40m  (was: 1.5h)

> Fix FindBug issues in storage-api
> -
>
> Key: HIVE-23617
> URL: https://issues.apache.org/jira/browse/HIVE-23617
> Project: Hive
>  Issue Type: Sub-task
>  Components: storage-api
>Reporter: Panagiotis Garefalakis
>Assignee: Panagiotis Garefalakis
>Priority: Major
>  Labels: pull-request-available
> Attachments: findbugsXml.xml
>
>  Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
> {code:java}
> mvn test-compile findbugs:findbugs -pl storage-api
> {code}
> {code:java}
> mvn findbugs:gui
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23000) Improve travis.yml

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23000?focusedWorklogId=443216=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443216
 ]

ASF GitHub Bot logged work on HIVE-23000:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:42
Start Date: 09/Jun/20 16:42
Worklog Time Spent: 10m 
  Work Description: belugabehr merged pull request #946:
URL: https://github.com/apache/hive/pull/946


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443216)
Time Spent: 50m  (was: 40m)

> Improve travis.yml
> --
>
> Key: HIVE-23000
> URL: https://issues.apache.org/jira/browse/HIVE-23000
> Project: Hive
>  Issue Type: Improvement
>Reporter: Philipp Dallig
>Assignee: Philipp Dallig
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-23000.1.patch
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> travis.yml used an [unsupported operating system|http://example.com/] (trusty 
> - 14.04), updated to bionic (18.04).
> Switched to openjdk8, because OracleJDK8 gets no updates (current version 
> [1.8.0_151|https://travis-ci.org/apache/hive/builds/660081028#L441])
> Removed a unnecessary workaround, because of a travis fix in 2017.
> {{install: skip}} is the right way to [skip the installation 
> phase|https://docs.travis-ci.com/user/job-lifecycle/#skipping-the-installation-phase]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23026) support add a yarn application name for tez on hiveserver2

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23026?focusedWorklogId=443217=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443217
 ]

ASF GitHub Bot logged work on HIVE-23026:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:42
Start Date: 09/Jun/20 16:42
Worklog Time Spent: 10m 
  Work Description: xiejiajun commented on pull request #1080:
URL: https://github.com/apache/hive/pull/1080#issuecomment-641343470







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443217)
Time Spent: 2h 20m  (was: 2h 10m)

> support add a yarn application name for tez on hiveserver2
> --
>
> Key: HIVE-23026
> URL: https://issues.apache.org/jira/browse/HIVE-23026
> Project: Hive
>  Issue Type: Improvement
>  Components: Tez
>Affects Versions: 2.0.0
>Reporter: Jake Xie
>Priority: Major
>  Labels: pull-request-available
> Fix For: 2.0.0, 3.0.0
>
>  Time Spent: 2h 20m
>  Remaining Estimate: 0h
>
> Currently tez on hiveServer2 cannot specify yarn application name, which is 
> not very convenient for locating the problem SQL, so i added a configuration 
> item to support setting tez job name



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-19430) ObjectStore.cleanNotificationEvents OutOfMemory on large number of pending events

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-19430?focusedWorklogId=443215=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443215
 ]

ASF GitHub Bot logged work on HIVE-19430:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:42
Start Date: 09/Jun/20 16:42
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #537:
URL: https://github.com/apache/hive/pull/537#issuecomment-641143909


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443215)
Time Spent: 1h 20m  (was: 1h 10m)

> ObjectStore.cleanNotificationEvents OutOfMemory on large number of pending 
> events
> -
>
> Key: HIVE-19430
> URL: https://issues.apache.org/jira/browse/HIVE-19430
> Project: Hive
>  Issue Type: Bug
>  Components: HCatalog, repl
>Affects Versions: 3.0.0
>Reporter: Thejas Nair
>Assignee: Ashutosh Bapat
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-19430.01.patch, HIVE-19430.02.patch, 
> HIVE-19430.03.patch, HIVE-19430.04.patch
>
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> If there are large number of events that haven't been cleaned up for some 
> reason, then ObjectStore.cleanNotificationEvents() can run out of memory 
> while it loads all the events to be deleted.
> It should fetch events in batches.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23026) support add a yarn application name for tez on hiveserver2

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23026?focusedWorklogId=443206=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443206
 ]

ASF GitHub Bot logged work on HIVE-23026:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:41
Start Date: 09/Jun/20 16:41
Worklog Time Spent: 10m 
  Work Description: xiejiajun opened a new pull request #1080:
URL: https://github.com/apache/hive/pull/1080


   ### What is this PR for?
   - add a configuration item to support setting tez job name
   
   ### What type of PR is it?
   - feature
   
   ### What is the Jira issue?
   - https://issues.apache.org/jira/browse/HIVE-23026



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443206)
Time Spent: 2h 10m  (was: 2h)

> support add a yarn application name for tez on hiveserver2
> --
>
> Key: HIVE-23026
> URL: https://issues.apache.org/jira/browse/HIVE-23026
> Project: Hive
>  Issue Type: Improvement
>  Components: Tez
>Affects Versions: 2.0.0
>Reporter: Jake Xie
>Priority: Major
>  Labels: pull-request-available
> Fix For: 2.0.0, 3.0.0
>
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>
> Currently tez on hiveServer2 cannot specify yarn application name, which is 
> not very convenient for locating the problem SQL, so i added a configuration 
> item to support setting tez job name



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20293) Support Replication of ACID table truncate operation

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20293?focusedWorklogId=443202=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443202
 ]

ASF GitHub Bot logged work on HIVE-20293:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:41
Start Date: 09/Jun/20 16:41
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #427:
URL: https://github.com/apache/hive/pull/427#issuecomment-641144519


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443202)
Remaining Estimate: 0h
Time Spent: 10m

> Support Replication of ACID table truncate operation
> 
>
> Key: HIVE-20293
> URL: https://issues.apache.org/jira/browse/HIVE-20293
> Project: Hive
>  Issue Type: Task
>  Components: repl, Transactions
>Affects Versions: 3.1.0, 4.0.0
>Reporter: mahesh kumar behera
>Assignee: mahesh kumar behera
>Priority: Major
>  Labels: DR, pull-request-available, replication
> Fix For: 4.0.0
>
> Attachments: HIVE-20293.01.patch, HIVE-20293.02.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Support truncate acid table replication.
> 1. Write id allocation needs to be removed



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21829) HiveMetaStore authorization issue with AlterTable and DropTable events

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21829?focusedWorklogId=443198=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443198
 ]

ASF GitHub Bot logged work on HIVE-21829:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:40
Start Date: 09/Jun/20 16:40
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #658:
URL: https://github.com/apache/hive/pull/658#issuecomment-640955905


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443198)
Time Spent: 20m  (was: 10m)

> HiveMetaStore authorization issue with AlterTable and DropTable events
> --
>
> Key: HIVE-21829
> URL: https://issues.apache.org/jira/browse/HIVE-21829
> Project: Hive
>  Issue Type: Bug
>  Components: Hive
>Affects Versions: 3.1.2
>Reporter: Ramesh Mani
>Assignee: Ramesh Mani
>Priority: Critical
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-21829.1.patch
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> With HIVE-21753, we have HiveMetastore authorizer which uses HiveAuthorizer 
> interface to authorizer metastore events.
> This jira is to fix a bug in HIVE-21753 which failed to authorizer Alter and 
> DropTable events



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23597) VectorizedOrcAcidRowBatchReader::ColumnizedDeleteEventRegistry reads delete delta directories multiple times

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23597?focusedWorklogId=443199=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443199
 ]

ASF GitHub Bot logged work on HIVE-23597:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:40
Start Date: 09/Jun/20 16:40
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk closed pull request #1081:
URL: https://github.com/apache/hive/pull/1081


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443199)
Time Spent: 0.5h  (was: 20m)

> VectorizedOrcAcidRowBatchReader::ColumnizedDeleteEventRegistry reads delete 
> delta directories multiple times
> 
>
> Key: HIVE-23597
> URL: https://issues.apache.org/jira/browse/HIVE-23597
> Project: Hive
>  Issue Type: Improvement
>Reporter: Rajesh Balamohan
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> [https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/orc/VectorizedOrcAcidRowBatchReader.java#L1562]
> {code:java}
> try {
> final Path[] deleteDeltaDirs = getDeleteDeltaDirsFromSplit(orcSplit);
> if (deleteDeltaDirs.length > 0) {
>   int totalDeleteEventCount = 0;
>   for (Path deleteDeltaDir : deleteDeltaDirs) {
> {code}
>  
> Consider a directory layout like the following. This was created by having 
> simple set of "insert --> update --> select" queries.
>  
> {noformat}
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/base_001
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/base_002
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delete_delta_003_003_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delete_delta_004_004_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delete_delta_005_005_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delete_delta_006_006_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delete_delta_007_007_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delete_delta_008_008_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delete_delta_009_009_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delete_delta_010_010_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delete_delta_011_011_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delete_delta_012_012_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delete_delta_013_013_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delta_003_003_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delta_004_004_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delta_005_005_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delta_006_006_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delta_007_007_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delta_008_008_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delta_009_009_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delta_010_010_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delta_011_011_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delta_012_012_
> /warehouse-1591131255-hl5z/warehouse/tablespace/managed/hive/sequential_update_4/delta_013_013_
>  {noformat}
>  
> Orcsplit contains all the delete delta folder information. For the directory 
> layout like this, it would create {{~12 splits}}. For every split, it 
> constructs "ColumnizedDeleteEventRegistry" in VRBAcidReader and ends up 
> 

[jira] [Work logged] (HIVE-20629) Hive incremental replication fails with events missing error if database is kept idle for more than an hour

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20629?focusedWorklogId=443186=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443186
 ]

ASF GitHub Bot logged work on HIVE-20629:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:39
Start Date: 09/Jun/20 16:39
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #436:
URL: https://github.com/apache/hive/pull/436#issuecomment-641144433


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443186)
Remaining Estimate: 0h
Time Spent: 10m

> Hive incremental replication fails with events missing error if database is 
> kept idle for more than an hour
> ---
>
> Key: HIVE-20629
> URL: https://issues.apache.org/jira/browse/HIVE-20629
> Project: Hive
>  Issue Type: Bug
>  Components: repl
>Affects Versions: 4.0.0
>Reporter: mahesh kumar behera
>Assignee: mahesh kumar behera
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-20629.01.patch, HIVE-20629.02.patch, 
> HIVE-20629.03.patch, HIVE-20629.04.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Start a source cluster with 2 database. Replicate the databases to target 
> after doing some operations. Keep taking incremental dump for both database 
> and keep replicating them to target cluster. Keep one the database idle for 
> more than 24 hrs. After 24 hrs, the incremental dump of idle database fails 
> with event missing error.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23615) Null pointers should not be dereferenced

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-23615:
--
Labels: pull-request-available  (was: )

> Null pointers should not be dereferenced
> 
>
> Key: HIVE-23615
> URL: https://issues.apache.org/jira/browse/HIVE-23615
> Project: Hive
>  Issue Type: Bug
>Reporter: kvlasov
>Assignee: kvlasov
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> [This pull request|https://github.com/apache/hive/pull/62] is focused on 
> resolving occurrences of Sonar rule squid:S2259
> Beeline: Null pointers should not be dereferenced



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21722) REPL:: logs are missing in hiveStatement.getQueryLog output during parallel execution mode.

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21722?focusedWorklogId=443192=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443192
 ]

ASF GitHub Bot logged work on HIVE-21722:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:39
Start Date: 09/Jun/20 16:39
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #625:
URL: https://github.com/apache/hive/pull/625#issuecomment-641143463


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443192)
Time Spent: 2h 50m  (was: 2h 40m)

> REPL:: logs are missing in hiveStatement.getQueryLog output during parallel 
> execution mode.
> ---
>
> Key: HIVE-21722
> URL: https://issues.apache.org/jira/browse/HIVE-21722
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2, repl
>Affects Versions: 4.0.0
>Reporter: mahesh kumar behera
>Assignee: mahesh kumar behera
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-21722.01.patch, HIVE-21722.02.patch
>
>  Time Spent: 2h 50m
>  Remaining Estimate: 0h
>
> getQueryLog only reads logs from Background thread scope. If parallel 
> execution is set to true, a new thread is created for execution and all the 
> logs added by the new thread are not added to the parent  Background thread 
> scope. In replication scope, replStateLogTasks are started in parallel mode 
> causing the logs to be skipped from getQueryLog scope. 
> There is one more issue, with the conf is not passed while creating 
> replStateLogTask during bootstrap load end. The same issue is there with 
> event load during incremental load. The incremental load end log task is 
> created with the proper config. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20751) Upgrade arrow version to 0.10.0

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20751?focusedWorklogId=443191=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443191
 ]

ASF GitHub Bot logged work on HIVE-20751:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:39
Start Date: 09/Jun/20 16:39
Worklog Time Spent: 10m 
  Work Description: belugabehr closed pull request #448:
URL: https://github.com/apache/hive/pull/448


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443191)
Time Spent: 0.5h  (was: 20m)

> Upgrade arrow version to 0.10.0
> ---
>
> Key: HIVE-20751
> URL: https://issues.apache.org/jira/browse/HIVE-20751
> Project: Hive
>  Issue Type: Bug
>  Components: llap
>Affects Versions: 3.1.0
>Reporter: Shubham Chaurasia
>Assignee: Shubham Chaurasia
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-20751.1.patch, HIVE-20751.2.patch
>
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Need to upgrade arrow version as spark is moving to arrow version 0.10.0 in 
> it's upcoming release 2.4.0



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23026) support add a yarn application name for tez on hiveserver2

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23026?focusedWorklogId=443187=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443187
 ]

ASF GitHub Bot logged work on HIVE-23026:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:39
Start Date: 09/Jun/20 16:39
Worklog Time Spent: 10m 
  Work Description: xiejiajun commented on pull request #947:
URL: https://github.com/apache/hive/pull/947#issuecomment-640768491







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443187)
Time Spent: 2h  (was: 1h 50m)

> support add a yarn application name for tez on hiveserver2
> --
>
> Key: HIVE-23026
> URL: https://issues.apache.org/jira/browse/HIVE-23026
> Project: Hive
>  Issue Type: Improvement
>  Components: Tez
>Affects Versions: 2.0.0
>Reporter: Jake Xie
>Priority: Major
>  Labels: pull-request-available
> Fix For: 2.0.0, 3.0.0
>
>  Time Spent: 2h
>  Remaining Estimate: 0h
>
> Currently tez on hiveServer2 cannot specify yarn application name, which is 
> not very convenient for locating the problem SQL, so i added a configuration 
> item to support setting tez job name



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23615) Null pointers should not be dereferenced

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23615?focusedWorklogId=443189=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443189
 ]

ASF GitHub Bot logged work on HIVE-23615:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:39
Start Date: 09/Jun/20 16:39
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk commented on pull request #62:
URL: https://github.com/apache/hive/pull/62#issuecomment-640558326


   ci fails with a wierd error - "owner can't be null" maybe it's because the 
PR comes from "unknown repository"



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443189)
Remaining Estimate: 0h
Time Spent: 10m

> Null pointers should not be dereferenced
> 
>
> Key: HIVE-23615
> URL: https://issues.apache.org/jira/browse/HIVE-23615
> Project: Hive
>  Issue Type: Bug
>Reporter: kvlasov
>Assignee: kvlasov
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> [This pull request|https://github.com/apache/hive/pull/62] is focused on 
> resolving occurrences of Sonar rule squid:S2259
> Beeline: Null pointers should not be dereferenced



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20445) Add randomized tests to TestArrowColumnarBatchSerDe

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20445?focusedWorklogId=443183=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443183
 ]

ASF GitHub Bot logged work on HIVE-20445:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:38
Start Date: 09/Jun/20 16:38
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #423:
URL: https://github.com/apache/hive/pull/423#issuecomment-641144550


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443183)
Remaining Estimate: 0h
Time Spent: 10m

> Add randomized tests to TestArrowColumnarBatchSerDe
> ---
>
> Key: HIVE-20445
> URL: https://issues.apache.org/jira/browse/HIVE-20445
> Project: Hive
>  Issue Type: Test
>Reporter: Teddy Choi
>Assignee: Teddy Choi
>Priority: Minor
>  Labels: pull-request-available
> Attachments: HIVE-20445.1.patch, HIVE-20445.2.patch, 
> HIVE-20445.2.patch, HIVE-20445.3.patch, HIVE-20445.3.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Use random schemas/data for TestArrowColumnarBatchSerDe to improve coverage 
> of arrow conversion/serialization.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21641) Llap external client returns decimal columns in different precision/scale as compared to beeline

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21641?focusedWorklogId=443180=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443180
 ]

ASF GitHub Bot logged work on HIVE-21641:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:38
Start Date: 09/Jun/20 16:38
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #601:
URL: https://github.com/apache/hive/pull/601#issuecomment-641143632


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443180)
Time Spent: 20m  (was: 10m)

> Llap external client returns decimal columns in different precision/scale as 
> compared to beeline
> 
>
> Key: HIVE-21641
> URL: https://issues.apache.org/jira/browse/HIVE-21641
> Project: Hive
>  Issue Type: Bug
>  Components: llap
>Affects Versions: 3.1.1
>Reporter: Shubham Chaurasia
>Assignee: Shubham Chaurasia
>Priority: Major
>  Labels: Branch3Candidate, pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-21641.1.patch, HIVE-21641.2.patch, 
> HIVE-21641.3.patch, HIVE-21641.4.patch, HIVE-21641.5.branch-3.patch
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Llap external client gives different precision/scale as compared to when the 
> query is executed beeline. Consider the following results:
> Query:
> {code} 
> select avg(ss_ext_sales_price) my_avg from store_sales;
> {code} 
> Result from Beeline
> {code} 
> ++
> |   my_avg   |
> ++
> | 37.8923531030581611189434  |
> ++
> {code} 
> Result from Llap external client
> {code}
> +-+
> |   my_avg|
> +-+
> |37.892353|
> +-+
> {code}
>  
> This is due to Driver(beeline path) calls 
> [analyzeInternal()|https://github.com/apache/hive/blob/rel/release-3.1.1/ql/src/java/org/apache/hadoop/hive/ql/parse/CalcitePlanner.java#L328]
>  for getting result set schema which initializes 
> [resultSchema|https://github.com/apache/hive/blob/rel/release-3.1.1/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java#L333]
>  after some more transformations as compared to llap-ext-client which calls 
> [genLogicalPlan()|https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/parse/ParseUtils.java#L561]
> Replacing {{genLogicalPlan()}} by {{analyze()}} resolves this.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21435) LlapBaseInputFormat should get task number from TASK_ATTEMPT_ID conf if present, while building SubmitWorkRequestProto

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21435?focusedWorklogId=443177=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443177
 ]

ASF GitHub Bot logged work on HIVE-21435:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:38
Start Date: 09/Jun/20 16:38
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #568:
URL: https://github.com/apache/hive/pull/568#issuecomment-641143843


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443177)
Time Spent: 20m  (was: 10m)

> LlapBaseInputFormat should get task number from TASK_ATTEMPT_ID conf if 
> present, while building SubmitWorkRequestProto
> --
>
> Key: HIVE-21435
> URL: https://issues.apache.org/jira/browse/HIVE-21435
> Project: Hive
>  Issue Type: Bug
>  Components: llap
>Reporter: Shubham Chaurasia
>Assignee: Shubham Chaurasia
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-21435.1.patch, HIVE-21435.2.patch
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Currently we always pass [split number here as 
> taskNum|https://github.com/apache/hive/blob/master/llap-ext-client/src/java/org/apache/hadoop/hive/llap/LlapBaseInputFormat.java#L185].
>  However it should be taken from TASK_ATTEMPT_ID conf if it is present.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20994) Upgrade arrow version to 0.10.0 in branch-3

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20994?focusedWorklogId=443184=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443184
 ]

ASF GitHub Bot logged work on HIVE-20994:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:38
Start Date: 09/Jun/20 16:38
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #501:
URL: https://github.com/apache/hive/pull/501#issuecomment-641144138


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443184)
Remaining Estimate: 0h
Time Spent: 10m

> Upgrade arrow version to 0.10.0 in branch-3
> ---
>
> Key: HIVE-20994
> URL: https://issues.apache.org/jira/browse/HIVE-20994
> Project: Hive
>  Issue Type: Improvement
>Reporter: Teddy Choi
>Assignee: Teddy Choi
>Priority: Major
>  Labels: pull-request-available
> Fix For: 3.2.0
>
> Attachments: HIVE-20994-branch-3.1.patch, HIVE-20994.1.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> HIVE-20751 upgraded arrow version in Hive 4, but its patch has conflicts with 
> Hive 3. It needs to be rebased.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21313) Use faster function to point to instead of copy immutable byte arrays

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21313?focusedWorklogId=443165=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443165
 ]

ASF GitHub Bot logged work on HIVE-21313:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:37
Start Date: 09/Jun/20 16:37
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #548:
URL: https://github.com/apache/hive/pull/548#issuecomment-641143884


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443165)
Time Spent: 0.5h  (was: 20m)

> Use faster function to point to instead of copy immutable byte arrays
> -
>
> Key: HIVE-21313
> URL: https://issues.apache.org/jira/browse/HIVE-21313
> Project: Hive
>  Issue Type: Improvement
>Affects Versions: All Versions
>Reporter: ZhangXin
>Assignee: ZhangXin
>Priority: Minor
>  Labels: pull-request-available
> Fix For: All Versions
>
> Attachments: HIVE-21313.patch, HIVE-21313.patch
>
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> In file ql/src/java/org/apache/hadoop/hive/ql/exec/vector/VectorAssignRow.java
> We may find code like this:
> ```
> Text text = (Text) convertTargetWritable;
>  if (text == null)
> {     text = new Text(); }
> text.set(string);
>  ((BytesColumnVector) columnVector).setVal(
>      batchIndex, text.getBytes(), 0, text.getLength());
> ```
>  
> Using `setVal` method can copy the bytes array generated by 
> `text.getBytes()`. This is totally unnecessary at all. Since the bytes array 
> is immutable, we can just use `setRef` method to point to the specific  byte 
> array, which will also lower the memory usage.
>  
> Pull request on Github:  https://github.com/apache/hive/pull/548
>  
>  
>  
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20954) Vector RS operator is not using uniform hash function for TPC-DS query 95

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20954?focusedWorklogId=443141=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443141
 ]

ASF GitHub Bot logged work on HIVE-20954:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:35
Start Date: 09/Jun/20 16:35
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #492:
URL: https://github.com/apache/hive/pull/492#issuecomment-641144189


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443141)
Remaining Estimate: 0h
Time Spent: 10m

> Vector RS operator is not using uniform hash function for TPC-DS query 95
> -
>
> Key: HIVE-20954
> URL: https://issues.apache.org/jira/browse/HIVE-20954
> Project: Hive
>  Issue Type: Improvement
>Reporter: Teddy Choi
>Assignee: Teddy Choi
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-20954.1.patch, HIVE-20954.2.patch, 
> HIVE-20954.3.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Distribution of rows is skewed in DHJ causing slowdown.
> Same RS outputs, but the two branches use VectorReduceSinkObjectHashOperator 
> and VectorReduceSinkLongOperator.
> {code}
> | Select Operator|
> |   expressions: ws_warehouse_sk (type: bigint), 
> ws_order_number (type: bigint) |
> |   outputColumnNames: _col0, _col1 |
> |   Select Vectorization:|
> |   className: VectorSelectOperator |
> |   native: true |
> |   projectedOutputColumnNums: [14, 16] |
> |   Statistics: Num rows: 7199963324 Data size: 
> 115185006696 Basic stats: COMPLETE Column stats: COMPLETE |
> |   Reduce Output Operator   |
> | key expressions: _col1 (type: bigint) |
> | sort order: +  |
> | Map-reduce partition columns: _col1 (type: bigint) |
> | Reduce Sink Vectorization: |
> | className: VectorReduceSinkObjectHashOperator |
> | keyColumnNums: [16]|
> | native: true   |
> | nativeConditionsMet: 
> hive.vectorized.execution.reducesink.new.enabled IS true, 
> hive.execution.engine tez IN [tez, spark] IS true, No PTF TopN IS true, No 
> DISTINCT columns IS true, BinarySortableSerDe for keys IS true, 
> LazyBinarySerDe for values IS true |
> | partitionColumnNums: [16] |
> | valueColumnNums: [14]  |
> ++
> |  Explain   |
> ++
> | Statistics: Num rows: 7199963324 Data size: 
> 115185006696 Basic stats: COMPLETE Column stats: COMPLETE |
> | value expressions: _col0 (type: bigint) |
> |   Reduce Output Operator   |
> | key expressions: _col1 (type: bigint) |
> | sort order: +  |
> | Map-reduce partition columns: _col1 (type: bigint) |
> | Reduce Sink Vectorization: |
> | className: VectorReduceSinkLongOperator |
> | keyColumnNums: [16]|
> | native: true   |
> | nativeConditionsMet: 
> hive.vectorized.execution.reducesink.new.enabled IS true, 
> hive.execution.engine tez IN [tez, spark] IS true, No PTF TopN IS true, No 
> DISTINCT columns IS true, BinarySortableSerDe for keys IS true, 
> LazyBinarySerDe for values IS true |
> | valueColumnNums: [14]  |
> | Statistics: Num rows: 7199963324 Data size: 
> 115185006696 Basic stats: COMPLETE Column stats: COMPLETE |
> |  

[jira] [Work logged] (HIVE-20968) Support conversion of managed to external where location set was not owned by hive

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20968?focusedWorklogId=443145=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443145
 ]

ASF GitHub Bot logged work on HIVE-20968:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:35
Start Date: 09/Jun/20 16:35
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #588:
URL: https://github.com/apache/hive/pull/588#issuecomment-641143670


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443145)
Time Spent: 2h 40m  (was: 2.5h)

> Support conversion of managed to external where location set was not owned by 
> hive
> --
>
> Key: HIVE-20968
> URL: https://issues.apache.org/jira/browse/HIVE-20968
> Project: Hive
>  Issue Type: Sub-task
>  Components: repl
>Affects Versions: 4.0.0
>Reporter: mahesh kumar behera
>Assignee: mahesh kumar behera
>Priority: Major
>  Labels: DR, pull-request-available
> Attachments: HIVE-20968.01.patch, HIVE-20968.02.patch, 
> HIVE-20968.03.patch, HIVE-20968.04.patch, HIVE-20968.05.patch
>
>  Time Spent: 2h 40m
>  Remaining Estimate: 0h
>
> As per migration rule, if a location is outside the default managed table 
> directory and the location is not owned by "hive" user, then it should be 
> converted to external table after upgrade.
>  So, the same rule is applicable for Hive replication where the data of 
> source managed table is residing outside the default warehouse directory and 
> is not owned by "hive" user.
>  During this conversion, the path should be preserved in target as well so 
> that failover works seamlessly.
>  # If the table location is out side hive warehouse and is not owned by hive, 
> then the table at target will be converted to external table. But the 
> location can not be retained , it will be retained relative to hive external 
> warehouse directory. 
>  #  As the table is not an external table at source, only those data which 
> are added using events will be replicated.
>  # The ownership of the location will be stored in the create table event and 
> will be used to compare it with strict.managed.tables.migration.owner to 
> decide if the flag in replication scope can be set. This flag is used to 
> convert the managed table to external table at target.
> Some of the scenarios needs to be blocked if the database is set for 
> replication from a cluster with non strict managed table setting to strict 
> managed table.
> 1. Block alter table / partition set location for database with source of 
> replication set for managed tables
> 2. If user manually changes the ownership of the location, hive replication 
> may go to a non recoverable state.
> 3. Block add partition if the location ownership is different than table 
> location for managed tables.
> 4. User needs to set strict.managed.tables.migration.owner along with dump 
> command (default to hive user). This value will be used during dump to decide 
> the ownership which will be used during load to decide the table type. The 
> location owner information can be stored in the events during create table. 
> The flag can be stored in replication spec. Check other such configs used in 
> upgrade tool.
> 5. Block conversion from managed to external and vice versa. Pass some flag 
> in upgrade flow to allow this conversion during upgrade flow.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-16839) Unbalanced calls to openTransaction/commitTransaction when alter the same partition concurrently

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-16839?focusedWorklogId=443130=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443130
 ]

ASF GitHub Bot logged work on HIVE-16839:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:34
Start Date: 09/Jun/20 16:34
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #484:
URL: https://github.com/apache/hive/pull/484#issuecomment-641144228


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443130)
Remaining Estimate: 0h
Time Spent: 10m

> Unbalanced calls to openTransaction/commitTransaction when alter the same 
> partition concurrently
> 
>
> Key: HIVE-16839
> URL: https://issues.apache.org/jira/browse/HIVE-16839
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 0.13.1, 1.1.0, 3.0.0, 2.3.4
>Reporter: Nemon Lou
>Assignee: Guang Yang
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0, 3.2.0
>
> Attachments: HIVE-16839.01.patch, HIVE-16839.02.patch, 
> HIVE-16839.03.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> SQL to reproduce:
> prepare:
> {noformat}
>  hdfs dfs -mkdir -p 
> /hzsrc/external/writing_dc/ltgsm/16e7a9b2-21a1-3f4f-8061-bc3395281627
>  1,create external table tb_ltgsm_external (id int) PARTITIONED by (cp 
> string,ld string);
> {noformat}
> open one beeline run these two sql many times 
> {noformat} 2,ALTER TABLE tb_ltgsm_external ADD IF NOT EXISTS PARTITION 
> (cp=2017060513,ld=2017060610);
>  3,ALTER TABLE tb_ltgsm_external PARTITION (cp=2017060513,ld=2017060610) SET 
> LOCATION 
> 'hdfs://hacluster/hzsrc/external/writing_dc/ltgsm/16e7a9b2-21a1-3f4f-8061-bc3395281627';
> {noformat}
> open another beeline to run this sql many times at the same time.
> {noformat}
>  4,ALTER TABLE tb_ltgsm_external DROP PARTITION (cp=2017060513,ld=2017060610);
> {noformat}
> MetaStore logs:
> {noformat}
> 2017-06-06 21:58:34,213 | ERROR | pool-6-thread-197 | Retrying HMSHandler 
> after 2000 ms (attempt 1 of 10) with error: 
> javax.jdo.JDOObjectNotFoundException: No such database row
> FailedObject:49[OID]org.apache.hadoop.hive.metastore.model.MStorageDescriptor
>   at 
> org.datanucleus.api.jdo.NucleusJDOHelper.getJDOExceptionForNucleusException(NucleusJDOHelper.java:475)
>   at 
> org.datanucleus.api.jdo.JDOAdapter.getApiExceptionForNucleusException(JDOAdapter.java:1158)
>   at 
> org.datanucleus.state.JDOStateManager.isLoaded(JDOStateManager.java:3231)
>   at 
> org.apache.hadoop.hive.metastore.model.MStorageDescriptor.jdoGetcd(MStorageDescriptor.java)
>   at 
> org.apache.hadoop.hive.metastore.model.MStorageDescriptor.getCD(MStorageDescriptor.java:184)
>   at 
> org.apache.hadoop.hive.metastore.ObjectStore.convertToStorageDescriptor(ObjectStore.java:1282)
>   at 
> org.apache.hadoop.hive.metastore.ObjectStore.convertToStorageDescriptor(ObjectStore.java:1299)
>   at 
> org.apache.hadoop.hive.metastore.ObjectStore.convertToPart(ObjectStore.java:1680)
>   at 
> org.apache.hadoop.hive.metastore.ObjectStore.getPartition(ObjectStore.java:1586)
>   at sun.reflect.GeneratedMethodAccessor35.invoke(Unknown Source)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:497)
>   at 
> org.apache.hadoop.hive.metastore.RawStoreProxy.invoke(RawStoreProxy.java:98)
>   at com.sun.proxy.$Proxy0.getPartition(Unknown Source)
>   at 
> org.apache.hadoop.hive.metastore.HiveAlterHandler.alterPartitions(HiveAlterHandler.java:538)
>   at 
> org.apache.hadoop.hive.metastore.HiveMetaStore$HMSHandler.alter_partitions(HiveMetaStore.java:3317)
>   at sun.reflect.GeneratedMethodAccessor37.invoke(Unknown Source)
>   at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.lang.reflect.Method.invoke(Method.java:497)
>   at 
> org.apache.hadoop.hive.metastore.RetryingHMSHandler.invoke(RetryingHMSHandler.java:102)
>   at com.sun.proxy.$Proxy12.alter_partitions(Unknown 

[jira] [Work logged] (HIVE-20359) Update protobuf version in pom.xml

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20359?focusedWorklogId=443135=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443135
 ]

ASF GitHub Bot logged work on HIVE-20359:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:34
Start Date: 09/Jun/20 16:34
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #413:
URL: https://github.com/apache/hive/pull/413#issuecomment-641144649


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443135)
Time Spent: 50m  (was: 40m)

> Update protobuf version in pom.xml
> --
>
> Key: HIVE-20359
> URL: https://issues.apache.org/jira/browse/HIVE-20359
> Project: Hive
>  Issue Type: Improvement
>  Components: Hive
>Affects Versions: 3.1.0
> Environment: ARM64 Server machines with CentOS/Debian OS.
>Reporter: Naresh Bhat
>Priority: Major
>  Labels: pull-request-available
> Fix For: 3.1.0, 4.0.0
>
> Attachments: HIVE-20359.01.branch-3.1.patch, 
> HIVE-20359.01.branch-3.1.patch, HIVE-20359.01.patch, HIVE-20359.01.patch, 
> HIVE-20359.01.patch
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> The following protoc version's used in hive is very old i.e. 2.5.0 
> [https://repo.maven.apache.org/maven2/com/google/protobuf/protoc/] .  The 
> v2.5.0 does not have aarch64 support. But the AArch64 support started from 
> v3.5.0 on-words in google's protobuf project release.  The hive release 
> version branch v3/3.0/3.1/master branch build fails on AArch64 because of old 
> protoc version 2.5.0, in which there is no AArch64 support/exe available.
> After I replaced to latest available protoc version 3.6.1 the hive master 
> branch pom.xml files, The hive compiled without any issues on ARM64 machine.  
> This Jira ticket is used to propose a patch on Hive mailing list 
> [https://git.linaro.org/people/naresh.bhat/apache/hive.git/]  to update 
> pom.xml files protobuf version.  I will also cherry-pick patches for other 
> branches v3/3.0/3.1, and create a pull request.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21011) Upgrade MurmurHash 2.0 to 3.0 in vectorized map and reduce operators

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21011?focusedWorklogId=443133=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443133
 ]

ASF GitHub Bot logged work on HIVE-21011:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:34
Start Date: 09/Jun/20 16:34
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #503:
URL: https://github.com/apache/hive/pull/503#issuecomment-641144118


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443133)
Remaining Estimate: 0h
Time Spent: 10m

> Upgrade MurmurHash 2.0 to 3.0 in vectorized map and reduce operators
> 
>
> Key: HIVE-21011
> URL: https://issues.apache.org/jira/browse/HIVE-21011
> Project: Hive
>  Issue Type: Improvement
>Reporter: Teddy Choi
>Assignee: Teddy Choi
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-21011.1.patch, HIVE-21011.2.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> HIVE-20873 improved map join performance by using MurmurHash 3.0. However, 
> there's more operators that can use it. VectorMapJoinCommonOperator and 
> VectorReduceSinkUniformHashOperator use MurmurHash 2.0, so it can be upgraded 
> to MurmurHash 3.0.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-22479) Fix typo in GenericUDF

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-22479?focusedWorklogId=443132=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443132
 ]

ASF GitHub Bot logged work on HIVE-22479:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:34
Start Date: 09/Jun/20 16:34
Worklog Time Spent: 10m 
  Work Description: belugabehr commented on pull request #842:
URL: https://github.com/apache/hive/pull/842#issuecomment-640753814







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443132)
Time Spent: 40m  (was: 0.5h)

> Fix typo in GenericUDF
> --
>
> Key: HIVE-22479
> URL: https://issues.apache.org/jira/browse/HIVE-22479
> Project: Hive
>  Issue Type: Improvement
>Reporter: Wanqiang Ji
>Assignee: Wanqiang Ji
>Priority: Major
>  Labels: newbie, pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-22479.001.patch
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> `Defered` corrected to `Deferred`



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20751) Upgrade arrow version to 0.10.0

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20751?focusedWorklogId=443118=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443118
 ]

ASF GitHub Bot logged work on HIVE-20751:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:33
Start Date: 09/Jun/20 16:33
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #448:
URL: https://github.com/apache/hive/pull/448#issuecomment-641144348


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443118)
Time Spent: 20m  (was: 10m)

> Upgrade arrow version to 0.10.0
> ---
>
> Key: HIVE-20751
> URL: https://issues.apache.org/jira/browse/HIVE-20751
> Project: Hive
>  Issue Type: Bug
>  Components: llap
>Affects Versions: 3.1.0
>Reporter: Shubham Chaurasia
>Assignee: Shubham Chaurasia
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-20751.1.patch, HIVE-20751.2.patch
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Need to upgrade arrow version as spark is moving to arrow version 0.10.0 in 
> it's upcoming release 2.4.0



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (HIVE-23628) Add Default Message for Github PRs

2020-06-09 Thread David Mollitor (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

David Mollitor resolved HIVE-23628.
---
Fix Version/s: 4.0.0
   Resolution: Fixed

Pushed to master.  Thanks [~kgyrtkirk]!

> Add Default Message for Github PRs
> --
>
> Key: HIVE-23628
> URL: https://issues.apache.org/jira/browse/HIVE-23628
> Project: Hive
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21788) Support replication from haddop-2 (hive 3.0 and beelow) on-prem cluster to hadoop-3 (hive 4 and above) cloud cluster

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21788?focusedWorklogId=443120=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443120
 ]

ASF GitHub Bot logged work on HIVE-21788:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:33
Start Date: 09/Jun/20 16:33
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #666:
URL: https://github.com/apache/hive/pull/666#issuecomment-640955875


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443120)
Time Spent: 20m  (was: 10m)

> Support replication from haddop-2 (hive 3.0 and beelow) on-prem cluster to 
> hadoop-3 (hive 4 and above) cloud cluster
> 
>
> Key: HIVE-21788
> URL: https://issues.apache.org/jira/browse/HIVE-21788
> Project: Hive
>  Issue Type: Task
>  Components: HiveServer2, repl
>Affects Versions: 4.0.0
>Reporter: mahesh kumar behera
>Assignee: mahesh kumar behera
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-21788.01.patch
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> In case of replication to cloud both dump and load are executed in the source 
> cluster. This push based replication is done to avoid computation at target 
> cloud cluster. In case in the source cluster, strict managed table is not set 
> to true the tables will be non acid. So during replication to a cluster with 
> strict managed table, migration logic same as upgrade tool has to be applied 
> on the replicated data. This migration logic is implemented only in hive4.0. 
> So it's required that a hive 4.0 instance started at the source cluster. If 
> the source cluster has hadoop-2 installation, hive4 has to be built with 
> hadoop-2 and necessary changes are required in the pom files and the shim 
> files.
> 1. Change the pom.xml files to accept a profile for hadoop-2. If hadoop-2 
> profile is set, the hadoop version should be set accordingly to hadoop-2.
> 2. In shim creare a new file for hadoop-2. Based on the profile the 
> respective file will be included in the build.
> 3. Changed artifactId hadoop-hdfs-client to hadoop-client as in hadoop-2 the 
> jars are stored under hadoop-client folder.
>  
>  
> Command to enable hadop-2 dependency  —  mvn clean install package 
> -DskipTests  -Pdist -pl '!standalone-metastore, !llap-common, !llap-client, 
> !llap-ext-client, !llap-tez, !llap-server, !hbase-handler, !service, !hplsql, 
> !kryo-registrator' -Phadoop-2.7
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20631) Hive returns 20011 error code for re-triable error

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20631?focusedWorklogId=443119=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443119
 ]

ASF GitHub Bot logged work on HIVE-20631:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:33
Start Date: 09/Jun/20 16:33
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #437:
URL: https://github.com/apache/hive/pull/437#issuecomment-641144413


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443119)
Remaining Estimate: 0h
Time Spent: 10m

> Hive returns 20011 error code for re-triable error
> --
>
> Key: HIVE-20631
> URL: https://issues.apache.org/jira/browse/HIVE-20631
> Project: Hive
>  Issue Type: Bug
>  Components: repl
>Affects Versions: 4.0.0
>Reporter: mahesh kumar behera
>Assignee: mahesh kumar behera
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-20631.01.patch, HIVE-20631.02.patch, 
> HIVE-20631.03.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> In case of network issue .repl load is returning non retry-able error code. 
> The scenario is 
> 1. While copying the file, repl load found that source is not reachable and 
> went for copy retry.
> 2. While retying, getting file checksum failed due to network issue and thus 
> its assumed that the source file is not present. So in the next retry copy is 
> tried from cm path.
> 3. In the next retry, network is recovered and it in cm path no file was 
> found. This will cause return of non retry-able error.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20419) Vectorization: Prevent mutation of VectorPartitionDesc after being used in a hashmap key

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20419?focusedWorklogId=443113=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443113
 ]

ASF GitHub Bot logged work on HIVE-20419:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:32
Start Date: 09/Jun/20 16:32
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #518:
URL: https://github.com/apache/hive/pull/518#issuecomment-641143996


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443113)
Remaining Estimate: 0h
Time Spent: 10m

> Vectorization: Prevent mutation of VectorPartitionDesc after being used in a 
> hashmap key
> 
>
> Key: HIVE-20419
> URL: https://issues.apache.org/jira/browse/HIVE-20419
> Project: Hive
>  Issue Type: Bug
>  Components: Vectorization
>Reporter: Gopal Vijayaraghavan
>Assignee: Teddy Choi
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-20419.1.patch, HIVE-20419.2.patch, 
> HIVE-20419.4.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> This is going into the loop because the VectorPartitionDesc is modified after 
> it is used in the HashMap key - resulting in a hashcode & equals modification 
> after it has been placed in the hashmap.
> {code}
> HiveServer2-Background-Pool: Thread-6049 State: RUNNABLE CPU usage on sample: 
> 621ms
> java.util.HashMap$TreeNode.find(int, Object, Class) HashMap.java:1869  <7 
> recursive calls>
> java.util.HashMap$TreeNode.putTreeVal(HashMap, HashMap$Node[], int, Object, 
> Object) HashMap.java:1989
> java.util.HashMap.putVal(int, Object, Object, boolean, boolean) 
> HashMap.java:637
> java.util.HashMap.put(Object, Object) HashMap.java:611
> org.apache.hadoop.hive.ql.optimizer.physical.Vectorizer$VectorizationDispatcher.addVectorPartitionDesc(PartitionDesc,
>  VectorPartitionDesc, Map) Vectorizer.java:1272
> org.apache.hadoop.hive.ql.optimizer.physical.Vectorizer$VectorizationDispatcher.verifyAndSetVectorPartDesc(PartitionDesc,
>  boolean, List, Set, Map, Set, ArrayList, Set) Vectorizer.java:1323
> org.apache.hadoop.hive.ql.optimizer.physical.Vectorizer$VectorizationDispatcher.validateInputFormatAndSchemaEvolution(MapWork,
>  String, TableScanOperator, Vectorizer$VectorTaskColumnInfo) 
> Vectorizer.java:1654
> org.apache.hadoop.hive.ql.optimizer.physical.Vectorizer$VectorizationDispatcher.validateAndVectorizeMapWork(MapWork,
>  Vectorizer$VectorTaskColumnInfo, boolean) Vectorizer.java:1865
> org.apache.hadoop.hive.ql.optimizer.physical.Vectorizer$VectorizationDispatcher.convertMapWork(MapWork,
>  boolean) Vectorizer.java:1109
> org.apache.hadoop.hive.ql.optimizer.physical.Vectorizer$VectorizationDispatcher.dispatch(Node,
>  Stack, Object[]) Vectorizer.java:961
> org.apache.hadoop.hive.ql.lib.TaskGraphWalker.dispatch(Node, Stack, 
> TaskGraphWalker$TaskGraphWalkerContext) TaskGraphWalker.java:111
> org.apache.hadoop.hive.ql.lib.TaskGraphWalker.walk(Node) 
> TaskGraphWalker.java:180
> org.apache.hadoop.hive.ql.lib.TaskGraphWalker.startWalking(Collection, 
> HashMap) TaskGraphWalker.java:125
> org.apache.hadoop.hive.ql.optimizer.physical.Vectorizer.resolve(PhysicalContext)
>  Vectorizer.java:2442
> org.apache.hadoop.hive.ql.parse.TezCompiler.optimizeTaskPlan(List, 
> ParseContext, Context) TezCompiler.java:717
> org.apache.hadoop.hive.ql.parse.TaskCompiler.compile(ParseContext, List, 
> HashSet, HashSet) TaskCompiler.java:258
> org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(ASTNode, 
> SemanticAnalyzer$PlannerContextFactory) SemanticAnalyzer.java:12443
> org.apache.hadoop.hive.ql.parse.CalcitePlanner.analyzeInternal(ASTNode) 
> CalcitePlanner.java:358
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23621) Enforce ASF headers on source files

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23621?focusedWorklogId=443104=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443104
 ]

ASF GitHub Bot logged work on HIVE-23621:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:32
Start Date: 09/Jun/20 16:32
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk commented on pull request #1062:
URL: https://github.com/apache/hive/pull/1062#issuecomment-640552681


   @mustafaiman yeah; I was not thinking about that - I think the best would be 
if we would stop changing the parent to be outside of the project (storage-api 
and standalong-metastore has external parent) - this "parenting" stuff comes 
back for checkstyle/findbugs/rat-chekc/etc we should decide what we want to do 
with this
   I would rather not change the parent in this patch (silently) - but try to 
reach a conclusion with the community



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443104)
Time Spent: 40m  (was: 0.5h)

> Enforce ASF headers on source files
> ---
>
> Key: HIVE-23621
> URL: https://issues.apache.org/jira/browse/HIVE-23621
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 40m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-13482) str_to_map function delimiters are regex

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-13482?focusedWorklogId=443109=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443109
 ]

ASF GitHub Bot logged work on HIVE-13482:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:32
Start Date: 09/Jun/20 16:32
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk commented on pull request #553:
URL: https://github.com/apache/hive/pull/553#issuecomment-641251896


   I think probably an open/close should also trigger a new testrun...let's see



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443109)
Time Spent: 3h 10m  (was: 3h)

> str_to_map function delimiters are regex
> 
>
> Key: HIVE-13482
> URL: https://issues.apache.org/jira/browse/HIVE-13482
> Project: Hive
>  Issue Type: Improvement
>  Components: UDF
>Affects Versions: 1.0.0
>Reporter: Janick Bernet
>Assignee: Catherine K Anderson
>Priority: Minor
>  Labels: pull-request-available
> Attachments: HIVE-13482.patch, HIVE-13482.patch, HIVE.13482.01.patch
>
>  Time Spent: 3h 10m
>  Remaining Estimate: 0h
>
> The two delimiters passed to the 'str_to_map' function are both interpreted 
> as regular expressions, which means that using the pipe ('|') as a delimiter 
> will lead to very unexpected results.
> This behaviour is the same for the closely related 'split' function, however 
> that is clearly documented in the function description (as per 
> https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF). 
> Either the documentation for 'str_to_map' should be updated to reflect that 
> the delimiters are both regular expressions, too, or the implementation 
> should be changed to not interpret them as regexes.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23629) Enforce clean findbugs in PRs

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23629?focusedWorklogId=443096=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443096
 ]

ASF GitHub Bot logged work on HIVE-23629:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:31
Start Date: 09/Jun/20 16:31
Worklog Time Spent: 10m 
  Work Description: mustafaiman commented on pull request #1069:
URL: https://github.com/apache/hive/pull/1069#issuecomment-640740432


   > another approach which could be considered: publish the findbugs report in 
the build - I think it will be there even if the build fails
   > some of this might work:
   > https://www.jenkins.io/doc/pipeline/steps/htmlpublisher/
   > https://www.jenkins.io/blog/2017/02/10/declarative-html-publisher/
   > I think it could be published in a "finally" block somewhere around 
compile...
   
   If we are aiming to enforce clean findbugs in every pull request, I dont see 
much benefit in having a findbugs report. It is going to be empty all the time 
apart from the bugs you might introduce in your patch.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443096)
Time Spent: 1.5h  (was: 1h 20m)

> Enforce clean findbugs in PRs
> -
>
> Key: HIVE-23629
> URL: https://issues.apache.org/jira/browse/HIVE-23629
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Mustafa Iman
>Assignee: Mustafa Iman
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> We should start enforcing clean findbugs reports as soon as we fix a module. 
> Otherwise, it will continue collecting findbugs errors. We can add a stage to 
> Jenkins pipeline to enforce findbugs and possibly other checks. It will 
> selectively run findbugs for specified sub modules. Eventually we can get rid 
> of the list and enable findbugs for the whole project.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20685) XmlOutputFormat does not escape all required symbols in case of attribte/text

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20685?focusedWorklogId=443100=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443100
 ]

ASF GitHub Bot logged work on HIVE-20685:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:31
Start Date: 09/Jun/20 16:31
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #444:
URL: https://github.com/apache/hive/pull/444#issuecomment-641144363


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443100)
Remaining Estimate: 0h
Time Spent: 10m

> XmlOutputFormat does not escape all required symbols in case of attribte/text
> -
>
> Key: HIVE-20685
> URL: https://issues.apache.org/jira/browse/HIVE-20685
> Project: Hive
>  Issue Type: Bug
>Reporter: Dmitrii Khekk
>Assignee: Dmitrii Khekk
>Priority: Minor
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Currently it escapes only " and < while & also must be escaped in case of 
> attribute/text based on [1].
> At the same time both " and ' are not required to be encoded for case of 
> element text while for xml attribute one of them should be depending on 
> attribute quote type.
> Also > must be escaped if there is a sequence ]]> inside text or attribute 
> value which is not a part of CDATA
> [1] https://www.w3.org/TR/2008/REC-xml-20081126/#syntax



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23516) Store hive replication policy execution metrics in the relational DB

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23516?focusedWorklogId=443086=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443086
 ]

ASF GitHub Bot logged work on HIVE-23516:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:30
Start Date: 09/Jun/20 16:30
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk closed pull request #1044:
URL: https://github.com/apache/hive/pull/1044


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443086)
Time Spent: 9h 20m  (was: 9h 10m)

> Store hive replication policy execution metrics in the relational DB
> 
>
> Key: HIVE-23516
> URL: https://issues.apache.org/jira/browse/HIVE-23516
> Project: Hive
>  Issue Type: Task
>Reporter: Aasha Medhi
>Assignee: Aasha Medhi
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23516.01.patch, HIVE-23516.02.patch, 
> HIVE-23516.03.patch, HIVE-23516.04.patch, HIVE-23516.05.patch, 
> HIVE-23516.06.patch, HIVE-23516.07.patch, HIVE-23516.08.patch, 
> HIVE-23516.09.patch, HIVE-23516.10.patch, HIVE-23516.11.patch, 
> HIVE-23516.12.patch, HIVE-23516.13.patch, HIVE-23516.14.patch, 
> HIVE-23516.15.patch, HIVE-23516.16.patch, HIVE-23516.17.patch, 
> HIVE-23516.18.patch, HIVE-23516.19.patch, HIVE-23516.20.patch, 
> HIVE-23516.21.patch, HIVE-23516.22.patch, HIVE-23516.23.patch, 
> HIVE-23516.24.patch, HIVE-23516.25.patch, HIVE-23516.26.patch, 
> HIVE-23516.27.patch, HIVE-23516.28.patch, HIVE-23516.29.patch, 
> HIVE-23516.30.patch, HIVE-23516.31.patch, HIVE-23516.32.patch, 
> HIVE-23516.33.patch, HIVE-23516.34.patch, Replication Metrics.pdf
>
>  Time Spent: 9h 20m
>  Remaining Estimate: 0h
>
> Details documented in the attached doc



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23628) Add Default Message for Github PRs

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23628?focusedWorklogId=443099=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443099
 ]

ASF GitHub Bot logged work on HIVE-23628:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:31
Start Date: 09/Jun/20 16:31
Worklog Time Spent: 10m 
  Work Description: belugabehr commented on pull request #1068:
URL: https://github.com/apache/hive/pull/1068#issuecomment-640597002


   @kgyrtkirk Can I get a +1 please?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443099)
Time Spent: 50m  (was: 40m)

> Add Default Message for Github PRs
> --
>
> Key: HIVE-23628
> URL: https://issues.apache.org/jira/browse/HIVE-23628
> Project: Hive
>  Issue Type: Improvement
>Reporter: David Mollitor
>Assignee: David Mollitor
>Priority: Minor
>  Labels: pull-request-available
>  Time Spent: 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21709) Count with expression does not work in Parquet

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21709?focusedWorklogId=443087=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443087
 ]

ASF GitHub Bot logged work on HIVE-21709:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:30
Start Date: 09/Jun/20 16:30
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #631:
URL: https://github.com/apache/hive/pull/631#issuecomment-641143424


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443087)
Time Spent: 20m  (was: 10m)

> Count with expression does not work in Parquet
> --
>
> Key: HIVE-21709
> URL: https://issues.apache.org/jira/browse/HIVE-21709
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 2.3.2
>Reporter: Mainak Ghosh
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> For parquet file with nested schema, count with expression as column name 
> does not work when you are filtering on another column in the same struct. 
> Here are the steps to reproduce:
> {code:java}
> CREATE TABLE `test_table`( `rtb_win` struct<`impression_id`:string, 
> `pub_id`:string>) ROW FORMAT SERDE 
> 'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe' STORED AS 
> INPUTFORMAT 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat' 
> OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat';
> INSERT INTO TABLE test_table SELECT named_struct('impression_id', 'cat', 
> 'pub_id', '2');
> select count(rtb_win.impression_id) from test_table where rtb_win.pub_id ='2';
> WARNING: Hive-on-MR is deprecated in Hive 2 and may not be available in the 
> future versions. Consider using a different execution engine (i.e. spark, 
> tez) or using Hive 1.X releases.
> +--+ 
> | _c0  |
> +--+ 
> | 0    | 
> +--+
> select count(*) from test_parquet_count_mghosh where rtb_win.pub_id ='2';
> WARNING: Hive-on-MR is deprecated in Hive 2 and may not be available in the 
> future versions. Consider using a different execution engine (i.e. spark, 
> tez) or using Hive 1.X releases. 
> +--+ 
> | _c0  | 
> +--+ 
> | 1    | 
> +--+{code}
> As you can see the first query returns the wrong result while the second one 
> returns the correct result.
> The issue is an column order mismatch between the actual parquet file 
> (impression_id first and pub_id second) and the Hive prunedCols datastructure 
> (reverse). As a result in the filter we compare with the wrong value and the 
> count returns 0. I have been able to identify the cause of this mismatch.
> I would love to get the code reviewed and merged. Some of the code changes 
> are changes to commits from Ferdinand Xu and Chao Sun.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23625) HS2 Web UI displays query drill-down results in plain text, not html

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23625?focusedWorklogId=443084=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443084
 ]

ASF GitHub Bot logged work on HIVE-23625:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:30
Start Date: 09/Jun/20 16:30
Worklog Time Spent: 10m 
  Work Description: marton-bod opened a new pull request #1075:
URL: https://github.com/apache/hive/pull/1075


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443084)
Remaining Estimate: 0h
Time Spent: 10m

> HS2 Web UI displays query drill-down results in plain text, not html
> 
>
> Key: HIVE-23625
> URL: https://issues.apache.org/jira/browse/HIVE-23625
> Project: Hive
>  Issue Type: Bug
>Reporter: Marton Bod
>Assignee: Marton Bod
>Priority: Major
> Attachments: HIVE-23625.1.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Opening a drilldown link on the HS2 Web UI, you are directed to the following 
> URL: /query_page?operationId=
> Since the path /query_page contains no file extensions, Jetty cannot 
> determine the mimetype and therefore the Hive HttpServer returns response 
> header Content-Type: text/plain;charset=utf-8, and the information does not 
> render as html in the browser. This should be corrected to return 
> text/html;charset=utf-8.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20679) DDL operations on hive might create large messages for DBNotification

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20679?focusedWorklogId=443085=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443085
 ]

ASF GitHub Bot logged work on HIVE-20679:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:30
Start Date: 09/Jun/20 16:30
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #449:
URL: https://github.com/apache/hive/pull/449#issuecomment-641144336


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443085)
Remaining Estimate: 0h
Time Spent: 10m

> DDL operations on hive might create large messages for DBNotification
> -
>
> Key: HIVE-20679
> URL: https://issues.apache.org/jira/browse/HIVE-20679
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 3.1.0
>Reporter: Anishek Agarwal
>Assignee: Anishek Agarwal
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-20679.1.patch, HIVE-20679.2.patch, 
> HIVE-20679.3.patch, HIVE-20679.4.patch, HIVE-20679.5.patch, 
> HIVE-20679.6.patch, HIVE-20679.8.patch, HIVE-20679.9.patch, a.sql, b.sql
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Certain type of ddl operations might create large messages as part of 
> DBNoitification, this might lead to the rdbms throwing an error when storing 
> the message since its size is to large. It will also increase the footprint 
> of the rdbms space usage. 
> We should try store compressed messages to allow handling these situations. 
> Edit: For notification_log table the message column for all supported 
> databases can store messages from 2GB to 4GB



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-23625) HS2 Web UI displays query drill-down results in plain text, not html

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-23625:
--
Labels: pull-request-available  (was: )

> HS2 Web UI displays query drill-down results in plain text, not html
> 
>
> Key: HIVE-23625
> URL: https://issues.apache.org/jira/browse/HIVE-23625
> Project: Hive
>  Issue Type: Bug
>Reporter: Marton Bod
>Assignee: Marton Bod
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23625.1.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Opening a drilldown link on the HS2 Web UI, you are directed to the following 
> URL: /query_page?operationId=
> Since the path /query_page contains no file extensions, Jetty cannot 
> determine the mimetype and therefore the Hive HttpServer returns response 
> header Content-Type: text/plain;charset=utf-8, and the information does not 
> render as html in the browser. This should be corrected to return 
> text/html;charset=utf-8.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21556) Useless configuration for old jetty in log4j.properties

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21556?focusedWorklogId=443076=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443076
 ]

ASF GitHub Bot logged work on HIVE-21556:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:29
Start Date: 09/Jun/20 16:29
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #586:
URL: https://github.com/apache/hive/pull/586#issuecomment-641143704


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443076)
Time Spent: 20m  (was: 10m)

> Useless configuration for old jetty in log4j.properties
> ---
>
> Key: HIVE-21556
> URL: https://issues.apache.org/jira/browse/HIVE-21556
> Project: Hive
>  Issue Type: Improvement
>  Components: Configuration
>Reporter: Chen Zhi
>Priority: Minor
>  Labels: patch-available, pull-request-available
> Attachments: HIVE-21556.1.patch
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
>  
> {code:java}
> logger.Mortbay.name = org.mortbay
> logger.Mortbay.level = INFO
> {code}
> The logger `Mortbay` in log4j.properties is used to control logging 
> activities of jetty (6.x). However, we have upgrade to jetty 9 in HIVE-16049, 
> the package name has changed to `org.eclipse.jetty` and we have added the new 
> logger to control jetty. `Mortbay` is useless. I guess we can remove it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20805) Hive does not copy source data when importing as non-hive user

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20805?focusedWorklogId=443077=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443077
 ]

ASF GitHub Bot logged work on HIVE-20805:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:29
Start Date: 09/Jun/20 16:29
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #482:
URL: https://github.com/apache/hive/pull/482#issuecomment-641144242


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443077)
Remaining Estimate: 0h
Time Spent: 10m

> Hive does not copy source data when importing as non-hive user 
> ---
>
> Key: HIVE-20805
> URL: https://issues.apache.org/jira/browse/HIVE-20805
> Project: Hive
>  Issue Type: Bug
>  Components: Hive, HiveServer2
>Affects Versions: 4.0.0
>Reporter: mahesh kumar behera
>Assignee: mahesh kumar behera
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-20805.03.patch, HIVE-20805.1.patch, 
> HIVE-20805.2.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> while loading data to a managed table from user given path, Hive uses move 
> operation to copy data from user location to table location. In case move can 
> not be used due to permission issue or mismatched encryption zone etc, hive 
> uses copy and then deletes the files from source location to keep to behavior 
> same. But in case the user does not have write access to the source location, 
> delete will fail with file permission exception and load operation will fail. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20283) Logs may be directed to 2 files if --hiveconf hive.log.file is used (metastore)

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20283?focusedWorklogId=443075=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443075
 ]

ASF GitHub Bot logged work on HIVE-20283:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:29
Start Date: 09/Jun/20 16:29
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #409:
URL: https://github.com/apache/hive/pull/409#issuecomment-641357175


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443075)
Remaining Estimate: 0h
Time Spent: 10m

> Logs may be directed to 2 files if --hiveconf hive.log.file is used 
> (metastore)
> ---
>
> Key: HIVE-20283
> URL: https://issues.apache.org/jira/browse/HIVE-20283
> Project: Hive
>  Issue Type: Bug
>Reporter: Jaume M
>Assignee: Jaume M
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-20283.1.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Unfortunately when doing this : 
> https://issues.apache.org/jira/browse/HIVE-19886 I forgot to do it as well 
> for the metastore



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20784) Migrate hbase.util.Base64 to java.util.Base64

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20784?focusedWorklogId=443079=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443079
 ]

ASF GitHub Bot logged work on HIVE-20784:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:29
Start Date: 09/Jun/20 16:29
Worklog Time Spent: 10m 
  Work Description: belugabehr commented on pull request #451:
URL: https://github.com/apache/hive/pull/451#issuecomment-641336806


   Can you please rebase on master?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443079)
Time Spent: 40m  (was: 0.5h)

> Migrate hbase.util.Base64 to java.util.Base64
> -
>
> Key: HIVE-20784
> URL: https://issues.apache.org/jira/browse/HIVE-20784
> Project: Hive
>  Issue Type: Sub-task
>  Components: HBase Handler
>Affects Versions: 3.1.0
> Environment: HBase 2.0.2
> Hive 3.1.0
>Reporter: Dagang Wei
>Assignee: Dagang Wei
>Priority: Critical
>  Labels: pull-request-available
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> By default Hive 3.1.0 depends on HBase 2.0.0-alpha4. HBase 2.0.2 migrated 
> from hbase.util.Base64 to java.util.Base64 (HBASE-20884), which causes Hive 
> 3.1.0 fails to build with HBase 2.0.2.
>  
> $ cd hbase-handler
> $ mvn package -DskipTests -Dhbase.version=2.0.2
> [ERROR] 
> .../hive/hbase-handler/src/java/org/apache/hadoop/hive/hbase/HiveHBaseTableSnapshotInputFormat.java:[29,36]
>  cannot find symbol
> [ERROR] symbol: class Base64
> [ERROR] location: package org.apache.hadoop.hbase.util 
>  
> To make Hive works with 2.0.2+ (and also older versions), we should consider 
> migrating Hive to java.util.Base64.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23026) support add a yarn application name for tez on hiveserver2

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23026?focusedWorklogId=443073=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443073
 ]

ASF GitHub Bot logged work on HIVE-23026:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:28
Start Date: 09/Jun/20 16:28
Worklog Time Spent: 10m 
  Work Description: belugabehr commented on pull request #947:
URL: https://github.com/apache/hive/pull/947#issuecomment-640728193







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443073)
Time Spent: 1h 50m  (was: 1h 40m)

> support add a yarn application name for tez on hiveserver2
> --
>
> Key: HIVE-23026
> URL: https://issues.apache.org/jira/browse/HIVE-23026
> Project: Hive
>  Issue Type: Improvement
>  Components: Tez
>Affects Versions: 2.0.0
>Reporter: Jake Xie
>Priority: Major
>  Labels: pull-request-available
> Fix For: 2.0.0, 3.0.0
>
>  Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
> Currently tez on hiveServer2 cannot specify yarn application name, which is 
> not very convenient for locating the problem SQL, so i added a configuration 
> item to support setting tez job name



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20817) Reading Timestamp datatype via HiveServer2 gives errors

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20817?focusedWorklogId=443064=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443064
 ]

ASF GitHub Bot logged work on HIVE-20817:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:28
Start Date: 09/Jun/20 16:28
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #481:
URL: https://github.com/apache/hive/pull/481#issuecomment-641144251


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443064)
Remaining Estimate: 0h
Time Spent: 10m

> Reading Timestamp datatype via HiveServer2 gives errors
> ---
>
> Key: HIVE-20817
> URL: https://issues.apache.org/jira/browse/HIVE-20817
> Project: Hive
>  Issue Type: Bug
>  Components: Hive
>Affects Versions: 4.0.0
>Reporter: mahesh kumar behera
>Assignee: mahesh kumar behera
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-20817.01.patch, HIVE-20817.02.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> CREATE TABLE JdbcBasicRead ( empno int, desg string,empname string,doj 
> timestamp,Salary float,mgrid smallint, deptno tinyint ) ROW FORMAT DELIMITED 
> FIELDS TERMINATED BY ',';
> LOAD DATA LOCAL INPATH '/tmp/art_jdbc/hive/input/input_7columns.txt' 
> OVERWRITE INTO TABLE JdbcBasicRead;
> Sample Data.
> —
> 7369,M,SMITH,1980-12-17 17:07:29.234234,5000.00,7902,20
> 7499,X,ALLEN,1981-02-20 17:07:29.234234,1250.00,7698,30
> 7521,X,WARD,1981-02-22 17:07:29.234234,01600.57,7698,40
> 7566,M,JONES,1981-04-02 17:07:29.234234,02975.65,7839,10
> 7654,X,MARTIN,1981-09-28 17:07:29.234234,01250.00,7698,20
> 7698,M,BLAKE,1981-05-01 17:07:29.234234,2850.98,7839,30
> 7782,M,CLARK,1981-06-09 17:07:29.234234,02450.00,7839,20
> —
> Select statement: SELECT empno, desg, empname, doj, salary, mgrid, deptno 
> FROM JdbcBasicWrite
> {code}
> 2018-09-25T07:11:03,222 WARN [HiveServer2-Handler-Pool: Thread-83]: 
> thrift.ThriftCLIService (:()) - Error fetching results:
> org.apache.hive.service.cli.HiveSQLException: java.lang.ClassCastException: 
> org.apache.hadoop.hive.common.type.Timestamp cannot be cast to 
> java.sql.Timestamp
> at 
> org.apache.hive.service.cli.operation.SQLOperation.getNextRowSet(SQLOperation.java:469)
>  ~[hive-service-3.1.0.3.0.1.0-187.jar:3.1.0.3.0.1.0-187]
> at 
> org.apache.hive.service.cli.operation.OperationManager.getOperationNextRowSet(OperationManager.java:328)
>  ~[hive-service-3.1.0.3.0.1.0-187.jar:3.1.0.3.0.1.0-187]
> at 
> org.apache.hive.service.cli.session.HiveSessionImpl.fetchResults(HiveSessionImpl.java:910)
>  ~[hive-service-3.1.0.3.0.1.0-187.jar:3.1.0.3.0.1.0-187]
> at sun.reflect.GeneratedMethodAccessor50.invoke(Unknown Source) ~[?:?]
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>  ~[?:1.8.0_112]
> at java.lang.reflect.Method.invoke(Method.java:498) ~[?:1.8.0_112]
> at 
> org.apache.hive.service.cli.session.HiveSessionProxy.invoke(HiveSessionProxy.java:78)
>  ~[hive-service-3.1.0.3.0.1.0-187.jar:3.1.0.3.0.1.0-187]
> at 
> org.apache.hive.service.cli.session.HiveSessionProxy.access$000(HiveSessionProxy.java:36)
>  ~[hive-service-3.1.0.3.0.1.0-187.jar:3.1.0.3.0.1.0-187]
> at 
> org.apache.hive.service.cli.session.HiveSessionProxy$1.run(HiveSessionProxy.java:63)
>  ~[hive-service-3.1.0.3.0.1.0-187.jar:3.1.0.3.0.1.0-187]
> at java.security.AccessController.doPrivileged(Native Method) ~[?:1.8.0_112]
> at javax.security.auth.Subject.doAs(Subject.java:422) ~[?:1.8.0_112]
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1730)
>  ~[hadoop-common-3.1.1.3.0.1.0-187.jar:?]
> at 
> org.apache.hive.service.cli.session.HiveSessionProxy.invoke(HiveSessionProxy.java:59)
>  ~[hive-service-3.1.0.3.0.1.0-187.jar:3.1.0.3.0.1.0-187]
> at com.sun.proxy.$Proxy46.fetchResults(Unknown Source) ~[?:?]
> at org.apache.hive.service.cli.CLIService.fetchResults(CLIService.java:564) 
> ~[hive-service-3.1.0.3.0.1.0-187.jar:3.1.0.3.0.1.0-187]
> at 
> org.apache.hive.service.cli.thrift.ThriftCLIService.FetchResults(ThriftCLIService.java:786)
>  

[jira] [Work logged] (HIVE-21218) KafkaSerDe doesn't support topics created via Confluent Avro serializer

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21218?focusedWorklogId=443065=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443065
 ]

ASF GitHub Bot logged work on HIVE-21218:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:28
Start Date: 09/Jun/20 16:28
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #526:
URL: https://github.com/apache/hive/pull/526#issuecomment-641143952


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443065)
Time Spent: 15.5h  (was: 15h 20m)

> KafkaSerDe doesn't support topics created via Confluent Avro serializer
> ---
>
> Key: HIVE-21218
> URL: https://issues.apache.org/jira/browse/HIVE-21218
> Project: Hive
>  Issue Type: Bug
>  Components: kafka integration, Serializers/Deserializers
>Affects Versions: 3.1.1
>Reporter: Milan Baran
>Assignee: David McGinnis
>Priority: Major
>  Labels: pull-request-available
> Attachments: 
> 0001-HIVE-21818-Adding-ability-for-Kafka-Handler-to-proce.patch, 
> HIVE-21218.10.patch, HIVE-21218.11.patch, HIVE-21218.12.patch, 
> HIVE-21218.13.patch, HIVE-21218.2.patch, HIVE-21218.3.patch, 
> HIVE-21218.4.patch, HIVE-21218.5.patch, HIVE-21218.6.patch, 
> HIVE-21218.7.patch, HIVE-21218.8.patch, HIVE-21218.9.patch, HIVE-21218.patch
>
>  Time Spent: 15.5h
>  Remaining Estimate: 0h
>
> According to [Google 
> groups|https://groups.google.com/forum/#!topic/confluent-platform/JYhlXN0u9_A]
>  the Confluent avro serialzier uses propertiary format for kafka value - 
> <4 bytes of schema ID> conforms to schema>. 
> This format does not cause any problem for Confluent kafka deserializer which 
> respect the format however for hive kafka handler its bit a problem to 
> correctly deserialize kafka value, because Hive uses custom deserializer from 
> bytes to objects and ignores kafka consumer ser/deser classes provided via 
> table property.
> It would be nice to support Confluent format with magic byte.
> Also it would be great to support Schema registry as well.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20531) Repl load on cloud storage file system can skip redundant move or add partition tasks.

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20531?focusedWorklogId=443060=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443060
 ]

ASF GitHub Bot logged work on HIVE-20531:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:27
Start Date: 09/Jun/20 16:27
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #431:
URL: https://github.com/apache/hive/pull/431#issuecomment-641144464


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443060)
Remaining Estimate: 0h
Time Spent: 10m

> Repl load on cloud storage file system can skip redundant move or add 
> partition tasks.
> --
>
> Key: HIVE-20531
> URL: https://issues.apache.org/jira/browse/HIVE-20531
> Project: Hive
>  Issue Type: Sub-task
>  Components: repl
>Affects Versions: 4.0.0
>Reporter: mahesh kumar behera
>Assignee: mahesh kumar behera
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-20531.01.patch, HIVE-20531.02.patch, 
> HIVE-20531.03.patch, HIVE-20531.04.patch, HIVE-20531.05.patch, 
> HIVE-20531.06.patch, HIVE-20531.07.patch, HIVE-20531.08.patch, 
> HIVE-20531.09.patch, HIVE-20531.10.patch, HIVE-20531.11.patch, 
> HIVE-20531.12.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> In replication load, both add partition and insert operations are handled 
> through import. Import creates 3 major tasks. Copy, add partition and move. 
> Copy does the copy of data from source location to staging directory. Then 
> add partition (which runs in parallel to copy) creates the partition in meta 
> store. Its a no op in case of insert and by the time this ddl task is 
> executed for insert partition would be already present. The third operation 
> is move. Which actually moves the file from staging directory to actual 
> location. And then in case of insert it adds the insert event to notification 
> table. It does this for add partition operation which is redundant as the 
> event for add partition would have been written already by ddl task. With the 
> optimization to copy directly to actual table location in S3, move task can 
> be avoided for add partition operation replay and replay of insert need not 
> create the add partition (ddl) task.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23516) Store hive replication policy execution metrics in the relational DB

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23516?focusedWorklogId=443054=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443054
 ]

ASF GitHub Bot logged work on HIVE-23516:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:26
Start Date: 09/Jun/20 16:26
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk commented on a change in pull request #1044:
URL: https://github.com/apache/hive/pull/1044#discussion_r437295212



##
File path: standalone-metastore/metastore-server/src/main/resources/package.jdo
##
@@ -1523,6 +1523,29 @@
 
   

+
+  

Review comment:
   I don't know if currently there is a way to remove these metrics entries 
- but it could be an option to piggy-pack on  the scheduled query execution 
removal by declaring a foreign key for this fields with "on delete cascade"





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443054)
Time Spent: 9h 10m  (was: 9h)

> Store hive replication policy execution metrics in the relational DB
> 
>
> Key: HIVE-23516
> URL: https://issues.apache.org/jira/browse/HIVE-23516
> Project: Hive
>  Issue Type: Task
>Reporter: Aasha Medhi
>Assignee: Aasha Medhi
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23516.01.patch, HIVE-23516.02.patch, 
> HIVE-23516.03.patch, HIVE-23516.04.patch, HIVE-23516.05.patch, 
> HIVE-23516.06.patch, HIVE-23516.07.patch, HIVE-23516.08.patch, 
> HIVE-23516.09.patch, HIVE-23516.10.patch, HIVE-23516.11.patch, 
> HIVE-23516.12.patch, HIVE-23516.13.patch, HIVE-23516.14.patch, 
> HIVE-23516.15.patch, HIVE-23516.16.patch, HIVE-23516.17.patch, 
> HIVE-23516.18.patch, HIVE-23516.19.patch, HIVE-23516.20.patch, 
> HIVE-23516.21.patch, HIVE-23516.22.patch, HIVE-23516.23.patch, 
> HIVE-23516.24.patch, HIVE-23516.25.patch, HIVE-23516.26.patch, 
> HIVE-23516.27.patch, HIVE-23516.28.patch, HIVE-23516.29.patch, 
> HIVE-23516.30.patch, HIVE-23516.31.patch, HIVE-23516.32.patch, 
> HIVE-23516.33.patch, HIVE-23516.34.patch, Replication Metrics.pdf
>
>  Time Spent: 9h 10m
>  Remaining Estimate: 0h
>
> Details documented in the attached doc



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21776) Replication fails to replicate a UDF with jar on HDFS during incremental

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21776?focusedWorklogId=443045=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443045
 ]

ASF GitHub Bot logged work on HIVE-21776:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:25
Start Date: 09/Jun/20 16:25
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #642:
URL: https://github.com/apache/hive/pull/642#issuecomment-641143369


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443045)
Time Spent: 1h 20m  (was: 1h 10m)

> Replication fails to replicate a UDF with jar on HDFS during incremental
> 
>
> Key: HIVE-21776
> URL: https://issues.apache.org/jira/browse/HIVE-21776
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 4.0.0
>Reporter: Ashutosh Bapat
>Assignee: Ashutosh Bapat
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-21776.01.patch, HIVE-21776.02.patch, 
> HIVE-21776.03.patch, HIVE-21776.04.patch
>
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> When a UDF with jar on HDFS is replicated, we add the jar path to the dump. 
> The dumped URL of jar has checksum and cmroot added to it. During load, we 
> load the jar on target. ReplCopyTask handles the jar paths separately from 
> the paths in _files and it uses the presence of checksum and cmroot for that 
> decision. (Those two are not present in _files URL). If ReplChangeManager is 
> not initialized during dump, dumped URL of jar does not contain checksum and 
> cmroot and thus ReplCopyTask fails to copy the UDF jar to the target. This 
> fails the repl load since the function can not be created. Fix is to 
> initialize ReplChangeManager always.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20423) Set NULLS LAST as the default null ordering

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20423?focusedWorklogId=443047=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443047
 ]

ASF GitHub Bot logged work on HIVE-20423:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:26
Start Date: 09/Jun/20 16:26
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #419:
URL: https://github.com/apache/hive/pull/419#issuecomment-641144560


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443047)
Remaining Estimate: 0h
Time Spent: 10m

> Set NULLS LAST as the default null ordering
> ---
>
> Key: HIVE-20423
> URL: https://issues.apache.org/jira/browse/HIVE-20423
> Project: Hive
>  Issue Type: Bug
>Reporter: Teddy Choi
>Assignee: Teddy Choi
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0, 3.2.0
>
> Attachments: HIVE-20423.1.patch, HIVE-20423.10-branch-3.patch, 
> HIVE-20423.2.patch, HIVE-20423.3.patch, HIVE-20423.4.patch, 
> HIVE-20423.4.patch, HIVE-20423.5.patch, HIVE-20423.6.patch, 
> HIVE-20423.7.patch, HIVE-20423.8-branch-3.patch, 
> HIVE-20423.8.1-branch.3.patch, HIVE-20423.8.patch, HIVE-20423.9-branch.3.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> HIVE-20150 TopNKeyOperator pushdown can be more efficient if NULLS LAST 
> becomes the default null ordering.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20680) Bootstrap is missing partitions in replicated DB when retry after kill query.

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20680?focusedWorklogId=443041=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443041
 ]

ASF GitHub Bot logged work on HIVE-20680:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:25
Start Date: 09/Jun/20 16:25
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #443:
URL: https://github.com/apache/hive/pull/443#issuecomment-641144376


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443041)
Remaining Estimate: 0h
Time Spent: 10m

> Bootstrap is missing partitions in replicated DB when retry after kill query.
> -
>
> Key: HIVE-20680
> URL: https://issues.apache.org/jira/browse/HIVE-20680
> Project: Hive
>  Issue Type: Bug
>  Components: repl
>Affects Versions: 4.0.0
>Reporter: mahesh kumar behera
>Assignee: mahesh kumar behera
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-20680.01.patch, HIVE-20680.02.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> The issue is
> 1. When bootstrap was going on, kill query was called to kill the repl load 
> command.
> 2. During restart, one table with no partition set the scope to table as the 
> ckpt property was not yet set for that table.
> 3. Due to this, all partitioned table after this didn't get their tasks 
> related to partition appended in the root task if the ckpt property is 
> already updated for those tables.
> The fix is to reset the table scope to false if for that table there are no 
> tasks added.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23026) support add a yarn application name for tez on hiveserver2

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23026?focusedWorklogId=443046=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443046
 ]

ASF GitHub Bot logged work on HIVE-23026:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:25
Start Date: 09/Jun/20 16:25
Worklog Time Spent: 10m 
  Work Description: xiejiajun opened a new pull request #1082:
URL: https://github.com/apache/hive/pull/1082


   ### What is this PR for?
   - add a configuration item to support setting tez job name
   
   ### What type of PR is it?
   - feature
   
   ### What is the Jira issue?
   - https://issues.apache.org/jira/browse/HIVE-23026



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443046)
Time Spent: 1h 40m  (was: 1.5h)

> support add a yarn application name for tez on hiveserver2
> --
>
> Key: HIVE-23026
> URL: https://issues.apache.org/jira/browse/HIVE-23026
> Project: Hive
>  Issue Type: Improvement
>  Components: Tez
>Affects Versions: 2.0.0
>Reporter: Jake Xie
>Priority: Major
>  Labels: pull-request-available
> Fix For: 2.0.0, 3.0.0
>
>  Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
> Currently tez on hiveServer2 cannot specify yarn application name, which is 
> not very convenient for locating the problem SQL, so i added a configuration 
> item to support setting tez job name



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21126) Allow session level queries in LlapBaseInputFormat#getSplits() before actual get_splits() call

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21126?focusedWorklogId=443031=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443031
 ]

ASF GitHub Bot logged work on HIVE-21126:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:24
Start Date: 09/Jun/20 16:24
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #515:
URL: https://github.com/apache/hive/pull/515#issuecomment-641144031


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443031)
Remaining Estimate: 0h
Time Spent: 10m

> Allow session level queries in LlapBaseInputFormat#getSplits() before actual 
> get_splits() call
> --
>
> Key: HIVE-21126
> URL: https://issues.apache.org/jira/browse/HIVE-21126
> Project: Hive
>  Issue Type: Improvement
>  Components: llap
>Affects Versions: 3.1.1
>Reporter: Shubham Chaurasia
>Assignee: Shubham Chaurasia
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0, 3.2.0
>
> Attachments: HIVE-21126.1.patch, HIVE-21126.2.patch, 
> HIVE-21126.3.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Facilitate execution of session level queries before \{{select get_splits()}} 
> call. This will allow us to set params like \{{tez.grouping.split-count}} 
> which can be taken into consideration while splits calculation.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-20386) Allow not to strip hidden fields from Hiveconf

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-20386?focusedWorklogId=443038=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443038
 ]

ASF GitHub Bot logged work on HIVE-20386:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:24
Start Date: 09/Jun/20 16:24
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #417:
URL: https://github.com/apache/hive/pull/417#issuecomment-641144597


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443038)
Remaining Estimate: 0h
Time Spent: 10m

> Allow not to strip hidden fields from Hiveconf
> --
>
> Key: HIVE-20386
> URL: https://issues.apache.org/jira/browse/HIVE-20386
> Project: Hive
>  Issue Type: Improvement
>  Components: Configuration
>Affects Versions: 3.1.0
>Reporter: Jaume M
>Assignee: Jaume M
>Priority: Major
>  Labels: pull-request-available
> Fix For: 3.2.0
>
> Attachments: HIVE-20386.1.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Currently some sensitive fields from the HiveConf are [being 
> stripped|https://github.com/apache/hive/blob/93779cb1010dc357ae3eb9dab77b35b4a444a33c/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java#L5713]
>  in different places and this may result in bugs like 
> https://issues.apache.org/jira/browse/HIVE-14848. It'd be nice to have a flag 
> to allow to control this and not strip them if the particular configuration 
> requires those fields.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-21700) Hive incremental load going OOM while adding load task to the leaf nodes of the DAG.

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-21700?focusedWorklogId=443028=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443028
 ]

ASF GitHub Bot logged work on HIVE-21700:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:23
Start Date: 09/Jun/20 16:23
Worklog Time Spent: 10m 
  Work Description: github-actions[bot] commented on pull request #618:
URL: https://github.com/apache/hive/pull/618#issuecomment-641143534


   This pull request has been automatically marked as stale because it has not 
had recent activity. It will be closed if no further activity occurs.
   Feel free to reach out on the d...@hive.apache.org list if the patch is in 
need of reviews.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443028)
Time Spent: 20m  (was: 10m)

> Hive incremental load going OOM while adding load task to the leaf nodes of 
> the DAG.
> 
>
> Key: HIVE-21700
> URL: https://issues.apache.org/jira/browse/HIVE-21700
> Project: Hive
>  Issue Type: Bug
>  Components: Hive
>Affects Versions: 4.0.0
>Reporter: mahesh kumar behera
>Assignee: mahesh kumar behera
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-21700.01.patch
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> While listing the child nodes to check for leaf node, we need to filter out 
> tasks which are already added to the children list. If a task is added 
> multiple time to the children list then it may cause the list to grow 
> exponentially. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (HIVE-23621) Enforce ASF headers on source files

2020-06-09 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23621?focusedWorklogId=443022=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-443022
 ]

ASF GitHub Bot logged work on HIVE-23621:
-

Author: ASF GitHub Bot
Created on: 09/Jun/20 16:23
Start Date: 09/Jun/20 16:23
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk merged pull request #1062:
URL: https://github.com/apache/hive/pull/1062


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 443022)
Time Spent: 0.5h  (was: 20m)

> Enforce ASF headers on source files
> ---
>
> Key: HIVE-23621
> URL: https://issues.apache.org/jira/browse/HIVE-23621
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


  1   2   3   >