[jira] [Resolved] (HIVE-25443) Arrow SerDe Cannot serialize/deserialize complex data types When there are more than 1024 values

2021-11-17 Thread Zoltan Haindrich (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zoltan Haindrich resolved HIVE-25443.
-
Resolution: Fixed

merged into master. Thank you [~srahman]!

> Arrow SerDe Cannot serialize/deserialize complex data types When there are 
> more than 1024 values
> 
>
> Key: HIVE-25443
> URL: https://issues.apache.org/jira/browse/HIVE-25443
> Project: Hive
>  Issue Type: Bug
>  Components: Serializers/Deserializers
>Affects Versions: 3.1.0, 3.0.0, 3.1.1, 3.1.2
>Reporter: Syed Shameerur Rahman
>Assignee: Syed Shameerur Rahman
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> Complex data types like MAP, STRUCT cannot be serialized/deserialzed using 
> Arrow SerDe when there are more than 1024 values. This happens due to 
> ColumnVector always being initialized with a size of 1024.
> Issue #1 : 
> https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/arrow/ArrowColumnarBatchSerDe.java#L213
> Issue #2 : 
> https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/arrow/ArrowColumnarBatchSerDe.java#L215
> Sample unit test to reproduce the case in TestArrowColumnarBatchSerDe :
> {code:java}
> @Test
>public void testListBooleanWithMoreThan1024Values() throws SerDeException {
>  String[][] schema = {
>  {"boolean_list", "array"},
>  };
>   
>  Object[][] rows = new Object[1025][1];
>  for (int i = 0; i < 1025; i++) {
>rows[i][0] = new BooleanWritable(true);
>  }
>   
>  initAndSerializeAndDeserialize(schema, toList(rows));
>}
>   
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25443) Arrow SerDe Cannot serialize/deserialize complex data types When there are more than 1024 values

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25443?focusedWorklogId=683127=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-683127
 ]

ASF GitHub Bot logged work on HIVE-25443:
-

Author: ASF GitHub Bot
Created on: 18/Nov/21 07:30
Start Date: 18/Nov/21 07:30
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk merged pull request #2581:
URL: https://github.com/apache/hive/pull/2581


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 683127)
Time Spent: 40m  (was: 0.5h)

> Arrow SerDe Cannot serialize/deserialize complex data types When there are 
> more than 1024 values
> 
>
> Key: HIVE-25443
> URL: https://issues.apache.org/jira/browse/HIVE-25443
> Project: Hive
>  Issue Type: Bug
>  Components: Serializers/Deserializers
>Affects Versions: 3.1.0, 3.0.0, 3.1.1, 3.1.2
>Reporter: Syed Shameerur Rahman
>Assignee: Syed Shameerur Rahman
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> Complex data types like MAP, STRUCT cannot be serialized/deserialzed using 
> Arrow SerDe when there are more than 1024 values. This happens due to 
> ColumnVector always being initialized with a size of 1024.
> Issue #1 : 
> https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/arrow/ArrowColumnarBatchSerDe.java#L213
> Issue #2 : 
> https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/arrow/ArrowColumnarBatchSerDe.java#L215
> Sample unit test to reproduce the case in TestArrowColumnarBatchSerDe :
> {code:java}
> @Test
>public void testListBooleanWithMoreThan1024Values() throws SerDeException {
>  String[][] schema = {
>  {"boolean_list", "array"},
>  };
>   
>  Object[][] rows = new Object[1025][1];
>  for (int i = 0; i < 1025; i++) {
>rows[i][0] = new BooleanWritable(true);
>  }
>   
>  initAndSerializeAndDeserialize(schema, toList(rows));
>}
>   
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (HIVE-25714) Some tests are flaky because docker is not able to start in 5 seconds

2021-11-17 Thread Zoltan Haindrich (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445693#comment-17445693
 ] 

Zoltan Haindrich commented on HIVE-25714:
-

ok; I'm not a big fan of that - but of course we can try that...actually; 
adding some log lines could also help to know how much stuff we are reading 
from docker logs

> Some tests are flaky because docker is not able to start in 5 seconds
> -
>
> Key: HIVE-25714
> URL: https://issues.apache.org/jira/browse/HIVE-25714
> Project: Hive
>  Issue Type: Improvement
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>
> there are some testruns failing with; and on the test site multiple pods are 
> running in parallel - its not an ideal environment for tight deadlines
> {code}
> Unexpected exception java.lang.RuntimeException: Process docker failed to run 
> in 5 seconds
>  at 
> org.apache.hadoop.hive.ql.externalDB.AbstractExternalDB.runCmd(AbstractExternalDB.java:92)
>  at 
> org.apache.hadoop.hive.ql.externalDB.AbstractExternalDB.launchDockerContainer(AbstractExternalDB.java:123)
>  at 
> org.apache.hadoop.hive.ql.qoption.QTestDatabaseHandler.beforeTest(QTestDatabaseHandler.java:111)
>  at 
> org.apache.hadoop.hive.ql.qoption.QTestOptionDispatcher.beforeTest(QTestOptionDispatcher.java:79)
> {code}
> http://ci.hive.apache.org/job/hive-precommit/job/PR-1674/4/testReport/junit/org.apache.hadoop.hive.cli.split19/TestMiniLlapLocalCliDriver/Testing___split_14___PostProcess___testCliDriver_qt_database_all_/



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25095) Beeline/hive -e command can't deal with query with trailing quote

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25095?focusedWorklogId=683122=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-683122
 ]

ASF GitHub Bot logged work on HIVE-25095:
-

Author: ASF GitHub Bot
Created on: 18/Nov/21 07:22
Start Date: 18/Nov/21 07:22
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk merged pull request #2526:
URL: https://github.com/apache/hive/pull/2526


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 683122)
Time Spent: 1h 20m  (was: 1h 10m)

> Beeline/hive -e command can't deal with query with trailing quote
> -
>
> Key: HIVE-25095
> URL: https://issues.apache.org/jira/browse/HIVE-25095
> Project: Hive
>  Issue Type: Bug
>Reporter: Robbie Zhang
>Assignee: Robbie Zhang
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> The command 
> {code:java}
> hive -e 'select "hive"'{code}
> and
> {code:java}
> beeline -e 'select "hive"'{code}
> fail with such error:
> {code:java}
> Error: Error while compiling statement: FAILED: ParseException line 1:12 
> character '' not supported here (state=42000,code=4){code}
> The reason is that org.apache.commons.cli.Util.stripLeadingAndTrailingQuotes 
> in commons-cli-1.2.jar strips the trailing quote so the query string is 
> changed to
> {code:java}
> select "hive{code}
> This bug is fixed in commons-cli-1.3.1 and commons-cli-1.4.jar. The 
> workaround is to overwrite commons-cli-1.2.jar with commons-cli-1.3.1 or 
> commons-cli-1.4.jar.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Resolved] (HIVE-25095) Beeline/hive -e command can't deal with query with trailing quote

2021-11-17 Thread Zoltan Haindrich (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zoltan Haindrich resolved HIVE-25095.
-
Fix Version/s: 4.0.0
   Resolution: Fixed

merged into master. Thank you [~robbiezhang]!

> Beeline/hive -e command can't deal with query with trailing quote
> -
>
> Key: HIVE-25095
> URL: https://issues.apache.org/jira/browse/HIVE-25095
> Project: Hive
>  Issue Type: Bug
>Reporter: Robbie Zhang
>Assignee: Robbie Zhang
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> The command 
> {code:java}
> hive -e 'select "hive"'{code}
> and
> {code:java}
> beeline -e 'select "hive"'{code}
> fail with such error:
> {code:java}
> Error: Error while compiling statement: FAILED: ParseException line 1:12 
> character '' not supported here (state=42000,code=4){code}
> The reason is that org.apache.commons.cli.Util.stripLeadingAndTrailingQuotes 
> in commons-cli-1.2.jar strips the trailing quote so the query string is 
> changed to
> {code:java}
> select "hive{code}
> This bug is fixed in commons-cli-1.3.1 and commons-cli-1.4.jar. The 
> workaround is to overwrite commons-cli-1.2.jar with commons-cli-1.3.1 or 
> commons-cli-1.4.jar.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Assigned] (HIVE-25722) Compaction fails if there are empty buckets.

2021-11-17 Thread Arko Sharma (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arko Sharma reassigned HIVE-25722:
--


> Compaction fails if there are empty buckets.
> 
>
> Key: HIVE-25722
> URL: https://issues.apache.org/jira/browse/HIVE-25722
> Project: Hive
>  Issue Type: Bug
>Reporter: Arko Sharma
>Assignee: Arko Sharma
>Priority: Major
>
> Compaction fails if there are empty buckets.
> This patch detects and deletes empty buckets before launching compaction in 
> order to protect it from failing.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25721) Outer join result is wrong

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25721?focusedWorklogId=683113=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-683113
 ]

ASF GitHub Bot logged work on HIVE-25721:
-

Author: ASF GitHub Bot
Created on: 18/Nov/21 06:39
Start Date: 18/Nov/21 06:39
Worklog Time Spent: 10m 
  Work Description: SparksFyz commented on pull request #2798:
URL: https://github.com/apache/hive/pull/2798#issuecomment-972581163


   @maheshk114 @zabetak Could you please take a look when you have a chance? 
Thanks!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 683113)
Time Spent: 20m  (was: 10m)

> Outer join result is wrong
> --
>
> Key: HIVE-25721
> URL: https://issues.apache.org/jira/browse/HIVE-25721
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: All Versions
>Reporter: Yizhen Fan
>Assignee: Yizhen Fan
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-25721.patch
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Outer Join results is wrong, here is a left join case.
> select b.fields from a left join b on a.key=b.key and a.filter=xxx
> there are some necessary condition to produce this problem:
>  # `select` clause only contains right table fields
>  # `on` clause contains left table condition, and this condition can filter 
> records 
> h3. cause:
> candidateStorage[tag].addRow(value); // CommonMergeJoinOperator.process
> row of left table cannot be add into row container because tblDesc of left 
> table is null, while left table data can not be ignored in this case.
> h3. Reproducible steps are mentioned below.
> 
> set hive.auto.convert.join=false;
> create table t_smj_left (key string, value int);
> insert into t_smj_left values
> ('key1', 1),
> ('key1', 2);
> create table t_smj_right (key string, value int);
> insert into t_smj_right values
> ('key1', 1);
> select
> t2.value
> from t_smj_left t1
> left join t_smj_right t2 on t1.key=t2.key and t1.value=2;
>  
> Result:
> +
> NULL
> NULL
> +
> Expected Output:
> +
> 1
> NULL
> +



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25721) Outer join result is wrong

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25721?focusedWorklogId=683112=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-683112
 ]

ASF GitHub Bot logged work on HIVE-25721:
-

Author: ASF GitHub Bot
Created on: 18/Nov/21 06:37
Start Date: 18/Nov/21 06:37
Worklog Time Spent: 10m 
  Work Description: SparksFyz opened a new pull request #2798:
URL: https://github.com/apache/hive/pull/2798


   For CommonMergeJoinOperator, the tableDesc will be null in the case that all 
columns in that table is not used. The tableDesc is null lead to using a dummy 
row to denote all rows in that table instead of adding row into rowContainer.
   However, rows in that table cannot be ignored when it contains filter in on 
clause.
   
   Reproduced steps are commented in the following ticket.
   https://issues.apache.org/jira/browse/HIVE-25721


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 683112)
Remaining Estimate: 0h
Time Spent: 10m

> Outer join result is wrong
> --
>
> Key: HIVE-25721
> URL: https://issues.apache.org/jira/browse/HIVE-25721
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: All Versions
>Reporter: Yizhen Fan
>Assignee: Yizhen Fan
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-25721.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Outer Join results is wrong, here is a left join case.
> select b.fields from a left join b on a.key=b.key and a.filter=xxx
> there are some necessary condition to produce this problem:
>  # `select` clause only contains right table fields
>  # `on` clause contains left table condition, and this condition can filter 
> records 
> h3. cause:
> candidateStorage[tag].addRow(value); // CommonMergeJoinOperator.process
> row of left table cannot be add into row container because tblDesc of left 
> table is null, while left table data can not be ignored in this case.
> h3. Reproducible steps are mentioned below.
> 
> set hive.auto.convert.join=false;
> create table t_smj_left (key string, value int);
> insert into t_smj_left values
> ('key1', 1),
> ('key1', 2);
> create table t_smj_right (key string, value int);
> insert into t_smj_right values
> ('key1', 1);
> select
> t2.value
> from t_smj_left t1
> left join t_smj_right t2 on t1.key=t2.key and t1.value=2;
>  
> Result:
> +
> NULL
> NULL
> +
> Expected Output:
> +
> 1
> NULL
> +



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HIVE-25721) Outer join result is wrong

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-25721:
--
Labels: pull-request-available  (was: )

> Outer join result is wrong
> --
>
> Key: HIVE-25721
> URL: https://issues.apache.org/jira/browse/HIVE-25721
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: All Versions
>Reporter: Yizhen Fan
>Assignee: Yizhen Fan
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: HIVE-25721.patch
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Outer Join results is wrong, here is a left join case.
> select b.fields from a left join b on a.key=b.key and a.filter=xxx
> there are some necessary condition to produce this problem:
>  # `select` clause only contains right table fields
>  # `on` clause contains left table condition, and this condition can filter 
> records 
> h3. cause:
> candidateStorage[tag].addRow(value); // CommonMergeJoinOperator.process
> row of left table cannot be add into row container because tblDesc of left 
> table is null, while left table data can not be ignored in this case.
> h3. Reproducible steps are mentioned below.
> 
> set hive.auto.convert.join=false;
> create table t_smj_left (key string, value int);
> insert into t_smj_left values
> ('key1', 1),
> ('key1', 2);
> create table t_smj_right (key string, value int);
> insert into t_smj_right values
> ('key1', 1);
> select
> t2.value
> from t_smj_left t1
> left join t_smj_right t2 on t1.key=t2.key and t1.value=2;
>  
> Result:
> +
> NULL
> NULL
> +
> Expected Output:
> +
> 1
> NULL
> +



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work started] (HIVE-25721) Outer join result is wrong

2021-11-17 Thread Yizhen Fan (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-25721 started by Yizhen Fan.
-
> Outer join result is wrong
> --
>
> Key: HIVE-25721
> URL: https://issues.apache.org/jira/browse/HIVE-25721
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: All Versions
>Reporter: Yizhen Fan
>Assignee: Yizhen Fan
>Priority: Major
> Fix For: 4.0.0
>
> Attachments: HIVE-25721.patch
>
>
> Outer Join results is wrong, here is a left join case.
> select b.fields from a left join b on a.key=b.key and a.filter=xxx
> there are some necessary condition to produce this problem:
>  # `select` clause only contains right table fields
>  # `on` clause contains left table condition, and this condition can filter 
> records 
> h3. cause:
> candidateStorage[tag].addRow(value); // CommonMergeJoinOperator.process
> row of left table cannot be add into row container because tblDesc of left 
> table is null, while left table data can not be ignored in this case.
> h3. Reproducible steps are mentioned below.
> 
> set hive.auto.convert.join=false;
> create table t_smj_left (key string, value int);
> insert into t_smj_left values
> ('key1', 1),
> ('key1', 2);
> create table t_smj_right (key string, value int);
> insert into t_smj_right values
> ('key1', 1);
> select
> t2.value
> from t_smj_left t1
> left join t_smj_right t2 on t1.key=t2.key and t1.value=2;
>  
> Result:
> +
> NULL
> NULL
> +
> Expected Output:
> +
> 1
> NULL
> +



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HIVE-25721) Outer join result is wrong

2021-11-17 Thread Jira


 [ 
https://issues.apache.org/jira/browse/HIVE-25721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

范宜臻 updated HIVE-25721:
---
Attachment: HIVE-25721.patch

> Outer join result is wrong
> --
>
> Key: HIVE-25721
> URL: https://issues.apache.org/jira/browse/HIVE-25721
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: All Versions
>Reporter: 范宜臻
>Assignee: 范宜臻
>Priority: Major
> Fix For: 4.0.0
>
> Attachments: HIVE-25721.patch
>
>
> Outer Join results is wrong, here is a left join case.
> select b.fields from a left join b on a.key=b.key and a.filter=xxx
> there are some necessary condition to produce this problem:
>  # `select` clause only contains right table fields
>  # `on` clause contains left table condition, and this condition can filter 
> records 
> h3. cause:
> candidateStorage[tag].addRow(value); // CommonMergeJoinOperator.process
> row of left table cannot be add into row container because tblDesc of left 
> table is null, while left table data can not be ignored in this case.
> h3. Reproducible steps are mentioned below.
> 
> set hive.auto.convert.join=false;
> create table t_smj_left (key string, value int);
> insert into t_smj_left values
> ('key1', 1),
> ('key1', 2);
> create table t_smj_right (key string, value int);
> insert into t_smj_right values
> ('key1', 1);
> select
> t2.value
> from t_smj_left t1
> left join t_smj_right t2 on t1.key=t2.key and t1.value=2;
>  
> Result:
> +
> NULL
> NULL
> +
> Expected Output:
> +
> 1
> NULL
> +



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HIVE-25721) Outer join result is wrong

2021-11-17 Thread Jira


 [ 
https://issues.apache.org/jira/browse/HIVE-25721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

范宜臻 updated HIVE-25721:
---
Attachment: (was: HIVE-25721.patch)

> Outer join result is wrong
> --
>
> Key: HIVE-25721
> URL: https://issues.apache.org/jira/browse/HIVE-25721
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: All Versions
>Reporter: 范宜臻
>Assignee: 范宜臻
>Priority: Major
> Fix For: 4.0.0
>
>
> Outer Join results is wrong, here is a left join case.
> select b.fields from a left join b on a.key=b.key and a.filter=xxx
> there are some necessary condition to produce this problem:
>  # `select` clause only contains right table fields
>  # `on` clause contains left table condition, and this condition can filter 
> records 
> h3. cause:
> candidateStorage[tag].addRow(value); // CommonMergeJoinOperator.process
> row of left table cannot be add into row container because tblDesc of left 
> table is null, while left table data can not be ignored in this case.
> h3. Reproducible steps are mentioned below.
> 
> set hive.auto.convert.join=false;
> create table t_smj_left (key string, value int);
> insert into t_smj_left values
> ('key1', 1),
> ('key1', 2);
> create table t_smj_right (key string, value int);
> insert into t_smj_right values
> ('key1', 1);
> select
> t2.value
> from t_smj_left t1
> left join t_smj_right t2 on t1.key=t2.key and t1.value=2;
>  
> Result:
> +
> NULL
> NULL
> +
> Expected Output:
> +
> 1
> NULL
> +



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HIVE-25721) Outer join result is wrong

2021-11-17 Thread Jira


 [ 
https://issues.apache.org/jira/browse/HIVE-25721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

范宜臻 updated HIVE-25721:
---
Attachment: HIVE-25721.patch

> Outer join result is wrong
> --
>
> Key: HIVE-25721
> URL: https://issues.apache.org/jira/browse/HIVE-25721
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: All Versions
>Reporter: 范宜臻
>Assignee: 范宜臻
>Priority: Major
> Fix For: 4.0.0
>
> Attachments: HIVE-25721.patch
>
>
> Outer Join results is wrong, here is a left join case.
> select b.fields from a left join b on a.key=b.key and a.filter=xxx
> there are some necessary condition to produce this problem:
>  # `select` clause only contains right table fields
>  # `on` clause contains left table condition, and this condition can filter 
> records 
> h3. cause:
> candidateStorage[tag].addRow(value); // CommonMergeJoinOperator.process
> row of left table cannot be add into row container because tblDesc of left 
> table is null, while left table data can not be ignored in this case.
> h3. Reproducible steps are mentioned below.
> 
> set hive.auto.convert.join=false;
> create table t_smj_left (key string, value int);
> insert into t_smj_left values
> ('key1', 1),
> ('key1', 2);
> create table t_smj_right (key string, value int);
> insert into t_smj_right values
> ('key1', 1);
> select
> t2.value
> from t_smj_left t1
> left join t_smj_right t2 on t1.key=t2.key and t1.value=2;
>  
> Result:
> +
> NULL
> NULL
> +
> Expected Output:
> +
> 1
> NULL
> +



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HIVE-25721) Outer join result is wrong

2021-11-17 Thread Jira


 [ 
https://issues.apache.org/jira/browse/HIVE-25721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

范宜臻 updated HIVE-25721:
---
Description: 
Outer Join results is wrong, here is a left join case.

select b.fields from a left join b on a.key=b.key and a.filter=xxx

there are some necessary condition to produce this problem:
 # `select` clause only contains right table fields
 # `on` clause contains left table condition, and this condition can filter 
records 

h3. cause:

candidateStorage[tag].addRow(value); // CommonMergeJoinOperator.process

row of left table cannot be add into row container because tblDesc of left 
table is null, while left table data can not be ignored in this case.
h3. Reproducible steps are mentioned below.



set hive.auto.convert.join=false;

create table t_smj_left (key string, value int);

insert into t_smj_left values
('key1', 1),
('key1', 2);

create table t_smj_right (key string, value int);

insert into t_smj_right values
('key1', 1);

select
t2.value
from t_smj_left t1
left join t_smj_right t2 on t1.key=t2.key and t1.value=2;

 

Result:

+

NULL
NULL

+

Expected Output:

+

1

NULL

+

  was:
Outer Join results is wrong, here is a left join case.

select b.fields from a left join b on a.key=b.key and a.filter=xxx

there are some necessary condition to produce this problem:
 # `select` clause only contains right table fields
 # `on` clause contains left table condition, and this condition can filter 
records 

Reproducible steps are mentioned below.



set hive.auto.convert.join=false;

create table t_smj_left (key string, value int);

insert into t_smj_left values
('key1', 1),
('key1', 2);

create table t_smj_right (key string, value int);

insert into t_smj_right values
('key1', 1);

select
t2.value
from t_smj_left t1
left join t_smj_right t2 on t1.key=t2.key and t1.value=2;

 

Result:

+

NULL
NULL

+

Expected Output:

+

1

NULL

+


> Outer join result is wrong
> --
>
> Key: HIVE-25721
> URL: https://issues.apache.org/jira/browse/HIVE-25721
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: All Versions
>Reporter: 范宜臻
>Assignee: 范宜臻
>Priority: Major
> Fix For: 4.0.0
>
>
> Outer Join results is wrong, here is a left join case.
> select b.fields from a left join b on a.key=b.key and a.filter=xxx
> there are some necessary condition to produce this problem:
>  # `select` clause only contains right table fields
>  # `on` clause contains left table condition, and this condition can filter 
> records 
> h3. cause:
> candidateStorage[tag].addRow(value); // CommonMergeJoinOperator.process
> row of left table cannot be add into row container because tblDesc of left 
> table is null, while left table data can not be ignored in this case.
> h3. Reproducible steps are mentioned below.
> 
> set hive.auto.convert.join=false;
> create table t_smj_left (key string, value int);
> insert into t_smj_left values
> ('key1', 1),
> ('key1', 2);
> create table t_smj_right (key string, value int);
> insert into t_smj_right values
> ('key1', 1);
> select
> t2.value
> from t_smj_left t1
> left join t_smj_right t2 on t1.key=t2.key and t1.value=2;
>  
> Result:
> +
> NULL
> NULL
> +
> Expected Output:
> +
> 1
> NULL
> +



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HIVE-25721) Outer join result is wrong

2021-11-17 Thread Jira


 [ 
https://issues.apache.org/jira/browse/HIVE-25721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

范宜臻 updated HIVE-25721:
---
Description: 
Outer Join results is wrong, here is a left join case.

select b.fields from a left join b on a.key=b.key and a.filter=xxx

there are some necessary condition to produce this problem:
 # `select` clause only contains right table fields
 # `on` clause contains left table condition, and this condition can filter 
records 

Reproducible steps are mentioned below.



set hive.auto.convert.join=false;

create table t_smj_left (key string, value int);

insert into t_smj_left values
('key1', 1),
('key1', 2);

create table t_smj_right (key string, value int);

insert into t_smj_right values
('key1', 1);

select
t2.value
from t_smj_left t1
left join t_smj_right t2 on t1.key=t2.key and t1.value=2;

 

Result:

+

NULL
NULL

+

Expected Output:

+

1

NULL

+

> Outer join result is wrong
> --
>
> Key: HIVE-25721
> URL: https://issues.apache.org/jira/browse/HIVE-25721
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: All Versions
>Reporter: 范宜臻
>Assignee: 范宜臻
>Priority: Major
> Fix For: 4.0.0
>
>
> Outer Join results is wrong, here is a left join case.
> select b.fields from a left join b on a.key=b.key and a.filter=xxx
> there are some necessary condition to produce this problem:
>  # `select` clause only contains right table fields
>  # `on` clause contains left table condition, and this condition can filter 
> records 
> Reproducible steps are mentioned below.
> 
> set hive.auto.convert.join=false;
> create table t_smj_left (key string, value int);
> insert into t_smj_left values
> ('key1', 1),
> ('key1', 2);
> create table t_smj_right (key string, value int);
> insert into t_smj_right values
> ('key1', 1);
> select
> t2.value
> from t_smj_left t1
> left join t_smj_right t2 on t1.key=t2.key and t1.value=2;
>  
> Result:
> +
> NULL
> NULL
> +
> Expected Output:
> +
> 1
> NULL
> +



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HIVE-25721) Outer join result is wrong

2021-11-17 Thread Jira


 [ 
https://issues.apache.org/jira/browse/HIVE-25721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

范宜臻 updated HIVE-25721:
---
Summary: Outer join result is wrong  (was: Out join result is wrong)

> Outer join result is wrong
> --
>
> Key: HIVE-25721
> URL: https://issues.apache.org/jira/browse/HIVE-25721
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: All Versions
>Reporter: 范宜臻
>Assignee: 范宜臻
>Priority: Major
> Fix For: 4.0.0
>
>




--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Assigned] (HIVE-25721) Out join result is wrong

2021-11-17 Thread Jira


 [ 
https://issues.apache.org/jira/browse/HIVE-25721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

范宜臻 reassigned HIVE-25721:
--


> Out join result is wrong
> 
>
> Key: HIVE-25721
> URL: https://issues.apache.org/jira/browse/HIVE-25721
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: All Versions
>Reporter: 范宜臻
>Assignee: 范宜臻
>Priority: Major
> Fix For: 4.0.0
>
>




--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25443) Arrow SerDe Cannot serialize/deserialize complex data types When there are more than 1024 values

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25443?focusedWorklogId=683102=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-683102
 ]

ASF GitHub Bot logged work on HIVE-25443:
-

Author: ASF GitHub Bot
Created on: 18/Nov/21 05:20
Start Date: 18/Nov/21 05:20
Worklog Time Spent: 10m 
  Work Description: shameersss1 commented on pull request #2581:
URL: https://github.com/apache/hive/pull/2581#issuecomment-972548623


   > @shameersss1 there are some commits with your 
[r@amazon.com](mailto:r@amazon.com) address - could you add associate 
that email address with your github account? without that the committer email 
address will be changed to some 
[...@users.noreply.github.com](mailto:...@users.noreply.github.com)
   
   @kgyrtkirk  - I have linked that email address. Please take it forward and 
thanks for the review.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 683102)
Time Spent: 0.5h  (was: 20m)

> Arrow SerDe Cannot serialize/deserialize complex data types When there are 
> more than 1024 values
> 
>
> Key: HIVE-25443
> URL: https://issues.apache.org/jira/browse/HIVE-25443
> Project: Hive
>  Issue Type: Bug
>  Components: Serializers/Deserializers
>Affects Versions: 3.1.0, 3.0.0, 3.1.1, 3.1.2
>Reporter: Syed Shameerur Rahman
>Assignee: Syed Shameerur Rahman
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Complex data types like MAP, STRUCT cannot be serialized/deserialzed using 
> Arrow SerDe when there are more than 1024 values. This happens due to 
> ColumnVector always being initialized with a size of 1024.
> Issue #1 : 
> https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/arrow/ArrowColumnarBatchSerDe.java#L213
> Issue #2 : 
> https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/arrow/ArrowColumnarBatchSerDe.java#L215
> Sample unit test to reproduce the case in TestArrowColumnarBatchSerDe :
> {code:java}
> @Test
>public void testListBooleanWithMoreThan1024Values() throws SerDeException {
>  String[][] schema = {
>  {"boolean_list", "array"},
>  };
>   
>  Object[][] rows = new Object[1025][1];
>  for (int i = 0; i < 1025; i++) {
>rows[i][0] = new BooleanWritable(true);
>  }
>   
>  initAndSerializeAndDeserialize(schema, toList(rows));
>}
>   
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25582) Empty result when using offset limit with MR

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25582?focusedWorklogId=683092=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-683092
 ]

ASF GitHub Bot logged work on HIVE-25582:
-

Author: ASF GitHub Bot
Created on: 18/Nov/21 04:56
Start Date: 18/Nov/21 04:56
Worklog Time Spent: 10m 
  Work Description: dengzhhu653 opened a new pull request #2693:
URL: https://github.com/apache/hive/pull/2693


   
   
   ### What changes were proposed in this pull request?
   
   
   
   ### Why are the changes needed?
   
   
   
   ### Does this PR introduce _any_ user-facing change?
   
   
   
   ### How was this patch tested?
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 683092)
Time Spent: 2h  (was: 1h 50m)

> Empty result when using offset limit with MR
> 
>
> Key: HIVE-25582
> URL: https://issues.apache.org/jira/browse/HIVE-25582
> Project: Hive
>  Issue Type: Bug
>  Components: Operators
>Affects Versions: 4.0.0
>Reporter: Zhihua Deng
>Assignee: Zhihua Deng
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 2h
>  Remaining Estimate: 0h
>
> The _mr.ObjectCache_ caches nothing, every time when the limit [retrieving 
> global counter from the 
> cache|https://github.com/apache/hive/blob/7b3ecf617a6d46f48a3b6f77e0339fd4ad95a420/ql/src/java/org/apache/hadoop/hive/ql/exec/LimitOperator.java#L150-L161],
>  a new AtomicInteger will be returned. This make offset _<= 
> currentCountForAllTasksInt_ always be evaluated to false, as _offset > 0_, 
> the operator will skip all rows.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25582) Empty result when using offset limit with MR

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25582?focusedWorklogId=683091=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-683091
 ]

ASF GitHub Bot logged work on HIVE-25582:
-

Author: ASF GitHub Bot
Created on: 18/Nov/21 04:55
Start Date: 18/Nov/21 04:55
Worklog Time Spent: 10m 
  Work Description: dengzhhu653 closed pull request #2693:
URL: https://github.com/apache/hive/pull/2693


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 683091)
Time Spent: 1h 50m  (was: 1h 40m)

> Empty result when using offset limit with MR
> 
>
> Key: HIVE-25582
> URL: https://issues.apache.org/jira/browse/HIVE-25582
> Project: Hive
>  Issue Type: Bug
>  Components: Operators
>Affects Versions: 4.0.0
>Reporter: Zhihua Deng
>Assignee: Zhihua Deng
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
> The _mr.ObjectCache_ caches nothing, every time when the limit [retrieving 
> global counter from the 
> cache|https://github.com/apache/hive/blob/7b3ecf617a6d46f48a3b6f77e0339fd4ad95a420/ql/src/java/org/apache/hadoop/hive/ql/exec/LimitOperator.java#L150-L161],
>  a new AtomicInteger will be returned. This make offset _<= 
> currentCountForAllTasksInt_ always be evaluated to false, as _offset > 0_, 
> the operator will skip all rows.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (HIVE-25095) Beeline/hive -e command can't deal with query with trailing quote

2021-11-17 Thread Robbie Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445564#comment-17445564
 ] 

Robbie Zhang commented on HIVE-25095:
-

[~kgyrtkirk] , thanks for your offering :D

> Beeline/hive -e command can't deal with query with trailing quote
> -
>
> Key: HIVE-25095
> URL: https://issues.apache.org/jira/browse/HIVE-25095
> Project: Hive
>  Issue Type: Bug
>Reporter: Robbie Zhang
>Assignee: Robbie Zhang
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> The command 
> {code:java}
> hive -e 'select "hive"'{code}
> and
> {code:java}
> beeline -e 'select "hive"'{code}
> fail with such error:
> {code:java}
> Error: Error while compiling statement: FAILED: ParseException line 1:12 
> character '' not supported here (state=42000,code=4){code}
> The reason is that org.apache.commons.cli.Util.stripLeadingAndTrailingQuotes 
> in commons-cli-1.2.jar strips the trailing quote so the query string is 
> changed to
> {code:java}
> select "hive{code}
> This bug is fixed in commons-cli-1.3.1 and commons-cli-1.4.jar. The 
> workaround is to overwrite commons-cli-1.2.jar with commons-cli-1.3.1 or 
> commons-cli-1.4.jar.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (HIVE-25720) Fix flaky test TestScheduledReplicationScenarios

2021-11-17 Thread Zoltan Haindrich (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445400#comment-17445400
 ] 

Zoltan Haindrich commented on HIVE-25720:
-

2 testcases are already disabled in this class by HIVE-23395 ; in recent runs 
I've seen issues with the other 2 testmethods...

> Fix flaky test TestScheduledReplicationScenarios
> 
>
> Key: HIVE-25720
> URL: https://issues.apache.org/jira/browse/HIVE-25720
> Project: Hive
>  Issue Type: Improvement
>Reporter: Zoltan Haindrich
>Priority: Major
>
> failed at the first attempt; the issue happened during
> {code}
> drop scheduled query repl_load_p2
> {code}
> which is in a finally block ; so this exception may be shadowing another 
> exception
> http://ci.hive.apache.org/job/hive-flaky-check/463/



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (HIVE-25718) ORDER BY query on external MSSQL table fails

2021-11-17 Thread Stamatis Zampetakis (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445364#comment-17445364
 ] 

Stamatis Zampetakis commented on HIVE-25718:


Most likely the problem is on CBO which seems to generate a query that cannot 
be executed on MSSQL. This may require a fix in Apache Calcite.

> ORDER BY query on external MSSQL table fails
> 
>
> Key: HIVE-25718
> URL: https://issues.apache.org/jira/browse/HIVE-25718
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Reporter: Stamatis Zampetakis
>Priority: Major
> Attachments: jdbc_table_orderby_mssql.q
>
>
> +Microsoft SQLServer+
> {code:sql}
> CREATE TABLE country (id   int, name varchar(20));
> insert into country values (1, 'India');
> insert into country values (2, 'Russia');
> insert into country values (3, 'USA');
> {code}
> +Hive+
> {code:sql}
> CREATE EXTERNAL TABLE country (id int, name varchar(20))
> STORED BY 'org.apache.hive.storage.jdbc.JdbcStorageHandler'
> TBLPROPERTIES (
> "hive.sql.database.type" = "MSSQL",
> "hive.sql.jdbc.driver" = "com.microsoft.sqlserver.jdbc.SQLServerDriver",
> "hive.sql.jdbc.url" = "jdbc:sqlserver://localhost:1433;",
> "hive.sql.dbcp.username" = "sa",
> "hive.sql.dbcp.password" = "Its-a-s3cret",
> "hive.sql.table" = "country");
> SELECT * FROM country ORDER BY id;
> {code}
> The query fails with the following stacktrace:
> {noformat}
> com.microsoft.sqlserver.jdbc.SQLServerException: The ORDER BY clause is 
> invalid in views, inline functions, derived tables, subqueries, and common 
> table expressions, unless TOP, OFFSET or FOR XML is also specified.
> at 
> com.microsoft.sqlserver.jdbc.SQLServerException.makeFromDatabaseError(SQLServerException.java:258)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerStatement.getNextResult(SQLServerStatement.java:1535)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerPreparedStatement.doExecutePreparedStatement(SQLServerPreparedStatement.java:467)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerPreparedStatement$PrepStmtExecCmd.doExecute(SQLServerPreparedStatement.java:409)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.TDSCommand.execute(IOBuffer.java:7151) 
> ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerConnection.executeCommand(SQLServerConnection.java:2478)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerStatement.executeCommand(SQLServerStatement.java:219)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerStatement.executeStatement(SQLServerStatement.java:199)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerPreparedStatement.executeQuery(SQLServerPreparedStatement.java:331)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> org.apache.commons.dbcp2.DelegatingPreparedStatement.executeQuery(DelegatingPreparedStatement.java:122)
>  ~[commons-dbcp2-2.7.0.jar:2.7.0]
> at 
> org.apache.commons.dbcp2.DelegatingPreparedStatement.executeQuery(DelegatingPreparedStatement.java:122)
>  ~[commons-dbcp2-2.7.0.jar:2.7.0]
> at 
> org.apache.hive.storage.jdbc.dao.GenericJdbcDatabaseAccessor.getRecordIterator(GenericJdbcDatabaseAccessor.java:180)
>  [hive-jdbc-handler-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hive.storage.jdbc.JdbcRecordReader.next(JdbcRecordReader.java:58) 
> [hive-jdbc-handler-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hive.storage.jdbc.JdbcRecordReader.next(JdbcRecordReader.java:35) 
> [hive-jdbc-handler-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.ql.exec.FetchOperator.getNextRow(FetchOperator.java:589)
>  [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.ql.exec.FetchOperator.pushRow(FetchOperator.java:529) 
> [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at org.apache.hadoop.hive.ql.exec.FetchTask.fetch(FetchTask.java:150) 
> [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.ql.Driver.getFetchingTableResults(Driver.java:716) 
> [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at org.apache.hadoop.hive.ql.Driver.getResults(Driver.java:668) 
> [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.ql.reexec.ReExecDriver.getResults(ReExecDriver.java:241)
>  [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:277) 
> [hive-cli-4.0.0-SNAPSHOT.jar:?]
> at 
> 

[jira] [Updated] (HIVE-25718) ORDER BY query on external MSSQL table fails

2021-11-17 Thread Stamatis Zampetakis (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stamatis Zampetakis updated HIVE-25718:
---
Attachment: jdbc_table_orderby_mssql.q

> ORDER BY query on external MSSQL table fails
> 
>
> Key: HIVE-25718
> URL: https://issues.apache.org/jira/browse/HIVE-25718
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Reporter: Stamatis Zampetakis
>Priority: Major
> Attachments: jdbc_table_orderby_mssql.q
>
>
> +Microsoft SQLServer+
> {code:sql}
> CREATE TABLE country (id   int, name varchar(20));
> insert into country values (1, 'India');
> insert into country values (2, 'Russia');
> insert into country values (3, 'USA');
> {code}
> +Hive+
> {code:sql}
> CREATE EXTERNAL TABLE country (id int, name varchar(20))
> STORED BY 'org.apache.hive.storage.jdbc.JdbcStorageHandler'
> TBLPROPERTIES (
> "hive.sql.database.type" = "MSSQL",
> "hive.sql.jdbc.driver" = "com.microsoft.sqlserver.jdbc.SQLServerDriver",
> "hive.sql.jdbc.url" = "jdbc:sqlserver://localhost:1433;",
> "hive.sql.dbcp.username" = "sa",
> "hive.sql.dbcp.password" = "Its-a-s3cret",
> "hive.sql.table" = "country");
> SELECT * FROM country ORDER BY id;
> {code}
> The query fails with the following stacktrace:
> {noformat}
> com.microsoft.sqlserver.jdbc.SQLServerException: The ORDER BY clause is 
> invalid in views, inline functions, derived tables, subqueries, and common 
> table expressions, unless TOP, OFFSET or FOR XML is also specified.
> at 
> com.microsoft.sqlserver.jdbc.SQLServerException.makeFromDatabaseError(SQLServerException.java:258)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerStatement.getNextResult(SQLServerStatement.java:1535)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerPreparedStatement.doExecutePreparedStatement(SQLServerPreparedStatement.java:467)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerPreparedStatement$PrepStmtExecCmd.doExecute(SQLServerPreparedStatement.java:409)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.TDSCommand.execute(IOBuffer.java:7151) 
> ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerConnection.executeCommand(SQLServerConnection.java:2478)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerStatement.executeCommand(SQLServerStatement.java:219)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerStatement.executeStatement(SQLServerStatement.java:199)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerPreparedStatement.executeQuery(SQLServerPreparedStatement.java:331)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> org.apache.commons.dbcp2.DelegatingPreparedStatement.executeQuery(DelegatingPreparedStatement.java:122)
>  ~[commons-dbcp2-2.7.0.jar:2.7.0]
> at 
> org.apache.commons.dbcp2.DelegatingPreparedStatement.executeQuery(DelegatingPreparedStatement.java:122)
>  ~[commons-dbcp2-2.7.0.jar:2.7.0]
> at 
> org.apache.hive.storage.jdbc.dao.GenericJdbcDatabaseAccessor.getRecordIterator(GenericJdbcDatabaseAccessor.java:180)
>  [hive-jdbc-handler-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hive.storage.jdbc.JdbcRecordReader.next(JdbcRecordReader.java:58) 
> [hive-jdbc-handler-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hive.storage.jdbc.JdbcRecordReader.next(JdbcRecordReader.java:35) 
> [hive-jdbc-handler-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.ql.exec.FetchOperator.getNextRow(FetchOperator.java:589)
>  [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.ql.exec.FetchOperator.pushRow(FetchOperator.java:529) 
> [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at org.apache.hadoop.hive.ql.exec.FetchTask.fetch(FetchTask.java:150) 
> [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.ql.Driver.getFetchingTableResults(Driver.java:716) 
> [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at org.apache.hadoop.hive.ql.Driver.getResults(Driver.java:668) 
> [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.ql.reexec.ReExecDriver.getResults(ReExecDriver.java:241)
>  [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:277) 
> [hive-cli-4.0.0-SNAPSHOT.jar:?]
> at 
> org.apache.hadoop.hive.cli.CliDriver.processCmd1(CliDriver.java:201) 
> [hive-cli-4.0.0-SNAPSHOT.jar:?]
> at 
> org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:127) 
> 

[jira] [Commented] (HIVE-25718) ORDER BY query on external MSSQL table fails

2021-11-17 Thread Stamatis Zampetakis (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445362#comment-17445362
 ] 

Stamatis Zampetakis commented on HIVE-25718:


To reproduce the problem run:
{code:bash}
mvn test -Dtest=TestMiniLlapLocalCliDriver -Dqfile=jdbc_table_orderby_mssql.q 
-Dtest.output.overwrite
{code}

> ORDER BY query on external MSSQL table fails
> 
>
> Key: HIVE-25718
> URL: https://issues.apache.org/jira/browse/HIVE-25718
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Reporter: Stamatis Zampetakis
>Priority: Major
> Attachments: jdbc_table_orderby_mssql.q
>
>
> +Microsoft SQLServer+
> {code:sql}
> CREATE TABLE country (id   int, name varchar(20));
> insert into country values (1, 'India');
> insert into country values (2, 'Russia');
> insert into country values (3, 'USA');
> {code}
> +Hive+
> {code:sql}
> CREATE EXTERNAL TABLE country (id int, name varchar(20))
> STORED BY 'org.apache.hive.storage.jdbc.JdbcStorageHandler'
> TBLPROPERTIES (
> "hive.sql.database.type" = "MSSQL",
> "hive.sql.jdbc.driver" = "com.microsoft.sqlserver.jdbc.SQLServerDriver",
> "hive.sql.jdbc.url" = "jdbc:sqlserver://localhost:1433;",
> "hive.sql.dbcp.username" = "sa",
> "hive.sql.dbcp.password" = "Its-a-s3cret",
> "hive.sql.table" = "country");
> SELECT * FROM country ORDER BY id;
> {code}
> The query fails with the following stacktrace:
> {noformat}
> com.microsoft.sqlserver.jdbc.SQLServerException: The ORDER BY clause is 
> invalid in views, inline functions, derived tables, subqueries, and common 
> table expressions, unless TOP, OFFSET or FOR XML is also specified.
> at 
> com.microsoft.sqlserver.jdbc.SQLServerException.makeFromDatabaseError(SQLServerException.java:258)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerStatement.getNextResult(SQLServerStatement.java:1535)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerPreparedStatement.doExecutePreparedStatement(SQLServerPreparedStatement.java:467)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerPreparedStatement$PrepStmtExecCmd.doExecute(SQLServerPreparedStatement.java:409)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.TDSCommand.execute(IOBuffer.java:7151) 
> ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerConnection.executeCommand(SQLServerConnection.java:2478)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerStatement.executeCommand(SQLServerStatement.java:219)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerStatement.executeStatement(SQLServerStatement.java:199)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> com.microsoft.sqlserver.jdbc.SQLServerPreparedStatement.executeQuery(SQLServerPreparedStatement.java:331)
>  ~[mssql-jdbc-6.2.1.jre8.jar:?]
> at 
> org.apache.commons.dbcp2.DelegatingPreparedStatement.executeQuery(DelegatingPreparedStatement.java:122)
>  ~[commons-dbcp2-2.7.0.jar:2.7.0]
> at 
> org.apache.commons.dbcp2.DelegatingPreparedStatement.executeQuery(DelegatingPreparedStatement.java:122)
>  ~[commons-dbcp2-2.7.0.jar:2.7.0]
> at 
> org.apache.hive.storage.jdbc.dao.GenericJdbcDatabaseAccessor.getRecordIterator(GenericJdbcDatabaseAccessor.java:180)
>  [hive-jdbc-handler-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hive.storage.jdbc.JdbcRecordReader.next(JdbcRecordReader.java:58) 
> [hive-jdbc-handler-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hive.storage.jdbc.JdbcRecordReader.next(JdbcRecordReader.java:35) 
> [hive-jdbc-handler-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.ql.exec.FetchOperator.getNextRow(FetchOperator.java:589)
>  [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.ql.exec.FetchOperator.pushRow(FetchOperator.java:529) 
> [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at org.apache.hadoop.hive.ql.exec.FetchTask.fetch(FetchTask.java:150) 
> [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.ql.Driver.getFetchingTableResults(Driver.java:716) 
> [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at org.apache.hadoop.hive.ql.Driver.getResults(Driver.java:668) 
> [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.ql.reexec.ReExecDriver.getResults(ReExecDriver.java:241)
>  [hive-exec-4.0.0-SNAPSHOT.jar:4.0.0-SNAPSHOT]
> at 
> org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:277) 
> [hive-cli-4.0.0-SNAPSHOT.jar:?]
> at 
> 

[jira] [Work logged] (HIVE-25443) Arrow SerDe Cannot serialize/deserialize complex data types When there are more than 1024 values

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25443?focusedWorklogId=682738=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-682738
 ]

ASF GitHub Bot logged work on HIVE-25443:
-

Author: ASF GitHub Bot
Created on: 17/Nov/21 16:27
Start Date: 17/Nov/21 16:27
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk commented on pull request #2581:
URL: https://github.com/apache/hive/pull/2581#issuecomment-971744573


   @shameersss1 there are some commits with your r@amazon.com address - 
could you add associate that email address with your github account?
   without that the committer email address will be changed to some 
...@users.noreply.github.com


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 682738)
Time Spent: 20m  (was: 10m)

> Arrow SerDe Cannot serialize/deserialize complex data types When there are 
> more than 1024 values
> 
>
> Key: HIVE-25443
> URL: https://issues.apache.org/jira/browse/HIVE-25443
> Project: Hive
>  Issue Type: Bug
>  Components: Serializers/Deserializers
>Affects Versions: 3.1.0, 3.0.0, 3.1.1, 3.1.2
>Reporter: Syed Shameerur Rahman
>Assignee: Syed Shameerur Rahman
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Complex data types like MAP, STRUCT cannot be serialized/deserialzed using 
> Arrow SerDe when there are more than 1024 values. This happens due to 
> ColumnVector always being initialized with a size of 1024.
> Issue #1 : 
> https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/arrow/ArrowColumnarBatchSerDe.java#L213
> Issue #2 : 
> https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/io/arrow/ArrowColumnarBatchSerDe.java#L215
> Sample unit test to reproduce the case in TestArrowColumnarBatchSerDe :
> {code:java}
> @Test
>public void testListBooleanWithMoreThan1024Values() throws SerDeException {
>  String[][] schema = {
>  {"boolean_list", "array"},
>  };
>   
>  Object[][] rows = new Object[1025][1];
>  for (int i = 0; i < 1025; i++) {
>rows[i][0] = new BooleanWritable(true);
>  }
>   
>  initAndSerializeAndDeserialize(schema, toList(rows));
>}
>   
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Resolved] (HIVE-25670) Avoid getTable() calls for foreign key tables not used in a query

2021-11-17 Thread Zoltan Haindrich (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zoltan Haindrich resolved HIVE-25670.
-
Fix Version/s: 4.0.0
   Resolution: Fixed

merged into master. Thank you [~scarlin] !

> Avoid getTable() calls for foreign key tables not used in a query
> -
>
> Key: HIVE-25670
> URL: https://issues.apache.org/jira/browse/HIVE-25670
> Project: Hive
>  Issue Type: Improvement
>  Components: HiveServer2
>Reporter: Steve Carlin
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> In RelOptHiveTable, we generate the referential constraints for the table. In 
> this process, we make a metastore call to fetch these tables.  This is used 
> later on for potential gain on joins done on the key.
> However, there is no need to fetch these constraints if the table is not used 
> in the query. If we can get this information up front, we can save a bit on 
> compilation time.
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25670) Avoid getTable() calls for foreign key tables not used in a query

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25670?focusedWorklogId=682734=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-682734
 ]

ASF GitHub Bot logged work on HIVE-25670:
-

Author: ASF GitHub Bot
Created on: 17/Nov/21 16:23
Start Date: 17/Nov/21 16:23
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk merged pull request #2763:
URL: https://github.com/apache/hive/pull/2763


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 682734)
Time Spent: 1h 20m  (was: 1h 10m)

> Avoid getTable() calls for foreign key tables not used in a query
> -
>
> Key: HIVE-25670
> URL: https://issues.apache.org/jira/browse/HIVE-25670
> Project: Hive
>  Issue Type: Improvement
>  Components: HiveServer2
>Reporter: Steve Carlin
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> In RelOptHiveTable, we generate the referential constraints for the table. In 
> this process, we make a metastore call to fetch these tables.  This is used 
> later on for potential gain on joins done on the key.
> However, there is no need to fetch these constraints if the table is not used 
> in the query. If we can get this information up front, we can save a bit on 
> compilation time.
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25700) Prevent deletion of Notification Events post restarts

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25700?focusedWorklogId=682692=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-682692
 ]

ASF GitHub Bot logged work on HIVE-25700:
-

Author: ASF GitHub Bot
Created on: 17/Nov/21 15:27
Start Date: 17/Nov/21 15:27
Worklog Time Spent: 10m 
  Work Description: ayushtkn commented on a change in pull request #2789:
URL: https://github.com/apache/hive/pull/2789#discussion_r751353413



##
File path: 
hcatalog/server-extensions/src/main/java/org/apache/hive/hcatalog/listener/DbNotificationListener.java
##
@@ -1417,10 +1426,27 @@ private void process(NotificationEvent event, 
ListenerEvent listenerEvent) throw
   setTimeToLive(MetastoreConf.getTimeVar(conf, ttlConf, TimeUnit.SECONDS));
   setCleanupInterval(
   MetastoreConf.getTimeVar(conf, 
ConfVars.EVENT_DB_LISTENER_CLEAN_INTERVAL, TimeUnit.MILLISECONDS));
+  setWaitInterval(MetastoreConf
+  .getTimeVar(conf, EVENT_DB_LISTENER_CLEAN_STARTUP_WAIT_INTERVAL, 
TimeUnit.MILLISECONDS));
 }
 
 @Override
 public void run() {
+  LOG.info("Wait interval is {}", waitInterval);
+  if (waitInterval > 0) {
+try {
+  LOG.info("Cleaner Thread Restarted and {} or {} is configured. So 
cleaner thread will startup post waiting "
+  + "{} ms", EVENT_DB_LISTENER_CLEAN_STARTUP_WAIT_INTERVAL,
+  EVENT_DB_LISTENER_CLEAN_STARTUP_WAIT_INTERVAL.getHiveName(), 
waitInterval);
+  Thread.sleep(waitInterval);
+} catch (InterruptedException e) {
+  LOG.error("Failed during the initial wait before start.", e);
+  Thread.currentThread().interrupt();

Review comment:
   As discussed, Have wrapped this isTest check

##
File path: 
standalone-metastore/metastore-common/src/main/java/org/apache/hadoop/hive/metastore/conf/MetastoreConf.java
##
@@ -752,6 +752,9 @@ public static ConfVars getMetaConf(String name) {
 
EVENT_DB_LISTENER_CLEAN_INTERVAL("metastore.event.db.listener.clean.interval",
 "hive.metastore.event.db.listener.clean.interval", 7200, 
TimeUnit.SECONDS,
 "sleep interval between each run for cleanup of events from the 
database listener queue"),
+
EVENT_DB_LISTENER_CLEAN_STARTUP_WAIT_INTERVAL("metastore.event.db.listener.clean.startup.wait.interval",
+"hive.metastore.event.db.listener.clean.startup.wait.interval", 1, 
TimeUnit.DAYS,
+"wait interval post start of Cleaner thread after which the cleaner 
thread starts to work"),

Review comment:
   Done




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 682692)
Time Spent: 40m  (was: 0.5h)

> Prevent deletion of Notification Events post restarts
> -
>
> Key: HIVE-25700
> URL: https://issues.apache.org/jira/browse/HIVE-25700
> Project: Hive
>  Issue Type: Sub-task
>Reporter: Ayush Saxena
>Assignee: Ayush Saxena
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> In case of DR scenarios, when Hive service goes down, Prevent deletion of 
> entries in the Notification Log immediately, Give time for ADMINs to 
> reconfigure properties to handle further Replication process.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (HIVE-25717) INSERT INTO on external MariaDB/MySQL table fails silently

2021-11-17 Thread Stamatis Zampetakis (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445288#comment-17445288
 ] 

Stamatis Zampetakis commented on HIVE-25717:


It seems that this was caused as a side effect of fixing HIVE-22706.

> INSERT INTO on external MariaDB/MySQL table fails silently
> --
>
> Key: HIVE-25717
> URL: https://issues.apache.org/jira/browse/HIVE-25717
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: 4.0.0
>Reporter: Stamatis Zampetakis
>Assignee: Stamatis Zampetakis
>Priority: Major
> Attachments: jdbc_table_dml_mysql.q
>
>
> +MariaDB/MySQL+
> {code:sql}
> CREATE TABLE country (id   int, name varchar(20));
> insert into country values (1, 'India');
> insert into country values (2, 'Russia');
> insert into country values (3, 'USA');
> {code}
> +Hive+
> {code:sql}
> CREATE EXTERNAL TABLE country (id int, name varchar(20))
> STORED BY 'org.apache.hive.storage.jdbc.JdbcStorageHandler'
> TBLPROPERTIES (
> "hive.sql.database.type" = "MYSQL",
> "hive.sql.jdbc.driver" = "com.mysql.jdbc.Driver",
> "hive.sql.jdbc.url" = "jdbc:mysql://localhost:3306/qtestDB",
> "hive.sql.dbcp.username" = "root",
> "hive.sql.dbcp.password" = "qtestpassword",
> "hive.sql.table" = "country"
> );
> INSERT INTO country VALUES (8, 'Hungary');
> SELECT * FROM country;
> {code}
> +Expected results+
> ||ID||NAME||
> |1|   India|
> |2|   Russia|
> |3|   USA|
> |8|   Hungary|
> +Actual results+
> ||ID||NAME||
> |1|   India|
> |2|   Russia|
> |3|   USA|
> The {{INSERT INTO}} statement finishes without showing any kind of problem in 
> the logs but the row is not inserted in the table.
> Running the test it comes back green although the following exception is 
> printed in the System.err (not in the logs).
> {noformat}
> java.sql.SQLException: Parameter metadata not available for the given 
> statement
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:129)
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:97)
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:89)
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:63)
> at 
> com.mysql.cj.jdbc.MysqlParameterMetadata.checkAvailable(MysqlParameterMetadata.java:86)
> at 
> com.mysql.cj.jdbc.MysqlParameterMetadata.getParameterType(MysqlParameterMetadata.java:138)
> at 
> org.apache.hive.storage.jdbc.DBRecordWritable.write(DBRecordWritable.java:67)
> at 
> org.apache.hadoop.mapreduce.lib.db.DBOutputFormat$DBRecordWriter.write(DBOutputFormat.java:122)
> at 
> org.apache.hive.storage.jdbc.JdbcRecordWriter.write(JdbcRecordWriter.java:47)
> at 
> org.apache.hadoop.hive.ql.exec.FileSinkOperator.process(FileSinkOperator.java:1160)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.SelectOperator.process(SelectOperator.java:94)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.UDTFOperator.forwardUDTFOutput(UDTFOperator.java:133)
> at 
> org.apache.hadoop.hive.ql.udf.generic.UDTFCollector.collect(UDTFCollector.java:45)
> at 
> org.apache.hadoop.hive.ql.udf.generic.GenericUDTF.forward(GenericUDTF.java:110)
> at 
> org.apache.hadoop.hive.ql.udf.generic.GenericUDTFInline.process(GenericUDTFInline.java:64)
> at 
> org.apache.hadoop.hive.ql.exec.UDTFOperator.process(UDTFOperator.java:116)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.SelectOperator.process(SelectOperator.java:94)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanOperator.java:173)
> at 
> org.apache.hadoop.hive.ql.exec.MapOperator$MapOpCtx.forward(MapOperator.java:154)
> at 
> org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:552)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:101)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:83)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:414)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:311)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:277)
> at 
> 

[jira] [Commented] (HIVE-25717) INSERT INTO on external MariaDB/MySQL table fails silently

2021-11-17 Thread Stamatis Zampetakis (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445284#comment-17445284
 ] 

Stamatis Zampetakis commented on HIVE-25717:


The problem can be reproduced by running:
{code:bash}
mvn test -Dtest=TestMiniLlapLocalCliDriver -Dqfile=jdbc_table_dml_mysql.q 
-Dtest.output.overwrite
{code}


> INSERT INTO on external MariaDB/MySQL table fails silently
> --
>
> Key: HIVE-25717
> URL: https://issues.apache.org/jira/browse/HIVE-25717
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: 4.0.0
>Reporter: Stamatis Zampetakis
>Assignee: Stamatis Zampetakis
>Priority: Major
> Attachments: jdbc_table_dml_mysql.q
>
>
> +MariaDB/MySQL+
> {code:sql}
> CREATE TABLE country (id   int, name varchar(20));
> insert into country values (1, 'India');
> insert into country values (2, 'Russia');
> insert into country values (3, 'USA');
> {code}
> +Hive+
> {code:sql}
> CREATE EXTERNAL TABLE country (id int, name varchar(20))
> STORED BY 'org.apache.hive.storage.jdbc.JdbcStorageHandler'
> TBLPROPERTIES (
> "hive.sql.database.type" = "MYSQL",
> "hive.sql.jdbc.driver" = "com.mysql.jdbc.Driver",
> "hive.sql.jdbc.url" = "jdbc:mysql://localhost:3306/qtestDB",
> "hive.sql.dbcp.username" = "root",
> "hive.sql.dbcp.password" = "qtestpassword",
> "hive.sql.table" = "country"
> );
> INSERT INTO country VALUES (8, 'Hungary');
> SELECT * FROM country;
> {code}
> +Expected results+
> ||ID||NAME||
> |1|   India|
> |2|   Russia|
> |3|   USA|
> |8|   Hungary|
> +Actual results+
> ||ID||NAME||
> |1|   India|
> |2|   Russia|
> |3|   USA|
> The {{INSERT INTO}} statement finishes without showing any kind of problem in 
> the logs but the row is not inserted in the table.
> Running the test it comes back green although the following exception is 
> printed in the System.err (not in the logs).
> {noformat}
> java.sql.SQLException: Parameter metadata not available for the given 
> statement
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:129)
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:97)
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:89)
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:63)
> at 
> com.mysql.cj.jdbc.MysqlParameterMetadata.checkAvailable(MysqlParameterMetadata.java:86)
> at 
> com.mysql.cj.jdbc.MysqlParameterMetadata.getParameterType(MysqlParameterMetadata.java:138)
> at 
> org.apache.hive.storage.jdbc.DBRecordWritable.write(DBRecordWritable.java:67)
> at 
> org.apache.hadoop.mapreduce.lib.db.DBOutputFormat$DBRecordWriter.write(DBOutputFormat.java:122)
> at 
> org.apache.hive.storage.jdbc.JdbcRecordWriter.write(JdbcRecordWriter.java:47)
> at 
> org.apache.hadoop.hive.ql.exec.FileSinkOperator.process(FileSinkOperator.java:1160)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.SelectOperator.process(SelectOperator.java:94)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.UDTFOperator.forwardUDTFOutput(UDTFOperator.java:133)
> at 
> org.apache.hadoop.hive.ql.udf.generic.UDTFCollector.collect(UDTFCollector.java:45)
> at 
> org.apache.hadoop.hive.ql.udf.generic.GenericUDTF.forward(GenericUDTF.java:110)
> at 
> org.apache.hadoop.hive.ql.udf.generic.GenericUDTFInline.process(GenericUDTFInline.java:64)
> at 
> org.apache.hadoop.hive.ql.exec.UDTFOperator.process(UDTFOperator.java:116)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.SelectOperator.process(SelectOperator.java:94)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanOperator.java:173)
> at 
> org.apache.hadoop.hive.ql.exec.MapOperator$MapOpCtx.forward(MapOperator.java:154)
> at 
> org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:552)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:101)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:83)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:414)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:311)
> at 
> 

[jira] [Updated] (HIVE-25717) INSERT INTO on external MariaDB/MySQL table fails silently

2021-11-17 Thread Stamatis Zampetakis (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stamatis Zampetakis updated HIVE-25717:
---
Attachment: jdbc_table_dml_mysql.q

> INSERT INTO on external MariaDB/MySQL table fails silently
> --
>
> Key: HIVE-25717
> URL: https://issues.apache.org/jira/browse/HIVE-25717
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: 4.0.0
>Reporter: Stamatis Zampetakis
>Assignee: Stamatis Zampetakis
>Priority: Major
> Attachments: jdbc_table_dml_mysql.q
>
>
> +MariaDB/MySQL+
> {code:sql}
> CREATE TABLE country (id   int, name varchar(20));
> insert into country values (1, 'India');
> insert into country values (2, 'Russia');
> insert into country values (3, 'USA');
> {code}
> +Hive+
> {code:sql}
> CREATE EXTERNAL TABLE country (id int, name varchar(20))
> STORED BY 'org.apache.hive.storage.jdbc.JdbcStorageHandler'
> TBLPROPERTIES (
> "hive.sql.database.type" = "MYSQL",
> "hive.sql.jdbc.driver" = "com.mysql.jdbc.Driver",
> "hive.sql.jdbc.url" = "jdbc:mysql://localhost:3306/qtestDB",
> "hive.sql.dbcp.username" = "root",
> "hive.sql.dbcp.password" = "qtestpassword",
> "hive.sql.table" = "country"
> );
> INSERT INTO country VALUES (8, 'Hungary');
> SELECT * FROM country;
> {code}
> +Expected results+
> ||ID||NAME||
> |1|   India|
> |2|   Russia|
> |3|   USA|
> |8|   Hungary|
> +Actual results+
> ||ID||NAME||
> |1|   India|
> |2|   Russia|
> |3|   USA|
> The {{INSERT INTO}} statement finishes without showing any kind of problem in 
> the logs but the row is not inserted in the table.
> Running the test it comes back green although the following exception is 
> printed in the System.err (not in the logs).
> {noformat}
> java.sql.SQLException: Parameter metadata not available for the given 
> statement
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:129)
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:97)
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:89)
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:63)
> at 
> com.mysql.cj.jdbc.MysqlParameterMetadata.checkAvailable(MysqlParameterMetadata.java:86)
> at 
> com.mysql.cj.jdbc.MysqlParameterMetadata.getParameterType(MysqlParameterMetadata.java:138)
> at 
> org.apache.hive.storage.jdbc.DBRecordWritable.write(DBRecordWritable.java:67)
> at 
> org.apache.hadoop.mapreduce.lib.db.DBOutputFormat$DBRecordWriter.write(DBOutputFormat.java:122)
> at 
> org.apache.hive.storage.jdbc.JdbcRecordWriter.write(JdbcRecordWriter.java:47)
> at 
> org.apache.hadoop.hive.ql.exec.FileSinkOperator.process(FileSinkOperator.java:1160)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.SelectOperator.process(SelectOperator.java:94)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.UDTFOperator.forwardUDTFOutput(UDTFOperator.java:133)
> at 
> org.apache.hadoop.hive.ql.udf.generic.UDTFCollector.collect(UDTFCollector.java:45)
> at 
> org.apache.hadoop.hive.ql.udf.generic.GenericUDTF.forward(GenericUDTF.java:110)
> at 
> org.apache.hadoop.hive.ql.udf.generic.GenericUDTFInline.process(GenericUDTFInline.java:64)
> at 
> org.apache.hadoop.hive.ql.exec.UDTFOperator.process(UDTFOperator.java:116)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.SelectOperator.process(SelectOperator.java:94)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanOperator.java:173)
> at 
> org.apache.hadoop.hive.ql.exec.MapOperator$MapOpCtx.forward(MapOperator.java:154)
> at 
> org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:552)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:101)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:83)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:414)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:311)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:277)
> at 
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:381)
> at 
> 

[jira] [Assigned] (HIVE-25717) INSERT INTO on external MariaDB/MySQL table fails silently

2021-11-17 Thread Stamatis Zampetakis (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stamatis Zampetakis reassigned HIVE-25717:
--


> INSERT INTO on external MariaDB/MySQL table fails silently
> --
>
> Key: HIVE-25717
> URL: https://issues.apache.org/jira/browse/HIVE-25717
> Project: Hive
>  Issue Type: Bug
>  Components: HiveServer2
>Affects Versions: 4.0.0
>Reporter: Stamatis Zampetakis
>Assignee: Stamatis Zampetakis
>Priority: Major
>
> +MariaDB/MySQL+
> {code:sql}
> CREATE TABLE country (id   int, name varchar(20));
> insert into country values (1, 'India');
> insert into country values (2, 'Russia');
> insert into country values (3, 'USA');
> {code}
> +Hive+
> {code:sql}
> CREATE EXTERNAL TABLE country (id int, name varchar(20))
> STORED BY 'org.apache.hive.storage.jdbc.JdbcStorageHandler'
> TBLPROPERTIES (
> "hive.sql.database.type" = "MYSQL",
> "hive.sql.jdbc.driver" = "com.mysql.jdbc.Driver",
> "hive.sql.jdbc.url" = "jdbc:mysql://localhost:3306/qtestDB",
> "hive.sql.dbcp.username" = "root",
> "hive.sql.dbcp.password" = "qtestpassword",
> "hive.sql.table" = "country"
> );
> INSERT INTO country VALUES (8, 'Hungary');
> SELECT * FROM country;
> {code}
> +Expected results+
> ||ID||NAME||
> |1|   India|
> |2|   Russia|
> |3|   USA|
> |8|   Hungary|
> +Actual results+
> ||ID||NAME||
> |1|   India|
> |2|   Russia|
> |3|   USA|
> The {{INSERT INTO}} statement finishes without showing any kind of problem in 
> the logs but the row is not inserted in the table.
> Running the test it comes back green although the following exception is 
> printed in the System.err (not in the logs).
> {noformat}
> java.sql.SQLException: Parameter metadata not available for the given 
> statement
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:129)
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:97)
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:89)
> at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:63)
> at 
> com.mysql.cj.jdbc.MysqlParameterMetadata.checkAvailable(MysqlParameterMetadata.java:86)
> at 
> com.mysql.cj.jdbc.MysqlParameterMetadata.getParameterType(MysqlParameterMetadata.java:138)
> at 
> org.apache.hive.storage.jdbc.DBRecordWritable.write(DBRecordWritable.java:67)
> at 
> org.apache.hadoop.mapreduce.lib.db.DBOutputFormat$DBRecordWriter.write(DBOutputFormat.java:122)
> at 
> org.apache.hive.storage.jdbc.JdbcRecordWriter.write(JdbcRecordWriter.java:47)
> at 
> org.apache.hadoop.hive.ql.exec.FileSinkOperator.process(FileSinkOperator.java:1160)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.SelectOperator.process(SelectOperator.java:94)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.UDTFOperator.forwardUDTFOutput(UDTFOperator.java:133)
> at 
> org.apache.hadoop.hive.ql.udf.generic.UDTFCollector.collect(UDTFCollector.java:45)
> at 
> org.apache.hadoop.hive.ql.udf.generic.GenericUDTF.forward(GenericUDTF.java:110)
> at 
> org.apache.hadoop.hive.ql.udf.generic.GenericUDTFInline.process(GenericUDTFInline.java:64)
> at 
> org.apache.hadoop.hive.ql.exec.UDTFOperator.process(UDTFOperator.java:116)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.SelectOperator.process(SelectOperator.java:94)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:888)
> at 
> org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanOperator.java:173)
> at 
> org.apache.hadoop.hive.ql.exec.MapOperator$MapOpCtx.forward(MapOperator.java:154)
> at 
> org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:552)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:101)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:83)
> at 
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:414)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:311)
> at 
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:277)
> at 
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:381)
> at 
> org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:82)
> at 
> 

[jira] [Updated] (HIVE-25716) Fix flaky test TestCompactionMetrics#testOldestReadyForCleaningAge

2021-11-17 Thread Karen Coppage (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Karen Coppage updated HIVE-25716:
-
Labels: flaky-test  (was: )

> Fix flaky test TestCompactionMetrics#testOldestReadyForCleaningAge
> --
>
> Key: HIVE-25716
> URL: https://issues.apache.org/jira/browse/HIVE-25716
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 4.0.0
>Reporter: Karen Coppage
>Priority: Major
>  Labels: flaky-test
>
> Flaky check failed on run #59:
> [http://ci.hive.apache.org/job/hive-flaky-check/467/|http://ci.hive.apache.org/job/hive-flaky-check/467/]
> {code:java}
> java.lang.AssertionError
>   at org.junit.Assert.fail(Assert.java:87)
>   at org.junit.Assert.assertTrue(Assert.java:42)
>   at org.junit.Assert.assertTrue(Assert.java:53)
>   at 
> org.apache.hadoop.hive.ql.txn.compactor.TestCompactionMetrics.testOldestReadyForCleaningAge(TestCompactionMetrics.java:214)
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-24390) Spelling

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-24390?focusedWorklogId=682655=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-682655
 ]

ASF GitHub Bot logged work on HIVE-24390:
-

Author: ASF GitHub Bot
Created on: 17/Nov/21 14:34
Start Date: 17/Nov/21 14:34
Worklog Time Spent: 10m 
  Work Description: jsoref commented on pull request #1674:
URL: https://github.com/apache/hive/pull/1674#issuecomment-971639430


   Yes. That's my plan. This is sadly a tiny slice. I think I'll do another PR 
of about the same size. When we get to the end, I'll see about a PR for the 
action-workflow.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 682655)
Time Spent: 2h 20m  (was: 2h 10m)

> Spelling
> 
>
> Key: HIVE-24390
> URL: https://issues.apache.org/jira/browse/HIVE-24390
> Project: Hive
>  Issue Type: Bug
>Reporter: Josh Soref
>Priority: Minor
>  Labels: pull-request-available
>  Time Spent: 2h 20m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (HIVE-25690) Fix column reorder detection for Iceberg schema evolution

2021-11-17 Thread Marton Bod (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445157#comment-17445157
 ] 

Marton Bod commented on HIVE-25690:
---

Pushed to master. Thanks [~szita] for reviewing!

> Fix column reorder detection for Iceberg schema evolution
> -
>
> Key: HIVE-25690
> URL: https://issues.apache.org/jira/browse/HIVE-25690
> Project: Hive
>  Issue Type: Bug
>Reporter: Marton Bod
>Assignee: Marton Bod
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> Current algorithm for detecting schema differences between HMS and Iceberg 
> schema is broken when it comes to column reorders. This patch should fix that 
> up and add more extensive testing.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Resolved] (HIVE-25690) Fix column reorder detection for Iceberg schema evolution

2021-11-17 Thread Marton Bod (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marton Bod resolved HIVE-25690.
---
Resolution: Fixed

> Fix column reorder detection for Iceberg schema evolution
> -
>
> Key: HIVE-25690
> URL: https://issues.apache.org/jira/browse/HIVE-25690
> Project: Hive
>  Issue Type: Bug
>Reporter: Marton Bod
>Assignee: Marton Bod
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> Current algorithm for detecting schema differences between HMS and Iceberg 
> schema is broken when it comes to column reorders. This patch should fix that 
> up and add more extensive testing.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25690) Fix column reorder detection for Iceberg schema evolution

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25690?focusedWorklogId=682605=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-682605
 ]

ASF GitHub Bot logged work on HIVE-25690:
-

Author: ASF GitHub Bot
Created on: 17/Nov/21 13:19
Start Date: 17/Nov/21 13:19
Worklog Time Spent: 10m 
  Work Description: marton-bod merged pull request #2779:
URL: https://github.com/apache/hive/pull/2779


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 682605)
Time Spent: 50m  (was: 40m)

> Fix column reorder detection for Iceberg schema evolution
> -
>
> Key: HIVE-25690
> URL: https://issues.apache.org/jira/browse/HIVE-25690
> Project: Hive
>  Issue Type: Bug
>Reporter: Marton Bod
>Assignee: Marton Bod
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> Current algorithm for detecting schema differences between HMS and Iceberg 
> schema is broken when it comes to column reorders. This patch should fix that 
> up and add more extensive testing.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25582) Empty result when using offset limit with MR

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25582?focusedWorklogId=682595=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-682595
 ]

ASF GitHub Bot logged work on HIVE-25582:
-

Author: ASF GitHub Bot
Created on: 17/Nov/21 13:11
Start Date: 17/Nov/21 13:11
Worklog Time Spent: 10m 
  Work Description: dengzhhu653 commented on pull request #2693:
URL: https://github.com/apache/hive/pull/2693#issuecomment-971565843


   The failed test seems unrelated...


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 682595)
Time Spent: 1h 40m  (was: 1.5h)

> Empty result when using offset limit with MR
> 
>
> Key: HIVE-25582
> URL: https://issues.apache.org/jira/browse/HIVE-25582
> Project: Hive
>  Issue Type: Bug
>  Components: Operators
>Affects Versions: 4.0.0
>Reporter: Zhihua Deng
>Assignee: Zhihua Deng
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
> The _mr.ObjectCache_ caches nothing, every time when the limit [retrieving 
> global counter from the 
> cache|https://github.com/apache/hive/blob/7b3ecf617a6d46f48a3b6f77e0339fd4ad95a420/ql/src/java/org/apache/hadoop/hive/ql/exec/LimitOperator.java#L150-L161],
>  a new AtomicInteger will be returned. This make offset _<= 
> currentCountForAllTasksInt_ always be evaluated to false, as _offset > 0_, 
> the operator will skip all rows.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Resolved] (HIVE-25701) Declare JDBC drivers as runtime & optional dependencies

2021-11-17 Thread Stamatis Zampetakis (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stamatis Zampetakis resolved HIVE-25701.

Fix Version/s: 4.0.0
   Resolution: Fixed

Fixed in 
[34158d8531725ab778a6c6112391370b44669bd8|https://github.com/apache/hive/commit/34158d8531725ab778a6c6112391370b44669bd8].
 Thanks for the review [~kgyrtkirk]!

> Declare JDBC drivers as runtime & optional dependencies
> ---
>
> Key: HIVE-25701
> URL: https://issues.apache.org/jira/browse/HIVE-25701
> Project: Hive
>  Issue Type: Task
>  Components: Standalone Metastore, Testing Infrastructure
>Reporter: Stamatis Zampetakis
>Assignee: Stamatis Zampetakis
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Currently, we are using the following JDBC drivers in various Hive modules:
> * MariaDB
> * MySQL
> * Oracle
> * Postgres
> * MSSQL
> * Derby
> MariaDB, MySQL, and Oracle licenses are not compatible with Apache License 2 
> ([Category-X |https://www.apache.org/legal/resolved.html#category-x]) and in 
> the past we used various ways to circumvent licensing problems (see 
> HIVE-23284). Now, some of them appear as test scope dependency which is OKish 
> but in the near future may lead again to licensing problems.
> JDBC drivers are only needed at runtime so they could all be declared at 
> runtime scope. Moreover, Hive does not require a specific JDBC driver in 
> order to operate so they are all optional. 
> The goal of this issue is to declare every JDBC driver at runtime scope and 
> mark it as optional 
> ([ASF-optional|https://www.apache.org/legal/resolved.html#optional], 
> [maven-optional|https://maven.apache.org/guides/introduction/introduction-to-optional-and-excludes-dependencies.html]).
>  
> This has the following advantages:
> * Eliminates the risk to write code which needs JDBC driver classes in order 
> to compile and potentially violate AL2.
> * Unifies the declaration of JDBC drivers making easier to add/remove some if 
> necessary.
> * Removes the need to use download-maven-plugin and other similar workarounds 
> to avoid licensing problems.
> * Simplifies the execution of tests using these drivers since now they are 
> added in the runtime classpath automatically by maven.
> * Projects with dependencies depending on Hive will not inherit any JDBC 
> driver by default.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25701) Declare JDBC drivers as runtime & optional dependencies

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25701?focusedWorklogId=682571=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-682571
 ]

ASF GitHub Bot logged work on HIVE-25701:
-

Author: ASF GitHub Bot
Created on: 17/Nov/21 12:28
Start Date: 17/Nov/21 12:28
Worklog Time Spent: 10m 
  Work Description: zabetak closed pull request #2790:
URL: https://github.com/apache/hive/pull/2790


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 682571)
Time Spent: 0.5h  (was: 20m)

> Declare JDBC drivers as runtime & optional dependencies
> ---
>
> Key: HIVE-25701
> URL: https://issues.apache.org/jira/browse/HIVE-25701
> Project: Hive
>  Issue Type: Task
>  Components: Standalone Metastore, Testing Infrastructure
>Reporter: Stamatis Zampetakis
>Assignee: Stamatis Zampetakis
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Currently, we are using the following JDBC drivers in various Hive modules:
> * MariaDB
> * MySQL
> * Oracle
> * Postgres
> * MSSQL
> * Derby
> MariaDB, MySQL, and Oracle licenses are not compatible with Apache License 2 
> ([Category-X |https://www.apache.org/legal/resolved.html#category-x]) and in 
> the past we used various ways to circumvent licensing problems (see 
> HIVE-23284). Now, some of them appear as test scope dependency which is OKish 
> but in the near future may lead again to licensing problems.
> JDBC drivers are only needed at runtime so they could all be declared at 
> runtime scope. Moreover, Hive does not require a specific JDBC driver in 
> order to operate so they are all optional. 
> The goal of this issue is to declare every JDBC driver at runtime scope and 
> mark it as optional 
> ([ASF-optional|https://www.apache.org/legal/resolved.html#optional], 
> [maven-optional|https://maven.apache.org/guides/introduction/introduction-to-optional-and-excludes-dependencies.html]).
>  
> This has the following advantages:
> * Eliminates the risk to write code which needs JDBC driver classes in order 
> to compile and potentially violate AL2.
> * Unifies the declaration of JDBC drivers making easier to add/remove some if 
> necessary.
> * Removes the need to use download-maven-plugin and other similar workarounds 
> to avoid licensing problems.
> * Simplifies the execution of tests using these drivers since now they are 
> added in the runtime classpath automatically by maven.
> * Projects with dependencies depending on Hive will not inherit any JDBC 
> driver by default.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25656) Get materialized view state based on number of affected rows of transactions

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25656?focusedWorklogId=682568=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-682568
 ]

ASF GitHub Bot logged work on HIVE-25656:
-

Author: ASF GitHub Bot
Created on: 17/Nov/21 12:27
Start Date: 17/Nov/21 12:27
Worklog Time Spent: 10m 
  Work Description: kasakrisz commented on a change in pull request #2756:
URL: https://github.com/apache/hive/pull/2756#discussion_r751191780



##
File path: ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
##
@@ -12847,13 +12848,12 @@ protected void saveViewDefinition() throws 
SemanticException {
 createVwDesc.setViewExpandedText(expandedText);
   }
 
-  private Set getTablesUsed(ParseContext parseCtx) {
-Set tablesUsed = new HashSet<>();
+  private Set getTablesUsed(ParseContext parseCtx) {

Review comment:
   Try to move this method to `ParseContext`




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 682568)
Time Spent: 2h 10m  (was: 2h)

> Get materialized view state based on number of affected rows of transactions
> 
>
> Key: HIVE-25656
> URL: https://issues.apache.org/jira/browse/HIVE-25656
> Project: Hive
>  Issue Type: Improvement
>  Components: Materialized views, Transactions
>Reporter: Krisztian Kasa
>Assignee: Krisztian Kasa
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>
> To enable the faster incremental rebuild of materialized views presence of 
> update/delete operations on the source tables of the view since the last 
> rebuild must be checked. Based on the outcome different plan is generated for 
> scenarios in presence of update/delete and insert only operations.
> Currently this is done by querying the COMPLETED_TXN_COMPONENTS table however 
> the records from this table is cleaned when MV source tables are compacted. 
> This reduces the chances of incremental MV rebuild.
> The goal of this patch is to find an alternative way to store and retrieve 
> this information.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25531) Remove the core classified hive-exec artifact

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25531?focusedWorklogId=682565=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-682565
 ]

ASF GitHub Bot logged work on HIVE-25531:
-

Author: ASF GitHub Bot
Created on: 17/Nov/21 12:22
Start Date: 17/Nov/21 12:22
Worklog Time Spent: 10m 
  Work Description: zabetak commented on a change in pull request #2648:
URL: https://github.com/apache/hive/pull/2648#discussion_r751178653



##
File path: ql/pom.xml
##
@@ -1036,26 +1036,6 @@
   test-jar
 
   
-  
-
-core-jar
-package
-
-  jar
-
-
-  core
-
-  

Review comment:
   nit: Remove also line 1085/1065.




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 682565)
Time Spent: 20m  (was: 10m)

> Remove the core classified hive-exec artifact
> -
>
> Key: HIVE-25531
> URL: https://issues.apache.org/jira/browse/HIVE-25531
> Project: Hive
>  Issue Type: Improvement
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> * this artifact was introduced in HIVE-7423 
> * loading this artifact and the shaded hive-exec (along with the jdbc driver) 
> could create interesting classpath problems
> * if other projects have issues with the shaded hive-exec artifact we must 
> start fix those problems



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (HIVE-25714) Some tests are flaky because docker is not able to start in 5 seconds

2021-11-17 Thread Stamatis Zampetakis (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445118#comment-17445118
 ] 

Stamatis Zampetakis commented on HIVE-25714:


The good news is that the flaky checker is still running which means that the 
problem may not be that often :D Maybe we can just increase the timeout till a 
better solution is found.

> Some tests are flaky because docker is not able to start in 5 seconds
> -
>
> Key: HIVE-25714
> URL: https://issues.apache.org/jira/browse/HIVE-25714
> Project: Hive
>  Issue Type: Improvement
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>
> there are some testruns failing with; and on the test site multiple pods are 
> running in parallel - its not an ideal environment for tight deadlines
> {code}
> Unexpected exception java.lang.RuntimeException: Process docker failed to run 
> in 5 seconds
>  at 
> org.apache.hadoop.hive.ql.externalDB.AbstractExternalDB.runCmd(AbstractExternalDB.java:92)
>  at 
> org.apache.hadoop.hive.ql.externalDB.AbstractExternalDB.launchDockerContainer(AbstractExternalDB.java:123)
>  at 
> org.apache.hadoop.hive.ql.qoption.QTestDatabaseHandler.beforeTest(QTestDatabaseHandler.java:111)
>  at 
> org.apache.hadoop.hive.ql.qoption.QTestOptionDispatcher.beforeTest(QTestOptionDispatcher.java:79)
> {code}
> http://ci.hive.apache.org/job/hive-precommit/job/PR-1674/4/testReport/junit/org.apache.hadoop.hive.cli.split19/TestMiniLlapLocalCliDriver/Testing___split_14___PostProcess___testCliDriver_qt_database_all_/



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (HIVE-25715) Provide nightly builds

2021-11-17 Thread Zoltan Haindrich (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445119#comment-17445119
 ] 

Zoltan Haindrich commented on HIVE-25715:
-

there are different levels I could think of:
* ugly-duckling:
** just do a dist build and take 
packaging/target/apache-hive-4.0.0-SNAPSHOT-bin/apache-hive-4.0.0-SNAPSHOT-bin/ 
as the nightly build
** this will work; however its version will be 4.0.0-SNAPSHOT
* twisted:
** run a versions:set on the project which will change the versions of the 
packages "connected" to the root pom
** this will have more proper versions on the outside; but internally storage 
api will run with X-SNAPSHOT
* frankeinstein:
** reconnect the disconnected pom.xml-s to the root and then run a versions:set
** this approach will make storage-api pick up the hive version in these builds
** it might have its own problems because in the original project the pom-s are 
disconnected
* gentlemen:
** go into disconnected modules and update the versions for those as well 
(storage-api,standalone-metastore,?)
** this will need some script magic since maven cant do these updates correctly 
outside

I think I'll go with the "twisted" approach for now - doing the proper one 
could probably also reused as some kind of level release automation...the 
twisted approach is simple enough to start with and build the infra around it ; 
and we can move forward to "gentlemen" later if there is interest in it

> Provide nightly builds
> --
>
> Key: HIVE-25715
> URL: https://issues.apache.org/jira/browse/HIVE-25715
> Project: Hive
>  Issue Type: Improvement
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>
> provide nightly builds for the master branch



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25405) Implement Connector Provider for Amazon Redshift

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25405?focusedWorklogId=682553=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-682553
 ]

ASF GitHub Bot logged work on HIVE-25405:
-

Author: ASF GitHub Bot
Created on: 17/Nov/21 11:55
Start Date: 17/Nov/21 11:55
Worklog Time Spent: 10m 
  Work Description: zabetak commented on a change in pull request #2546:
URL: https://github.com/apache/hive/pull/2546#discussion_r751148777



##
File path: 
standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/dataconnector/jdbc/RedshiftConnectorProvider.java
##
@@ -0,0 +1,86 @@
+package org.apache.hadoop.hive.metastore.dataconnector.jdbc;
+
+import org.apache.hadoop.hive.metastore.ColumnType;
+import org.apache.hadoop.hive.metastore.api.DataConnector;
+import org.apache.hadoop.hive.metastore.api.MetaException;
+import org.apache.hadoop.hive.metastore.api.Table;
+
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+import java.sql.ResultSet;
+import java.sql.SQLException;
+
+public class RedshiftConnectorProvider extends AbstractJDBCConnectorProvider {
+private static Logger LOG = 
LoggerFactory.getLogger(RedshiftConnectorProvider.class);
+
+private static final String DRIVER_CLASS = 
"com.amazon.redshift.jdbc42.Driver".intern();
+
+public RedshiftConnectorProvider(String dbName, DataConnector dataConn) {
+super(dbName, dataConn, DRIVER_CLASS);
+}
+
+protected String getDataType(String dbDataType, int size) {
+String mappedType = super.getDataType(dbDataType, size);
+
+// The VOID type points to the corresponding datatype not existing in 
hive. These datatypes are datastore
+// specific. They need special handling. An example would be the 
Geometric type that is not supported in Hive.
+// The other cases where a datatype in redshift is resolved to a VOID 
type are during the use of aliases like
+// float8, int8 etc. These can be mapped to existing hive types and 
are done below.
+if (!mappedType.equalsIgnoreCase(ColumnType.VOID_TYPE_NAME)) {
+return mappedType;
+}
+
+// map any db specific types here.

Review comment:
   The Intention is clear, comment is redundant.
   ```suggestion
   ```

##
File path: 
standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/dataconnector/jdbc/RedshiftConnectorProvider.java
##
@@ -0,0 +1,86 @@
+package org.apache.hadoop.hive.metastore.dataconnector.jdbc;
+
+import org.apache.hadoop.hive.metastore.ColumnType;
+import org.apache.hadoop.hive.metastore.api.DataConnector;
+import org.apache.hadoop.hive.metastore.api.MetaException;
+import org.apache.hadoop.hive.metastore.api.Table;
+
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+import java.sql.ResultSet;
+import java.sql.SQLException;
+
+public class RedshiftConnectorProvider extends AbstractJDBCConnectorProvider {
+private static Logger LOG = 
LoggerFactory.getLogger(RedshiftConnectorProvider.class);
+
+private static final String DRIVER_CLASS = 
"com.amazon.redshift.jdbc42.Driver".intern();

Review comment:
   There is no benefit at all calling intern() on static final field. 
Actually, I don't see a good reason to have field declaration here. We could 
inline the string to the constructor.

##
File path: ql/src/test/queries/clientpositive/redshift_data_connector.q
##
@@ -0,0 +1,65 @@
+-- SORT_QUERY_RESULTS
+SHOW CONNECTORS;
+
+-- CREATE with comment
+CREATE CONNECTOR redshift_test
+TYPE 'redshift'
+URL 
'jdbc:redshift://redshift-cluster-1.c1gffkxfot1v.us-east-2.redshift.amazonaws.com:5439/dev'

Review comment:
   Agree with Zoltan, I don't think we can add this. Very likely now the 
Redhshift instance specified here may not be available and the test will fail.
   
   Since Postgres and Redshift have many similarities I would suggest adding at 
least CONNECTOR test with Postgres. An easy way to do it would be using 
`--!qt:database:postgres:` option.

##
File path: 
standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/dataconnector/jdbc/AbstractJDBCConnectorProvider.java
##
@@ -185,9 +185,18 @@ protected Connection getConnection() {
 return null;
   }
 
-  protected abstract ResultSet fetchTableMetadata(String tableName) throws 
MetaException;
+  protected ResultSet fetchTableMetadata(String tableName) throws 
MetaException {
+try {
+  return fetchTablesViaDBMetaData(tableName);
+}
+catch (SQLException sqle) {
+  throw new MetaException("Error while trying to access the table names in 
the database" + sqle);
+}
+  }
 
-  protected abstract ResultSet fetchTableNames() throws MetaException;
+  protected ResultSet fetchTableNames() throws MetaException {
+return fetchTableMetadata(null);
+  }


[jira] [Work logged] (HIVE-25656) Get materialized view state based on number of affected rows of transactions

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25656?focusedWorklogId=682537=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-682537
 ]

ASF GitHub Bot logged work on HIVE-25656:
-

Author: ASF GitHub Bot
Created on: 17/Nov/21 11:26
Start Date: 17/Nov/21 11:26
Worklog Time Spent: 10m 
  Work Description: kasakrisz commented on a change in pull request #2756:
URL: https://github.com/apache/hive/pull/2756#discussion_r751148742



##
File path: ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
##
@@ -12847,13 +12849,21 @@ protected void saveViewDefinition() throws 
SemanticException {
 createVwDesc.setViewExpandedText(expandedText);
   }
 
-  private Set getTablesUsed(ParseContext parseCtx) {
-Set tablesUsed = new HashSet<>();
+  private Set getTablesUsed(ParseContext parseCtx) {
+Set tablesUsed = new HashSet<>();
 for (TableScanOperator topOp : parseCtx.getTopOps().values()) {
   Table table = topOp.getConf().getTableMetadata();
   if (!table.isMaterializedTable() && !table.isView()) {
 // Add to signature
-tablesUsed.add(table.getFullyQualifiedName());
+SourceTable sourceTable = new SourceTable();

Review comment:
   Refactored how `SourceTable` instances are created and in some cases 
like the one you mentioned `TableName` is a better choice because the stats 
contained by `SourceTable` is not needed here.
   Added factory methods to `ql.metadata.Table`
   * `SourceTable createSourceTable()` - to create a `SourceTable` instance in 
an initial state referencing this table instance.
   * `TableName getFullTableName()` - to create a `TableName` using this table 
instance.




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 682537)
Time Spent: 2h  (was: 1h 50m)

> Get materialized view state based on number of affected rows of transactions
> 
>
> Key: HIVE-25656
> URL: https://issues.apache.org/jira/browse/HIVE-25656
> Project: Hive
>  Issue Type: Improvement
>  Components: Materialized views, Transactions
>Reporter: Krisztian Kasa
>Assignee: Krisztian Kasa
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 2h
>  Remaining Estimate: 0h
>
> To enable the faster incremental rebuild of materialized views presence of 
> update/delete operations on the source tables of the view since the last 
> rebuild must be checked. Based on the outcome different plan is generated for 
> scenarios in presence of update/delete and insert only operations.
> Currently this is done by querying the COMPLETED_TXN_COMPONENTS table however 
> the records from this table is cleaned when MV source tables are compacted. 
> This reduces the chances of incremental MV rebuild.
> The goal of this patch is to find an alternative way to store and retrieve 
> this information.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Assigned] (HIVE-25715) Provide nightly builds

2021-11-17 Thread Zoltan Haindrich (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zoltan Haindrich reassigned HIVE-25715:
---


> Provide nightly builds
> --
>
> Key: HIVE-25715
> URL: https://issues.apache.org/jira/browse/HIVE-25715
> Project: Hive
>  Issue Type: Improvement
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>
> provide nightly builds for the master branch



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HIVE-22527) Hive on Tez : Job of merging small files will be submitted into another queue (default queue)

2021-11-17 Thread Jira


 [ 
https://issues.apache.org/jira/browse/HIVE-22527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

László Bodor updated HIVE-22527:

Summary: Hive on Tez : Job of merging small files will be submitted into 
another queue (default queue)  (was: Hive on Tez : Job of merging samll files 
will be submitted into another queue (default queue))

> Hive on Tez : Job of merging small files will be submitted into another queue 
> (default queue)
> -
>
> Key: HIVE-22527
> URL: https://issues.apache.org/jira/browse/HIVE-22527
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 3.1.0, 3.1.1
>Reporter: zhangbutao
>Assignee: zhangbutao
>Priority: Blocker
> Fix For: 4.0.0
>
> Attachments: HIVE-22527-branch-3.1.0.patch, HIVE-22527.01.patch, 
> HIVE-22527.02.patch, explain with merge files.png, file merge job.png, hive 
> logs.png
>
>
> Hive on Tez. We enable small file merge configuration with set 
> *hive.merge.tezfiles=true*. So , There will be another job launched for 
> merging files after sql job. However, the merge file job is submitted into 
> another yarn queue, not the queue of current beeline client session. It seems 
> that the merging files job start a new tez session with new conf which is 
> different the current session conf, leading to the merging file job goes into 
> default queue.
>  
> Attachment *hive logs.png* shows that current session queue is 
> *root.bdoc.production* ( String queueName = session.getQueueName();) incoming 
> queue name is *null* ( String confQueueName = 
> conf.get(TezConfiguration.TEZ_QUEUE_NAME);). In fact, we log in to the same 
> beeline client with *set tez.queue.name=* *root.bdoc.production,* and  all  
> jobs should be submitted into the same queue including file merge job.
> [https://github.com/apache/hive/blob/bcc7df95824831a8d2f1524e4048dfc23ab98c19/ql/src/java/org/apache/hadoop/hive/ql/exec/tez/TezSessionPoolManager.java#L445]
> [https://github.com/apache/hive/blob/bcc7df95824831a8d2f1524e4048dfc23ab98c19/ql/src/java/org/apache/hadoop/hive/ql/exec/tez/TezSessionPoolManager.java#L446]
>  
> Attachment *explain with merge files.png* shows that ** the stage-4 is 
> individual merge file job which is submitted into another yarn queue(default 
> queue), not the queue root.bdoc.production.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Resolved] (HIVE-25692) ExceptionHandler may mask checked exceptions

2021-11-17 Thread Zoltan Haindrich (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zoltan Haindrich resolved HIVE-25692.
-
Fix Version/s: 4.0.0
   Resolution: Fixed

merged into master. Thank you [~kkasa] and [~dengzh] for reviewing the changes!

> ExceptionHandler may mask checked exceptions
> 
>
> Key: HIVE-25692
> URL: https://issues.apache.org/jira/browse/HIVE-25692
> Project: Hive
>  Issue Type: Bug
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> HIVE-25055 have changed the way exceptions as rethrowed - but one of the 
> methods may let checked exception out without them being declared on the 
> method (and avoid the compile time error for it)
> testcase for:
> org.apache.hadoop.hive.metastore.TestExceptionHandler
> {code}
>   @Test
>   public void testInvalid() throws MetaException {
> try {
>   throw new IOException("IOException test");
> } catch (Exception e) {
>   throw handleException(e).throwIfInstance(AccessControlException.class, 
> IOException.class).defaultMetaException();
> }
>   }
> {code}
> this testcase should not compile - as it may throw IOException or 
> AccessControlException as well



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Resolved] (HIVE-25634) Eclipse compiler bumps into AIOBE during ObjectStore compilation

2021-11-17 Thread Zoltan Haindrich (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zoltan Haindrich resolved HIVE-25634.
-
Fix Version/s: 4.0.0
   Resolution: Fixed

merged into master. Thank you [~dengzh]!

> Eclipse compiler bumps into AIOBE during ObjectStore compilation
> 
>
> Key: HIVE-25634
> URL: https://issues.apache.org/jira/browse/HIVE-25634
> Project: Hive
>  Issue Type: Bug
>Reporter: Zoltan Haindrich
>Assignee: Zhihua Deng
>Priority: Major
>  Labels: pull-request-available
> Fix For: 4.0.0
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> this issue seem to have started appearing after HIVE-23633



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-23095) NDV might be overestimated for a table with ~70 value

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-23095?focusedWorklogId=682447=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-682447
 ]

ASF GitHub Bot logged work on HIVE-23095:
-

Author: ASF GitHub Bot
Created on: 17/Nov/21 09:03
Start Date: 17/Nov/21 09:03
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk commented on pull request #2783:
URL: https://github.com/apache/hive/pull/2783#issuecomment-971374080


   @kasakrisz could you please take a look?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 682447)
Time Spent: 2h 10m  (was: 2h)

> NDV might be overestimated for a table with ~70 value
> -
>
> Key: HIVE-23095
> URL: https://issues.apache.org/jira/browse/HIVE-23095
> Project: Hive
>  Issue Type: Bug
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>  Labels: pull-request-available
> Attachments: HIVE-23095.01.patch, HIVE-23095.02.patch, 
> HIVE-23095.03.patch, HIVE-23095.04.patch, HIVE-23095.04.patch, 
> HIVE-23095.04.patch, HIVE-23095.05.patch, hll-bench.md
>
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>
> uncovered during looking into HIVE-23082
> https://issues.apache.org/jira/browse/HIVE-23082?focusedCommentId=17067773=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17067773



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25692) ExceptionHandler may mask checked exceptions

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25692?focusedWorklogId=682445=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-682445
 ]

ASF GitHub Bot logged work on HIVE-25692:
-

Author: ASF GitHub Bot
Created on: 17/Nov/21 08:58
Start Date: 17/Nov/21 08:58
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk merged pull request #2782:
URL: https://github.com/apache/hive/pull/2782


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 682445)
Time Spent: 1h  (was: 50m)

> ExceptionHandler may mask checked exceptions
> 
>
> Key: HIVE-25692
> URL: https://issues.apache.org/jira/browse/HIVE-25692
> Project: Hive
>  Issue Type: Bug
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> HIVE-25055 have changed the way exceptions as rethrowed - but one of the 
> methods may let checked exception out without them being declared on the 
> method (and avoid the compile time error for it)
> testcase for:
> org.apache.hadoop.hive.metastore.TestExceptionHandler
> {code}
>   @Test
>   public void testInvalid() throws MetaException {
> try {
>   throw new IOException("IOException test");
> } catch (Exception e) {
>   throw handleException(e).throwIfInstance(AccessControlException.class, 
> IOException.class).defaultMetaException();
> }
>   }
> {code}
> this testcase should not compile - as it may throw IOException or 
> AccessControlException as well



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Work logged] (HIVE-25531) Remove the core classified hive-exec artifact

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25531?focusedWorklogId=682435=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-682435
 ]

ASF GitHub Bot logged work on HIVE-25531:
-

Author: ASF GitHub Bot
Created on: 17/Nov/21 08:41
Start Date: 17/Nov/21 08:41
Worklog Time Spent: 10m 
  Work Description: kgyrtkirk commented on pull request #2648:
URL: https://github.com/apache/hive/pull/2648#issuecomment-971357845


   @zabetak could you please take a look?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 682435)
Remaining Estimate: 0h
Time Spent: 10m

> Remove the core classified hive-exec artifact
> -
>
> Key: HIVE-25531
> URL: https://issues.apache.org/jira/browse/HIVE-25531
> Project: Hive
>  Issue Type: Improvement
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> * this artifact was introduced in HIVE-7423 
> * loading this artifact and the shaded hive-exec (along with the jdbc driver) 
> could create interesting classpath problems
> * if other projects have issues with the shaded hive-exec artifact we must 
> start fix those problems



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HIVE-25531) Remove the core classified hive-exec artifact

2021-11-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-25531:
--
Labels: pull-request-available  (was: )

> Remove the core classified hive-exec artifact
> -
>
> Key: HIVE-25531
> URL: https://issues.apache.org/jira/browse/HIVE-25531
> Project: Hive
>  Issue Type: Improvement
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> * this artifact was introduced in HIVE-7423 
> * loading this artifact and the shaded hive-exec (along with the jdbc driver) 
> could create interesting classpath problems
> * if other projects have issues with the shaded hive-exec artifact we must 
> start fix those problems



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (HIVE-25714) Some tests are flaky because docker is not able to start in 5 seconds

2021-11-17 Thread Zoltan Haindrich (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445006#comment-17445006
 ] 

Zoltan Haindrich commented on HIVE-25714:
-

the testrun also contains some info - it seems like the docker log command for 
oracle took over a second every time it was executed; maybe it has a lot of 
logs?
fyi: [~zabetak]

> Some tests are flaky because docker is not able to start in 5 seconds
> -
>
> Key: HIVE-25714
> URL: https://issues.apache.org/jira/browse/HIVE-25714
> Project: Hive
>  Issue Type: Improvement
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>
> there are some testruns failing with; and on the test site multiple pods are 
> running in parallel - its not an ideal environment for tight deadlines
> {code}
> Unexpected exception java.lang.RuntimeException: Process docker failed to run 
> in 5 seconds
>  at 
> org.apache.hadoop.hive.ql.externalDB.AbstractExternalDB.runCmd(AbstractExternalDB.java:92)
>  at 
> org.apache.hadoop.hive.ql.externalDB.AbstractExternalDB.launchDockerContainer(AbstractExternalDB.java:123)
>  at 
> org.apache.hadoop.hive.ql.qoption.QTestDatabaseHandler.beforeTest(QTestDatabaseHandler.java:111)
>  at 
> org.apache.hadoop.hive.ql.qoption.QTestOptionDispatcher.beforeTest(QTestOptionDispatcher.java:79)
> {code}
> http://ci.hive.apache.org/job/hive-precommit/job/PR-1674/4/testReport/junit/org.apache.hadoop.hive.cli.split19/TestMiniLlapLocalCliDriver/Testing___split_14___PostProcess___testCliDriver_qt_database_all_/



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Commented] (HIVE-25714) Some tests are flaky because docker is not able to start in 5 seconds

2021-11-17 Thread Zoltan Haindrich (Jira)


[ 
https://issues.apache.org/jira/browse/HIVE-25714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17445005#comment-17445005
 ] 

Zoltan Haindrich commented on HIVE-25714:
-

apparently; the docker log command times out after 5 seconds 
https://github.com/apache/hive/blob/dbec7744f04a9389e1bb168e85b5e8f73d357011/itests/util/src/main/java/org/apache/hadoop/hive/ql/externalDB/AbstractExternalDB.java#L123

launched flaky-check for it ; I doubt that it will give more info - but it will 
have the full hive.log and seeing how long earlier `docker log` cmds took might 
help
http://ci.hive.apache.org/job/hive-flaky-check/466/

> Some tests are flaky because docker is not able to start in 5 seconds
> -
>
> Key: HIVE-25714
> URL: https://issues.apache.org/jira/browse/HIVE-25714
> Project: Hive
>  Issue Type: Improvement
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>
> there are some testruns failing with; and on the test site multiple pods are 
> running in parallel - its not an ideal environment for tight deadlines
> {code}
> Unexpected exception java.lang.RuntimeException: Process docker failed to run 
> in 5 seconds
>  at 
> org.apache.hadoop.hive.ql.externalDB.AbstractExternalDB.runCmd(AbstractExternalDB.java:92)
>  at 
> org.apache.hadoop.hive.ql.externalDB.AbstractExternalDB.launchDockerContainer(AbstractExternalDB.java:123)
>  at 
> org.apache.hadoop.hive.ql.qoption.QTestDatabaseHandler.beforeTest(QTestDatabaseHandler.java:111)
>  at 
> org.apache.hadoop.hive.ql.qoption.QTestOptionDispatcher.beforeTest(QTestOptionDispatcher.java:79)
> {code}
> http://ci.hive.apache.org/job/hive-precommit/job/PR-1674/4/testReport/junit/org.apache.hadoop.hive.cli.split19/TestMiniLlapLocalCliDriver/Testing___split_14___PostProcess___testCliDriver_qt_database_all_/



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Assigned] (HIVE-25714) Some tests are flaky because docker is not able to start in 5 seconds

2021-11-17 Thread Zoltan Haindrich (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zoltan Haindrich reassigned HIVE-25714:
---


> Some tests are flaky because docker is not able to start in 5 seconds
> -
>
> Key: HIVE-25714
> URL: https://issues.apache.org/jira/browse/HIVE-25714
> Project: Hive
>  Issue Type: Improvement
>Reporter: Zoltan Haindrich
>Assignee: Zoltan Haindrich
>Priority: Major
>
> there are some testruns failing with; and on the test site multiple pods are 
> running in parallel - its not an ideal environment for tight deadlines
> {code}
> Unexpected exception java.lang.RuntimeException: Process docker failed to run 
> in 5 seconds
>  at 
> org.apache.hadoop.hive.ql.externalDB.AbstractExternalDB.runCmd(AbstractExternalDB.java:92)
>  at 
> org.apache.hadoop.hive.ql.externalDB.AbstractExternalDB.launchDockerContainer(AbstractExternalDB.java:123)
>  at 
> org.apache.hadoop.hive.ql.qoption.QTestDatabaseHandler.beforeTest(QTestDatabaseHandler.java:111)
>  at 
> org.apache.hadoop.hive.ql.qoption.QTestOptionDispatcher.beforeTest(QTestOptionDispatcher.java:79)
> {code}
> http://ci.hive.apache.org/job/hive-precommit/job/PR-1674/4/testReport/junit/org.apache.hadoop.hive.cli.split19/TestMiniLlapLocalCliDriver/Testing___split_14___PostProcess___testCliDriver_qt_database_all_/



--
This message was sent by Atlassian Jira
(v8.20.1#820001)