[Impala-ASF-CR] IMPALA-9859: Full ACID Milestone 4: Part 1 Reading modified tables (primitive types)

2020-06-30 Thread Quanlong Huang (Code Review)
Quanlong Huang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16082 )

Change subject: IMPALA-9859: Full ACID Milestone 4: Part 1 Reading modified 
tables (primitive types)
..


Patch Set 9:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/16082/9/fe/src/main/java/org/apache/impala/planner/SingleNodePlanner.java
File fe/src/main/java/org/apache/impala/planner/SingleNodePlanner.java:

http://gerrit.cloudera.org:8080/#/c/16082/9/fe/src/main/java/org/apache/impala/planner/SingleNodePlanner.java@45
PS9, Line 45: import org.apache.impala.analysis.Path.PathType;
nit: keep the import list sorted in groups (usually the IDE will do this for 
you).


http://gerrit.cloudera.org:8080/#/c/16082/9/fe/src/main/java/org/apache/impala/planner/SingleNodePlanner.java@1418
PS9, Line 1418:   if (addAcidSlotsIfNeeded(analyzer, hdfsTblRef, partitions)) {
nit: what about merging this if-statement with its outer scope, so that they read:

 if (isPartitionKeyScan && queryOpts.optimize_partition_key_scans) {
   ...
 } else if (addAcidSlotsIfNeeded(analyzer, hdfsTblRef, partitions)) {
   ...
 } else {
   ...
 }


http://gerrit.cloudera.org:8080/#/c/16082/9/fe/src/main/java/org/apache/impala/planner/SingleNodePlanner.java@1518
PS9, Line 1518: // Let's separate insert delta File Descriptors from delete delta FDs.
I think we should separate the file descriptors in catalogd after loading them 
from HDFS instead of doing it here for each query. We can introduce two fields 
in HdfsPartition: encodedInsertDeltaFileDescriptors_ and 
encodedDeleteDeltaFileDescriptors_ (and related fields in THdfsPartition and 
TPartialPartitionInfo). If a partition contains delete deltas, we separate them 
by setting these two fields and leaving encodedFileDescriptors_ null. We can 
also introduce two methods for FeFsPartition: genInsertDeltaPartition() and 
genDeleteDeltaPartition() using HdfsPartition.Builder in this way:

 public HdfsPartition genInsertDeltaPartition() {
   return new HdfsPartition.Builder(this)
       .setFileDescriptors(InsertDeltaFileDescriptors_)
       .build();
 }

With this we don't need to remove the "final" marker of encodedFileDescriptors_ 
and add back the setFileDescriptor() method, which violates our goal to make 
HdfsPartition immutable. The setFileDescriptor() method may encourage future 
developers to modify HdfsPartitions in-place in catalogd, which will break 
IMPALA-7533.
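
A minimal sketch of the suggestion above, assuming simplified stand-in classes. `PartitionSketch`, its `Builder`, and the string file descriptors are hypothetical and are not Impala's real `HdfsPartition` API; the sketch also uses an empty list where the comment proposes leaving encodedFileDescriptors_ null. It only illustrates how derived partitions can be produced without a setter on the immutable object:

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical, simplified stand-in for HdfsPartition (not the real class).
final class PartitionSketch {
  // All fields are final, so an instance is never modified in place.
  private final List<String> fileDescriptors;
  private final List<String> insertDeltaFileDescriptors;
  private final List<String> deleteDeltaFileDescriptors;

  private PartitionSketch(Builder b) {
    fileDescriptors = freeze(b.fileDescriptors);
    insertDeltaFileDescriptors = freeze(b.insertDeltaFileDescriptors);
    deleteDeltaFileDescriptors = freeze(b.deleteDeltaFileDescriptors);
  }

  // Defensive copy wrapped in a read-only view.
  private static List<String> freeze(List<String> in) {
    return Collections.unmodifiableList(new ArrayList<>(in));
  }

  List<String> getFileDescriptors() { return fileDescriptors; }

  // Derives a new partition whose file list holds only the insert deltas.
  PartitionSketch genInsertDeltaPartition() {
    return new Builder(this).setFileDescriptors(insertDeltaFileDescriptors).build();
  }

  // Derives a new partition whose file list holds only the delete deltas.
  PartitionSketch genDeleteDeltaPartition() {
    return new Builder(this).setFileDescriptors(deleteDeltaFileDescriptors).build();
  }

  // All mutation happens in the builder, before the partition exists.
  static final class Builder {
    private List<String> fileDescriptors = new ArrayList<>();
    private List<String> insertDeltaFileDescriptors = new ArrayList<>();
    private List<String> deleteDeltaFileDescriptors = new ArrayList<>();

    Builder() {}

    Builder(PartitionSketch p) {
      fileDescriptors = p.fileDescriptors;
      insertDeltaFileDescriptors = p.insertDeltaFileDescriptors;
      deleteDeltaFileDescriptors = p.deleteDeltaFileDescriptors;
    }

    Builder setFileDescriptors(List<String> fds) {
      fileDescriptors = fds;
      return this;
    }

    Builder setInsertDeltaFileDescriptors(List<String> fds) {
      insertDeltaFileDescriptors = fds;
      return this;
    }

    Builder setDeleteDeltaFileDescriptors(List<String> fds) {
      deleteDeltaFileDescriptors = fds;
      return this;
    }

    PartitionSketch build() { return new PartitionSketch(this); }
  }
}
```

The original partition is left untouched by both gen*() calls, which is the property the comment wants to preserve.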



--
To view, visit http://gerrit.cloudera.org:8080/16082
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I15c8feabf40be1658f3dd46883f5a1b2aa5d0659
Gerrit-Change-Number: 16082
Gerrit-PatchSet: 9
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Wed, 01 Jul 2020 06:57:52 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] WIP - IMPALA-9898: generate grouping set plans

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16128 )

Change subject: WIP - IMPALA-9898: generate grouping set plans
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6473/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16128

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie454c5bf7aee266321dee615548d7f2b71380197
Gerrit-Change-Number: 16128
Gerrit-PatchSet: 6
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: David Rorke 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 01 Jul 2020 04:25:54 +
Gerrit-HasComments: No


[Impala-ASF-CR] WIP - IMPALA-9898: generate grouping set plans

2020-06-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16128 )

Change subject: WIP - IMPALA-9898: generate grouping set plans
..


Patch Set 6:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/16128/5/fe/src/main/java/org/apache/impala/analysis/GroupByClause.java
File fe/src/main/java/org/apache/impala/analysis/GroupByClause.java:

http://gerrit.cloudera.org:8080/#/c/16128/5/fe/src/main/java/org/apache/impala/analysis/GroupByClause.java@145
PS5, Line 145: }
There's a problem here that affects plan generation - we need to include 
grouping expressions that were added during rewrites and aren't part of the 
original grouping sets.


http://gerrit.cloudera.org:8080/#/c/16128/5/fe/src/main/java/org/apache/impala/analysis/MultiAggregateInfo.java
File fe/src/main/java/org/apache/impala/analysis/MultiAggregateInfo.java:

http://gerrit.cloudera.org:8080/#/c/16128/5/fe/src/main/java/org/apache/impala/analysis/MultiAggregateInfo.java@227
PS5, Line 227:   return;
> Set isAnalyzed_ = true ?  Seems analyzeGroupingSets() is not setting this f
I'm relying on analyzeCustomClasses to do that, but it's kinda buried - added a 
comment in analyzeGroupingSets().



--
To view, visit http://gerrit.cloudera.org:8080/16128

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie454c5bf7aee266321dee615548d7f2b71380197
Gerrit-Change-Number: 16128
Gerrit-PatchSet: 6
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: David Rorke 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 01 Jul 2020 03:59:19 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] WIP - IMPALA-9898: generate grouping set plans

2020-06-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/16128


Change subject: WIP - IMPALA-9898: generate grouping set plans
..

WIP - IMPALA-9898: generate grouping set plans

Testing:
* Add analysis test to make sure we reject unsupported queries.
* Added targeted planner tests to ensure we generate the correct
  aggregation classes for a variety of cases.

TODO:
* Add detailed functional tests.

Added five TPC-DS queries that use ROLLUP, building on some work done
by Fang-Yu Rao. Some tweaks were required for these tests.
* Add an extra ORDER BY clause to q77 to make fully deterministic.
* Add backticks around `returns` to avoid reserved word.
* Add INTERVAL keyword to date/timestamp arithmetic.

We should also be able to run q27 and q80, but I didn't have those
readily available.

Change-Id: Ie454c5bf7aee266321dee615548d7f2b71380197
---
M fe/src/main/java/org/apache/impala/analysis/Expr.java
M fe/src/main/java/org/apache/impala/analysis/GroupByClause.java
M fe/src/main/java/org/apache/impala/analysis/MultiAggregateInfo.java
M fe/src/main/java/org/apache/impala/analysis/SelectStmt.java
M fe/src/main/java/org/apache/impala/common/RuntimeEnv.java
M fe/src/main/java/org/apache/impala/planner/SingleNodePlanner.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeStmtsTest.java
M fe/src/test/java/org/apache/impala/analysis/ToSqlTest.java
M fe/src/test/java/org/apache/impala/planner/PlannerTest.java
A testdata/workloads/functional-planner/queries/PlannerTest/grouping-sets.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/subquery-rewrite.test
M testdata/workloads/functional-planner/queries/PlannerTest/tpcds-all.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q18.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q22.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q5.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q67.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q77.test
M tests/query_test/test_tpcds_queries.py
M tests/util/parse_util.py
19 files changed, 3,623 insertions(+), 60 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/28/16128/6
--
To view, visit http://gerrit.cloudera.org:8080/16128

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ie454c5bf7aee266321dee615548d7f2b71380197
Gerrit-Change-Number: 16128
Gerrit-PatchSet: 6
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: David Rorke 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] WIP - IMPALA-9898: generate grouping set plans

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16128 )

Change subject: WIP - IMPALA-9898: generate grouping set plans
..


Patch Set 6:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/16128/6/fe/src/main/java/org/apache/impala/analysis/GroupByClause.java
File fe/src/main/java/org/apache/impala/analysis/GroupByClause.java:

http://gerrit.cloudera.org:8080/#/c/16128/6/fe/src/main/java/org/apache/impala/analysis/GroupByClause.java@137
PS6, Line 137:   private void addGroupingID(long id, List groupingExprs, List addtlGroupingExprs)
line too long (94 > 90)


http://gerrit.cloudera.org:8080/#/c/16128/6/fe/src/test/java/org/apache/impala/analysis/AnalyzeStmtsTest.java
File fe/src/test/java/org/apache/impala/analysis/AnalyzeStmtsTest.java:

http://gerrit.cloudera.org:8080/#/c/16128/6/fe/src/test/java/org/apache/impala/analysis/AnalyzeStmtsTest.java@2472
PS6, Line 2472: AnalysisError("select distinct int_col, bool_col, count(*) from functional.alltypes " +
line too long (91 > 90)



--
To view, visit http://gerrit.cloudera.org:8080/16128

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie454c5bf7aee266321dee615548d7f2b71380197
Gerrit-Change-Number: 16128
Gerrit-PatchSet: 6
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: David Rorke 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 01 Jul 2020 03:57:58 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9913: Use table id to match the table in drop table event

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/16129 )

Change subject: IMPALA-9913: Use table id to match the table in drop table event
..

IMPALA-9913: Use table id to match the table in drop table event

This patch modifies the matching logic of the table while
processing a DROP_TABLE event. The EventsProcessor currently relies
on createTime of the table to determine if the table in the catalog
matches with the table in the event. It is possible that if there
is a time-skew between the different HMS instances and the same table
is dropped and recreated, the creation time of the table which
is present in the catalog is less than or equal to the creationTime
of a table which has been dropped. In such a case, there is a false
negative during the matching and the DROP_TABLE event is skipped.

HIVE-20556 exposes an id field of the table which can be used to
uniquely identify a table. A table which is dropped and recreated
with the same name has a different id. This patch makes use of this
id to match the table in catalog with the table from the HMS event.

Note that the DROP_DATABASE event continues to use createTime for
matching. We should use a database id for it when it is available.
I have created HIVE-23785 for tracking that on the HMS side.
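
The id-based matching described above can be sketched as follows. This is an illustration only: `TableRefSketch`, `DropTableMatcherSketch`, and the numeric values are hypothetical and are not the code in CatalogServiceCatalog.java. It shows why an id stays reliable when createTime values are skewed between HMS instances:

```java
// Hypothetical stand-in for the table metadata held by the catalog or an event.
final class TableRefSketch {
  final String name;
  final long id;          // unique per table incarnation (the field from HIVE-20556)
  final long createTime;  // may be skewed between different HMS instances

  TableRefSketch(String name, long id, long createTime) {
    this.name = name;
    this.id = id;
    this.createTime = createTime;
  }
}

final class DropTableMatcherSketch {
  // True if the DROP_TABLE event refers to the table incarnation in the catalog.
  // createTime is deliberately ignored because it is not a reliable identity.
  static boolean matchesById(TableRefSketch inCatalog, TableRefSketch fromEvent) {
    return inCatalog.name.equals(fromEvent.name) && inCatalog.id == fromEvent.id;
  }
}
```

With a dropped table (id 1, createTime 100) and a recreated table of the same name (id 2, createTime 99 because of skew), the two ids differ even though the creation times are out of order, so the drop event for the old incarnation does not match the new one.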

Testing:
1. Ran Events processor tests.

Change-Id: Id6a80bbf5757e46318af1b57911fc127d7dd1f01
Reviewed-on: http://gerrit.cloudera.org:8080/16129
Reviewed-by: Anurag Mantripragada 
Tested-by: Impala Public Jenkins 
---
M fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java
M 
fe/src/test/java/org/apache/impala/catalog/events/MetastoreEventsProcessorTest.java
2 files changed, 10 insertions(+), 10 deletions(-)

Approvals:
  Anurag Mantripragada: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/16129

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Id6a80bbf5757e46318af1b57911fc127d7dd1f01
Gerrit-Change-Number: 16129
Gerrit-PatchSet: 3
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Anurag Mantripragada 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-9913: Use table id to match the table in drop table event

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16129 )

Change subject: IMPALA-9913: Use table id to match the table in drop table event
..


Patch Set 2: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/16129

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id6a80bbf5757e46318af1b57911fc127d7dd1f01
Gerrit-Change-Number: 16129
Gerrit-PatchSet: 2
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Anurag Mantripragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 01 Jul 2020 00:16:52 +
Gerrit-HasComments: No


[native-toolchain-CR] Bump Kudu version to 23f67ae0d

2020-06-30 Thread Bankim Bhavsar (Code Review)
Bankim Bhavsar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16130 )

Change subject: Bump Kudu version to 23f67ae0d
..


Patch Set 1: Code-Review+1


--
To view, visit http://gerrit.cloudera.org:8080/16130

Gerrit-Project: native-toolchain
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I036889f91a0901ff09fce31069bdb0bf96b7bb1e
Gerrit-Change-Number: 16130
Gerrit-PatchSet: 1
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Bankim Bhavsar 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Comment-Date: Tue, 30 Jun 2020 23:01:02 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9692 (part 3): Model QuerySchedule as a protobuf

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15961 )

Change subject: IMPALA-9692 (part 3): Model QuerySchedule as a protobuf
..


Patch Set 10: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/15961

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1db64e72f84604b1d8ac24e0bdd4ad6bedd6bcd9
Gerrit-Change-Number: 15961
Gerrit-PatchSet: 10
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Comment-Date: Tue, 30 Jun 2020 22:06:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9692 (part 3): Model QuerySchedule as a protobuf

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/15961 )

Change subject: IMPALA-9692 (part 3): Model QuerySchedule as a protobuf
..

IMPALA-9692 (part 3): Model QuerySchedule as a protobuf

In order to support the new admission control service, we need to be
able to return the results of an admission attempt, i.e. the query
schedule, to the coordinator.

To enable this, this patch moves all parts of the QuerySchedule class
and related classes that are required by the coordinator into a new
message QuerySchedulePB. The main admission control interface,
SubmitForAdmission(), now returns a QuerySchedulePB.

Some notable changes:
- Previously, QuerySchedule was used by Coordinator as a way to pass
  around references to parts of the TExecRequest to places like
  Coordinator::ExecSummary and Coordinator::BackendState. This has
  been replaced with the ExecParams class, which is a container for
  references to the TExecRequest and QuerySchedulePB along with
  convenience functions for accessing them.
- Similarly, FragmentExecParams, which is part of QuerySchedule,
  contains references to the associated TPlanFragment, owned by the
  TExecRequest, which were used by the Coordinator when iterating over
  the schedule to initiate the query. Since FragmentExecParamsPB can't
  contain these references, they were replaced by a map between
  fragment idx and TPlanFragment in ExecParams.
- In order to keep payloads reasonable for the eventual RPC interface,
  AdmissionController::ReleaseQuery() and ReleaseQueryBackend() now
  take a query id as a parameter instead of a QuerySchedule. To
  facilitate this, AdmissionController now maintains a map from query
  ids of running queries to the resources that were allocated for them
  so that it can look the resources up when releasing them. This map
  will be necessary when implementing the admission control service to
  facilitate proper accounting of resources in cases like coordinator
  failures.
- As scheduling is currently organized, we first construct the
  FragmentExecParams with the FInstanceExecParams as their children,
  then we construct the BackendExecParams which get references to
  their FInstanceExecParams. Since we can't send references like these
  through an rpc, we now instead Swap() the FInstanceExecParamsPB
  into the BackendExecParamsPB.
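
The third bullet above (releasing by query id rather than by QuerySchedule) can be sketched as below. This is a hypothetical Java illustration: Impala's AdmissionController is written in C++, and `AdmissionLedgerSketch`, its methods, and the use of a single memory figure per query are assumptions made for brevity:

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration; the real AdmissionController is C++ and tracks more state.
final class AdmissionLedgerSketch {
  // Resources admitted for each running query, keyed by query id.
  private final Map<String, Long> memReservedByQuery = new HashMap<>();
  private long totalMemReserved = 0;

  // Records the memory admitted for a query so it can be looked up on release.
  synchronized void admit(String queryId, long memBytes) {
    Long previous = memReservedByQuery.put(queryId, memBytes);
    if (previous != null) totalMemReserved -= previous;
    totalMemReserved += memBytes;
  }

  // Releases by query id alone; returns the bytes freed, or 0 for an unknown id.
  synchronized long releaseQuery(String queryId) {
    Long mem = memReservedByQuery.remove(queryId);
    if (mem == null) return 0;
    totalMemReserved -= mem;
    return mem;
  }

  synchronized long totalMemReserved() { return totalMemReserved; }
}
```

Because the caller passes only an id, the release request stays small, and a second release for the same id is a harmless no-op in this sketch.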

Testing:
- Updated related tests.
- Passed a full run of existing tests.

Change-Id: I1db64e72f84604b1d8ac24e0bdd4ad6bedd6bcd9
Reviewed-on: http://gerrit.cloudera.org:8080/15961
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M be/src/runtime/CMakeLists.txt
M be/src/runtime/coordinator-backend-resource-state.cc
M be/src/runtime/coordinator-backend-state-test.cc
M be/src/runtime/coordinator-backend-state.cc
M be/src/runtime/coordinator-backend-state.h
M be/src/runtime/coordinator.cc
M be/src/runtime/coordinator.h
M be/src/runtime/fragment-instance-state.cc
M be/src/runtime/query-driver.cc
A be/src/runtime/query-exec-params.cc
A be/src/runtime/query-exec-params.h
M be/src/scheduling/CMakeLists.txt
M be/src/scheduling/admission-controller-test.cc
M be/src/scheduling/admission-controller.cc
M be/src/scheduling/admission-controller.h
M be/src/scheduling/cluster-membership-mgr-test.cc
M be/src/scheduling/cluster-membership-mgr.cc
M be/src/scheduling/cluster-membership-mgr.h
M be/src/scheduling/cluster-membership-test-util.cc
M be/src/scheduling/query-schedule.cc
M be/src/scheduling/query-schedule.h
M be/src/scheduling/scheduler-test-util.cc
M be/src/scheduling/scheduler-test-util.h
M be/src/scheduling/scheduler.cc
M be/src/scheduling/scheduler.h
M be/src/service/client-request-state.cc
M be/src/service/client-request-state.h
M be/src/service/impala-http-handler.cc
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
M be/src/util/debug-util.h
M be/src/util/uid-util.h
M common/protobuf/CMakeLists.txt
A common/protobuf/admission_control_service.proto
34 files changed, 1,082 insertions(+), 776 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/15961

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I1db64e72f84604b1d8ac24e0bdd4ad6bedd6bcd9
Gerrit-Change-Number: 15961
Gerrit-PatchSet: 11
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Thomas Tauber-Marshall 


[Impala-ASF-CR] IMPALA-9540 Test that Impala Shell no longer sends duplicate "Host" headers in http mode.

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15752 )

Change subject: IMPALA-9540 Test that Impala Shell no longer sends duplicate 
"Host" headers in http mode.
..


Patch Set 6: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/15752

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I82996015d0205923e854dac8bb88604778684c46
Gerrit-Change-Number: 15752
Gerrit-PatchSet: 6
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 30 Jun 2020 21:46:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9540 Test that Impala Shell no longer sends duplicate "Host" headers in http mode.

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/15752 )

Change subject: IMPALA-9540 Test that Impala Shell no longer sends duplicate 
"Host" headers in http mode.
..

IMPALA-9540 Test that Impala Shell no longer sends duplicate "Host" headers in 
http mode.

Many http servers will not accept an http request that has multiple
copies of the "Host" header. A recent toolchain change patches
Thrift so that it will not send the extraneous header (in THttpClient).
This change tests that the duplicate headers are not sent.
TESTING:
  Ran all end-to-end tests.
  Rewrote an existing Shell test to check that only one "Host" header
  is sent.

Change-Id: I82996015d0205923e854dac8bb88604778684c46
Reviewed-on: http://gerrit.cloudera.org:8080/15752
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M tests/shell/test_shell_interactive.py
M tests/shell/util.py
2 files changed, 54 insertions(+), 35 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/15752

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I82996015d0205923e854dac8bb88604778684c46
Gerrit-Change-Number: 15752
Gerrit-PatchSet: 7
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16112 )

Change subject: IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis
..


Patch Set 11:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6472/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16112

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I178e45de94d736630c97ae1ec4a92423cd74621f
Gerrit-Change-Number: 16112
Gerrit-PatchSet: 11
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Shant Hovsepian 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 21:30:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9294: Support DATE for min-max runtime filter

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16103 )

Change subject: IMPALA-9294: Support DATE for min-max runtime filter
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6471/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16103

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic2f6e2dc6949735d5f0fcf317361cc2969a5e82c
Gerrit-Change-Number: 16103
Gerrit-PatchSet: 6
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Tue, 30 Jun 2020 21:17:09 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis

2020-06-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16112 )

Change subject: IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis
..


Patch Set 11:

> LGTM, nice approach of keeping the groupingExprs_ in the select stmt.

I copied that from you ;)


--
To view, visit http://gerrit.cloudera.org:8080/16112

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I178e45de94d736630c97ae1ec4a92423cd74621f
Gerrit-Change-Number: 16112
Gerrit-PatchSet: 11
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Shant Hovsepian 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 21:08:36 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis

2020-06-30 Thread Tim Armstrong (Code Review)
Hello Aman Sinha, Shant Hovsepian, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/16112

to look at the new patch set (#11).

Change subject: IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis
..

IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis

Supports a single ROLLUP, CUBE or GROUPING SETS clause in
GROUP BY.

Also adds non-standard "WITH ROLLUP" and "WITH CUBE" syntax
that is supported by some other SQL dialects.

This implements basic parsing and validation of the query,
then raises an AnalysisException to report that it is not
supported so that incorrect plans will not be generated.

This patch adds a GroupByClause to each SelectStmt that
contains info about the grouping sets and the original
GROUP BY list. The grouping exprs are still represented
as a List in SelectStmt. Most of the logic, including
statement and expr rewrites, can operate on this list of
expressions without requiring special handling for grouping
sets.
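
For readers less familiar with the clauses named above, here is a minimal sketch of how a ROLLUP list expands into grouping sets under standard SQL semantics. `RollupSketch` is a hypothetical helper and is not taken from GroupByClause.java; expressions are represented as plain strings:

```java
import java.util.ArrayList;
import java.util.List;

final class RollupSketch {
  // ROLLUP(e1, ..., en) expands to the n + 1 prefixes of the list, longest first:
  // (e1, ..., en), (e1, ..., en-1), ..., (e1), ().
  static List<List<String>> rollup(List<String> exprs) {
    List<List<String>> sets = new ArrayList<>();
    for (int len = exprs.size(); len >= 0; len--) {
      sets.add(new ArrayList<>(exprs.subList(0, len)));
    }
    return sets;
  }
}
```

For ROLLUP(a, b, c) this yields the four sets (a, b, c), (a, b), (a), and the empty set, the last of which is the grand total.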

Testing:
* Add Parser test.
* Add toSql() test.
* Added analysis tests to check that analysis accepts or rejects
  queries correctly.

Change-Id: I178e45de94d736630c97ae1ec4a92423cd74621f
---
M fe/src/main/cup/sql-parser.cup
A fe/src/main/java/org/apache/impala/analysis/GroupByClause.java
M fe/src/main/java/org/apache/impala/analysis/SelectStmt.java
M fe/src/main/java/org/apache/impala/analysis/StmtRewriter.java
M fe/src/main/java/org/apache/impala/common/RuntimeEnv.java
M fe/src/main/jflex/sql-scanner.flex
M fe/src/test/java/org/apache/impala/analysis/AnalyzeStmtsTest.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzerTest.java
M fe/src/test/java/org/apache/impala/analysis/ParserTest.java
M fe/src/test/java/org/apache/impala/analysis/ToSqlTest.java
10 files changed, 741 insertions(+), 23 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/12/16112/11
--
To view, visit http://gerrit.cloudera.org:8080/16112

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I178e45de94d736630c97ae1ec4a92423cd74621f
Gerrit-Change-Number: 16112
Gerrit-PatchSet: 11
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Shant Hovsepian 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis

2020-06-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16112 )

Change subject: IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis
..


Patch Set 10:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/16112/10/fe/src/main/cup/sql-parser.cup
File fe/src/main/cup/sql-parser.cup:

http://gerrit.cloudera.org:8080/#/c/16112/10/fe/src/main/cup/sql-parser.cup@3076
PS10, Line 3076: // * GROUP BY a, b, c WITH CUBE - non-standard CUBE syntax 
supported by some systems
> Add example of GROUP BY GROUPING SETS ?
Done


http://gerrit.cloudera.org:8080/#/c/16112/10/fe/src/main/cup/sql-parser.cup@3076
PS10, Line 3076: // * GROUP BY a, b, c WITH CUBE - non-standard CUBE syntax 
supported by some systems
> Add example of GROUP BY GROUPING SETS ?
Done


http://gerrit.cloudera.org:8080/#/c/16112/10/fe/src/main/java/org/apache/impala/analysis/GroupByClause.java
File fe/src/main/java/org/apache/impala/analysis/GroupByClause.java:

http://gerrit.cloudera.org:8080/#/c/16112/10/fe/src/main/java/org/apache/impala/analysis/GroupByClause.java@121
PS10, Line 121:   private void addGroupingID(long id, List groupingExprs) throws AnalysisException {
> Nit but might be cleaner to use the JDK BitSet class especially down the li
It's probably easier to leave as-is, if only because enumerating classes for 
CUBE is way more concise using addition on an integer.

I added some comments to explain that bit manipulation logic to make it less 
cryptic.
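
A minimal sketch of the integer-based enumeration referred to above, assuming each bit of a long marks whether one grouping expression is present. `CubeSketch` is a hypothetical helper and is not the code in GroupByClause.java:

```java
import java.util.ArrayList;
import java.util.List;

final class CubeSketch {
  // Enumerates every subset of exprs. Bit i of the mask set means exprs.get(i)
  // is part of the grouping set, so adding 1 to the mask steps through all
  // 2^n combinations.
  static List<List<String>> cube(List<String> exprs) {
    int n = exprs.size();
    if (n >= 63) throw new IllegalArgumentException("too many grouping exprs");
    List<List<String>> sets = new ArrayList<>();
    for (long mask = 0; mask < (1L << n); mask++) {
      List<String> set = new ArrayList<>();
      for (int i = 0; i < n; i++) {
        if ((mask & (1L << i)) != 0) set.add(exprs.get(i));
      }
      sets.add(set);
    }
    return sets;
  }
}
```

The loop header is the whole enumeration, which is the conciseness argument made in the reply; a BitSet-based version would need an explicit increment routine.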



--
To view, visit http://gerrit.cloudera.org:8080/16112

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I178e45de94d736630c97ae1ec4a92423cd74621f
Gerrit-Change-Number: 16112
Gerrit-PatchSet: 10
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Shant Hovsepian 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 21:03:21 +
Gerrit-HasComments: Yes


[native-toolchain-CR] Bump Kudu version to 23f67ae0d

2020-06-30 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/16130


Change subject: Bump Kudu version to 23f67ae0d
..

Bump Kudu version to 23f67ae0d

This pulls in the latest version of Kudu, which includes a performance
enhancement in the Kudu server for Bloom filters: it uses heuristics to
disable predicate evaluation for a Bloom filter.

Testing:
 - Built native-toolchain on local machine with Kudu version of
   23f67ae0d, and built Impala with new version of Kudu from native-
   toolchain built on local machine, then started mini-cluster with
   the new version of Kudu server. Ran single_node_perf_run.py with
   TPCH for Kudu and verified that the regression issue on TPCH-Q9
   was fixed.
 - Passed test_kudu.py and test_runtime_filters.py against new version
   of Kudu server.
 - Successfully built native-toolchain on Jenkins without publishing
   dependencies to S3.

Change-Id: I036889f91a0901ff09fce31069bdb0bf96b7bb1e
---
M buildall.sh
1 file changed, 1 insertion(+), 1 deletion(-)



  git pull ssh://gerrit.cloudera.org:29418/native-toolchain 
refs/changes/30/16130/1
--
To view, visit http://gerrit.cloudera.org:8080/16130

Gerrit-Project: native-toolchain
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I036889f91a0901ff09fce31069bdb0bf96b7bb1e
Gerrit-Change-Number: 16130
Gerrit-PatchSet: 1
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Thomas Tauber-Marshall 


[Impala-ASF-CR] IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16112 )

Change subject: IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis
..


Patch Set 10: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/16112

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I178e45de94d736630c97ae1ec4a92423cd74621f
Gerrit-Change-Number: 16112
Gerrit-PatchSet: 10
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Shant Hovsepian 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 20:59:24 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9294: Support DATE for min-max runtime filter

2020-06-30 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has uploaded a new patch set (#6). ( 
http://gerrit.cloudera.org:8080/16103 )

Change subject: IMPALA-9294: Support DATE for min-max runtime filter
..

IMPALA-9294: Support DATE for min-max runtime filter

Implemented a DATE min-max filter and applied it to Kudu in the same
way as the other min-max runtime filters.
Added new test cases for DATE min-max filters.
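
For readers unfamiliar with min-max runtime filters, the idea can be
sketched as follows. This is only an illustration, not the code in
be/src/util/min-max-filter.cc: the class DateMinMaxFilter and its
methods are hypothetical, and java.time.LocalDate stands in for
Impala's own DATE representation.

```java
import java.time.LocalDate;

// Hypothetical sketch of a min-max filter over DATE values.
final class DateMinMaxFilter {
  private LocalDate min = null;  // null until the first value is inserted
  private LocalDate max = null;

  // Build side: widen the [min, max] range to include 'value'.
  // NULL values are ignored.
  void insert(LocalDate value) {
    if (value == null) return;
    if (min == null || value.isBefore(min)) min = value;
    if (max == null || value.isAfter(max)) max = value;
  }

  // Probe side: returns false only if 'value' cannot match any inserted
  // value. An empty filter rejects everything.
  boolean mayContain(LocalDate value) {
    if (min == null || value == null) return false;
    return !value.isBefore(min) && !value.isAfter(max);
  }

  public static void main(String[] args) {
    DateMinMaxFilter filter = new DateMinMaxFilter();
    filter.insert(LocalDate.of(2020, 6, 1));
    filter.insert(LocalDate.of(2020, 6, 30));
    System.out.println(filter.mayContain(LocalDate.of(2020, 6, 15)));  // true
    System.out.println(filter.mayContain(LocalDate.of(2020, 7, 1)));   // false
  }
}
```

A scan that receives such a filter can skip rows (or, for Kudu, push
down a range predicate) whose DATE value falls outside [min, max].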

Testing:
Passed all core tests.

Change-Id: Ic2f6e2dc6949735d5f0fcf317361cc2969a5e82c
---
M be/src/codegen/gen_ir_descriptions.py
M be/src/runtime/date-value.h
M be/src/util/min-max-filter-ir.cc
M be/src/util/min-max-filter-test.cc
M be/src/util/min-max-filter.cc
M be/src/util/min-max-filter.h
M common/protobuf/common.proto
M common/thrift/Data.thrift
M fe/src/main/java/org/apache/impala/planner/RuntimeFilterGenerator.java
M testdata/workloads/functional-planner/queries/PlannerTest/min-max-runtime-filters.test
M testdata/workloads/functional-query/queries/QueryTest/all_runtime_filters.test
M testdata/workloads/functional-query/queries/QueryTest/min_max_filters.test
12 files changed, 376 insertions(+), 167 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/03/16103/6
--
To view, visit http://gerrit.cloudera.org:8080/16103

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ic2f6e2dc6949735d5f0fcf317361cc2969a5e82c
Gerrit-Change-Number: 16103
Gerrit-PatchSet: 6
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-9913: Use table id to match the table in drop table event

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16129 )

Change subject: IMPALA-9913: Use table id to match the table in drop table event
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6470/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16129

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id6a80bbf5757e46318af1b57911fc127d7dd1f01
Gerrit-Change-Number: 16129
Gerrit-PatchSet: 2
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Anurag Mantripragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 30 Jun 2020 19:35:42 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() method

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16121 )

Change subject: IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() method
..


Patch Set 4: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/16121

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4322a18a1f07e4106789591b80d25105c565a2b6
Gerrit-Change-Number: 16121
Gerrit-PatchSet: 4
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Jun 2020 19:33:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() method

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/16121 )

Change subject: IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() method
..

IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() method

ParallelFileMetadataLoader.loaders_ became a Map, but in one place we
still treated it as a List:

  loaders_.get(i).getPartDir()

'i' is an integer, while the key type of loaders_ is Path.

Unfortunately the code still compiled, because Map.get() accepts any
Object. The lookup returned null, so this statement threw a
NullPointerException. I changed the logic to avoid the NPE.
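
The pitfall can be reproduced in isolation. The sketch below is not the
Impala code: MapGetPitfall and lookupByIndex are hypothetical names, a
String stands in for the loader object, and java.nio.file.Path stands
in for the Hadoop Path type used as the key.

```java
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.HashMap;
import java.util.Map;

// Hypothetical reproduction of a List-style lookup on a Map.
final class MapGetPitfall {
  static String lookupByIndex(Map<Path, String> loaders, int i) {
    // This compiles: Map.get() takes an Object, so 'i' is boxed to an
    // Integer. No Integer equals a Path key, so the result is null.
    return loaders.get(i);
  }

  public static void main(String[] args) {
    Map<Path, String> loaders = new HashMap<>();
    loaders.put(Paths.get("/warehouse/t/p=1"), "loader for p=1");
    String loader = lookupByIndex(loaders, 0);
    System.out.println(loader == null);  // true
    // Calling a method on 'loader' here would throw a
    // NullPointerException.
  }
}
```

Iterating over loaders.values() or loaders.entrySet() avoids the
problem because no key lookup is involved.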

Testing:

  * Tested manually on an ACID table with deleted rows
  * I didn't add an automated test because I'm about to extend the ACID
    functionality, which will get rid of these exceptions
  * The other case where we can hit this NPE is when we get an
    IOException in the file metadata loader's load() method

Change-Id: I4322a18a1f07e4106789591b80d25105c565a2b6
Reviewed-on: http://gerrit.cloudera.org:8080/16121
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M fe/src/main/java/org/apache/impala/catalog/ParallelFileMetadataLoader.java
1 file changed, 7 insertions(+), 4 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/16121

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I4322a18a1f07e4106789591b80d25105c565a2b6
Gerrit-Change-Number: 16121
Gerrit-PatchSet: 5
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Reviewer: Zoltan Borok-Nagy 


[Impala-ASF-CR] IMPALA-9913: Use table id to match the table in drop table event

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16129 )

Change subject: IMPALA-9913: Use table id to match the table in drop table event
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6469/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16129
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id6a80bbf5757e46318af1b57911fc127d7dd1f01
Gerrit-Change-Number: 16129
Gerrit-PatchSet: 1
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Anurag Mantripragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 30 Jun 2020 19:32:45 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9913: Use table id to match the table in drop table event

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16129 )

Change subject: IMPALA-9913: Use table id to match the table in drop table event
..


Patch Set 2:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6083/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16129

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id6a80bbf5757e46318af1b57911fc127d7dd1f01
Gerrit-Change-Number: 16129
Gerrit-PatchSet: 2
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Anurag Mantripragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 30 Jun 2020 19:23:18 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9913: Use table id to match the table in drop table event

2020-06-30 Thread Anurag Mantripragada (Code Review)
Anurag Mantripragada has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16129 )

Change subject: IMPALA-9913: Use table id to match the table in drop table event
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16129

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id6a80bbf5757e46318af1b57911fc127d7dd1f01
Gerrit-Change-Number: 16129
Gerrit-PatchSet: 2
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Anurag Mantripragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 30 Jun 2020 19:22:48 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9913: Use table id to match the table in drop table event

2020-06-30 Thread Vihang Karajgaonkar (Code Review)
Vihang Karajgaonkar has uploaded a new patch set (#2). ( 
http://gerrit.cloudera.org:8080/16129 )

Change subject: IMPALA-9913: Use table id to match the table in drop table event
..

IMPALA-9913: Use table id to match the table in drop table event

This patch modifies the matching logic of the table while
processing a DROP_TABLE event. The EventsProcessor currently relies
on the createTime of the table to determine whether the table in the
catalog matches the table in the event. If there is a time skew
between the different HMS instances and the same table is dropped
and recreated, it is possible that the creation time of the table
present in the catalog is less than or equal to the creationTime of
the table which has been dropped. In such a case, there is a false
negative during the matching and the DROP_TABLE event is skipped.

HIVE-20556 exposes an id field of the table which can be used to
uniquely identify a table. A table which is dropped and recreated
with the same name has a different id. This patch makes use of this
id to match the table in catalog with the table from the HMS event.
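
The id-based match described above can be sketched like this. It is an
illustration only, not the code in CatalogServiceCatalog: the
DropTableMatcher and TableInfo classes and their fields are
hypothetical stand-ins for the catalog and HMS table objects.

```java
// Hypothetical sketch of matching a catalog table against the table
// carried by a DROP_TABLE event.
final class DropTableMatcher {
  // Stand-in for the metastore table: only the fields used here.
  static final class TableInfo {
    final String name;
    final long id;         // unique per created table (assumed)
    final int createTime;  // may be skewed across HMS instances

    TableInfo(String name, long id, int createTime) {
      this.name = name;
      this.id = id;
      this.createTime = createTime;
    }
  }

  // Two tables match only if both name and id agree; createTime is
  // deliberately not consulted.
  static boolean sameTable(TableInfo inCatalog, TableInfo fromEvent) {
    return inCatalog.name.equals(fromEvent.name)
        && inCatalog.id == fromEvent.id;
  }

  public static void main(String[] args) {
    // The table was dropped and recreated, and clock skew made the
    // recreated table look older than the dropped one.
    TableInfo dropped = new TableInfo("db.t", 17L, 1000);
    TableInfo recreated = new TableInfo("db.t", 42L, 990);
    System.out.println(sameTable(recreated, dropped));    // false
    System.out.println(sameTable(recreated, recreated));  // true
  }
}
```

Because the comparison ignores timestamps, skew between HMS instances
cannot change its outcome.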

Note that the DROP_DATABASE event continues to use createTime for
matching. We should use a database id for it when it is available.
I have created HIVE-23785 for tracking that on the HMS side.

Testing:
1. Ran Events processor tests.

Change-Id: Id6a80bbf5757e46318af1b57911fc127d7dd1f01
---
M fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java
M fe/src/test/java/org/apache/impala/catalog/events/MetastoreEventsProcessorTest.java
2 files changed, 10 insertions(+), 10 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/29/16129/2
--
To view, visit http://gerrit.cloudera.org:8080/16129

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Id6a80bbf5757e46318af1b57911fc127d7dd1f01
Gerrit-Change-Number: 16129
Gerrit-PatchSet: 2
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Anurag Mantripragada 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-9913: Use table id to match the table in drop table event

2020-06-30 Thread Vihang Karajgaonkar (Code Review)
Vihang Karajgaonkar has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/16129 )


Change subject: IMPALA-9913: Use table id to match the table in drop table event
..

IMPALA-9913: Use table id to match the table in drop table event

This patch modifies the matching logic of the table while
processing a DROP_TABLE event. The EventsProcessor currently relies
on the createTime of the table to determine whether the table in the
catalog matches the table in the event. If there is a time skew
between the different HMS instances and the same table is dropped
and recreated, it is possible that the creation time of the table
present in the catalog is less than or equal to the creationTime of
the table which has been dropped. In such a case, there is a false
negative during the matching and the DROP_TABLE event is skipped.

HIVE-20556 exposes an id field of the table which can be used to
uniquely identify a table. A table which is dropped and recreated
with the same name has a different id. This patch makes use of this
id to match the table in catalog with the table from the HMS event.

Note that the DROP_DATABASE event continues to use createTime for
matching. We should use a database id for it when it is available.
I have created HIVE-23785 for tracking that on the HMS side.

Testing:
1. Ran Events processor tests.

Change-Id: Id6a80bbf5757e46318af1b57911fc127d7dd1f01
---
M fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java
M fe/src/test/java/org/apache/impala/catalog/events/MetastoreEventsProcessorTest.java
2 files changed, 10 insertions(+), 10 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/29/16129/1
--
To view, visit http://gerrit.cloudera.org:8080/16129

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Id6a80bbf5757e46318af1b57911fc127d7dd1f01
Gerrit-Change-Number: 16129
Gerrit-PatchSet: 1
Gerrit-Owner: Vihang Karajgaonkar 


[Impala-ASF-CR] IMPALA-9692 (part 4): Rename QuerySchedule to ScheduleState

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16122 )

Change subject: IMPALA-9692 (part 4): Rename QuerySchedule to ScheduleState
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6468/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16122

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I57069c4a426f3e697df7e2a07754d063bdea26f7
Gerrit-Change-Number: 16122
Gerrit-PatchSet: 3
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Comment-Date: Tue, 30 Jun 2020 18:22:12 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9692 (part 4): Rename QuerySchedule to ScheduleState

2020-06-30 Thread Thomas Tauber-Marshall (Code Review)
Hello Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/16122

to look at the new patch set (#3).

Change subject: IMPALA-9692 (part 4): Rename QuerySchedule to ScheduleState
..

IMPALA-9692 (part 4): Rename QuerySchedule to ScheduleState

This is the final patch in the refactor of QuerySchedule for the
single admission controller work. It was kept separate for ease of
reviewing.

This patch renames QuerySchedule and its related classes
(FInstanceExecParams, BackendExecParams, and FragmentExecParams) to
use the name 'ScheduleState', reflecting the fact that they are used
as containers for intermediate info about a query during scheduling.

The messages that are included in the QuerySchedulePB struct retain
the 'ExecParams' name, reflecting the fact that they are used by the
coordinator to start execution.

Change-Id: I57069c4a426f3e697df7e2a07754d063bdea26f7
---
M be/src/scheduling/CMakeLists.txt
M be/src/scheduling/admission-controller-test.cc
M be/src/scheduling/admission-controller.cc
M be/src/scheduling/admission-controller.h
R be/src/scheduling/schedule-state.cc
R be/src/scheduling/schedule-state.h
M be/src/scheduling/scheduler-test-util.h
M be/src/scheduling/scheduler.cc
M be/src/scheduling/scheduler.h
9 files changed, 631 insertions(+), 625 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/22/16122/3
--
To view, visit http://gerrit.cloudera.org:8080/16122

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I57069c4a426f3e697df7e2a07754d063bdea26f7
Gerrit-Change-Number: 16122
Gerrit-PatchSet: 3
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 


[Impala-ASF-CR] IMPALA-9898: Plan generation and execution for grouping sets

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16115 )

Change subject: IMPALA-9898: Plan generation and execution for grouping sets
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6467/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16115

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id474c5373860b0d8014ee9c844a3fb90092be968
Gerrit-Change-Number: 16115
Gerrit-PatchSet: 3
Gerrit-Owner: Aman Sinha 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 17:34:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9692 (part 4): Rename QuerySchedule to ScheduleState

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16122 )

Change subject: IMPALA-9692 (part 4): Rename QuerySchedule to ScheduleState
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6466/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16122

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I57069c4a426f3e697df7e2a07754d063bdea26f7
Gerrit-Change-Number: 16122
Gerrit-PatchSet: 2
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Comment-Date: Tue, 30 Jun 2020 17:31:35 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9898: Plan generation and execution for grouping sets

2020-06-30 Thread Aman Sinha (Code Review)
Aman Sinha has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16115 )

Change subject: IMPALA-9898: Plan generation and execution for grouping sets
..


Patch Set 3:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/16115/2/fe/src/main/java/org/apache/impala/analysis/MultiAggregateInfo.java
File fe/src/main/java/org/apache/impala/analysis/MultiAggregateInfo.java:

http://gerrit.cloudera.org:8080/#/c/16115/2/fe/src/main/java/org/apache/impala/analysis/MultiAggregateInfo.java@544
PS2, Line 544: if (aggInfo.getGroupingExprs().size() == 0) {
> I think it would be more efficient to support not providing the NULL groupi
Yes, the NULL exprs are included in the intermediate tuples. I felt that 
getting the positional index right (for grouping_id/grouping function support) 
could be prone to bugs if the grouping sets did not all have the same fixed 
number of exprs. I have left this as-is for now in the latest patchset, but if 
needed I can do some refactoring as Part-2 based on the integration with your 
patch.



--
To view, visit http://gerrit.cloudera.org:8080/16115

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id474c5373860b0d8014ee9c844a3fb90092be968
Gerrit-Change-Number: 16115
Gerrit-PatchSet: 3
Gerrit-Owner: Aman Sinha 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 17:25:32 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9898: Plan generation and execution for grouping sets

2020-06-30 Thread Aman Sinha (Code Review)
Hello Tim Armstrong, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/16115

to look at the new patch set (#3).

Change subject: IMPALA-9898: Plan generation and execution for grouping sets
..

IMPALA-9898: Plan generation and execution for grouping sets

This patch enhances MultiAggregateInfo to handle grouping sets
and rollup (which is converted to grouping sets). This patch does
not itself parse or validate grouping sets syntax, but rather
provides the following supporting functionality:
  - A separate analyze method that accepts aggregation classes and
    aggregation info that have been created separately.
  - A modified Transpose phase that uses a combination of aggif(),
    valid_tid() functions and CASE exprs to choose exactly which
    slots from the underlying aggregate classes need to be output
    based on the tuple id.
  - A modified materialization step where all aggregate slots and
    grouping slots are materialized in case of grouping sets.
  - Creation of a grouping_id value for grouping sets. The grouping_id
    function in SQL describes which expressions are grouped by in a
    particular row of a query with grouping sets.
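
The conversion of rollup to grouping sets mentioned above follows the
standard SQL equivalence: ROLLUP(a, b, c) is the same as GROUPING SETS
((a, b, c), (a, b), (a), ()). A minimal sketch of that expansion is
shown below. RollupExpansion and rollupToGroupingSets are hypothetical
names used only for illustration; they are not part of
MultiAggregateInfo, and plain strings stand in for Expr objects.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch: expand ROLLUP(e1, ..., en) into grouping sets.
final class RollupExpansion {
  // Returns the n + 1 prefixes of 'exprs', longest first. The last
  // grouping set is empty and corresponds to the grand total row.
  static List<List<String>> rollupToGroupingSets(List<String> exprs) {
    List<List<String>> sets = new ArrayList<>();
    for (int len = exprs.size(); len >= 0; --len) {
      sets.add(new ArrayList<>(exprs.subList(0, len)));
    }
    return sets;
  }

  public static void main(String[] args) {
    // Prints [[a, b, c], [a, b], [a], []]
    System.out.println(rollupToGroupingSets(Arrays.asList("a", "b", "c")));
  }
}
```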

Testing:
  This patch is not individually testable but will be tested
  as part of the overall grouping set support.

Change-Id: Id474c5373860b0d8014ee9c844a3fb90092be968
---
M fe/src/main/java/org/apache/impala/analysis/MultiAggregateInfo.java
1 file changed, 266 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/15/16115/3
--
To view, visit http://gerrit.cloudera.org:8080/16115

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Id474c5373860b0d8014ee9c844a3fb90092be968
Gerrit-Change-Number: 16115
Gerrit-PatchSet: 3
Gerrit-Owner: Aman Sinha 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9692 (part 3): Model QuerySchedule as a protobuf

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15961 )

Change subject: IMPALA-9692 (part 3): Model QuerySchedule as a protobuf
..


Patch Set 10: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/15961

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1db64e72f84604b1d8ac24e0bdd4ad6bedd6bcd9
Gerrit-Change-Number: 15961
Gerrit-PatchSet: 10
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Comment-Date: Tue, 30 Jun 2020 17:03:22 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9692 (part 3): Model QuerySchedule as a protobuf

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15961 )

Change subject: IMPALA-9692 (part 3): Model QuerySchedule as a protobuf
..


Patch Set 10:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6082/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/15961

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1db64e72f84604b1d8ac24e0bdd4ad6bedd6bcd9
Gerrit-Change-Number: 15961
Gerrit-PatchSet: 10
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Comment-Date: Tue, 30 Jun 2020 17:03:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9692 (part 3): Model QuerySchedule as a protobuf

2020-06-30 Thread Thomas Tauber-Marshall (Code Review)
Thomas Tauber-Marshall has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15961 )

Change subject: IMPALA-9692 (part 3): Model QuerySchedule as a protobuf
..


Patch Set 9: Code-Review+2

(1 comment)

rebased, carrying forward

http://gerrit.cloudera.org:8080/#/c/15961/4/be/src/scheduling/query-schedule.h
File be/src/scheduling/query-schedule.h:

http://gerrit.cloudera.org:8080/#/c/15961/4/be/src/scheduling/query-schedule.h@58
PS4, Line 58: er
> JIRA?
https://gerrit.cloudera.org/#/c/16122/



--
To view, visit http://gerrit.cloudera.org:8080/15961

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1db64e72f84604b1d8ac24e0bdd4ad6bedd6bcd9
Gerrit-Change-Number: 15961
Gerrit-PatchSet: 9
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Comment-Date: Tue, 30 Jun 2020 17:03:08 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9692 (part 4): Rename QuerySchedule to ScheduleState

2020-06-30 Thread Thomas Tauber-Marshall (Code Review)
Thomas Tauber-Marshall has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16122 )

Change subject: IMPALA-9692 (part 4): Rename QuerySchedule to ScheduleState
..


Patch Set 2:

(18 comments)

http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/admission-controller-test.cc
File be/src/scheduling/admission-controller-test.cc:

http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/admission-controller-test.cc@131
PS1, Line 131:   
backend_schedule_state.exec_params->set_min_mem_reservation_bytes(
> line too long (99 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/admission-controller.h
File be/src/scheduling/admission-controller.h:

http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/admission-controller.h@246
PS1, Line 246: /// afterwards call CanAdmitRequest() for each of the schedules. 
Executor groups are
> line too long (91 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/admission-controller.h@993
PS1, Line 993:   /// Backend is an Executor or a Coordinator.
> line too long (92 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/admission-controller.cc
File be/src/scheduling/admission-controller.cc:

http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/admission-controller.cc@675
PS1, Line 675:   largest_min_mem_reservation =
> line too long (91 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/schedule-state.h
File be/src/scheduling/schedule-state.h:

http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/schedule-state.h@61
PS1, Line 61:   explicit BackendScheduleState(BackendExecParamsPB* exec_params)
> line too long (95 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/schedule-state.h@65
PS1, Line 65: /// Map from an impalad backend address to the exec params for 
that backend.
> line too long (91 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/schedule-state.h@83
PS1, Line 83:   /// Contains any info that needs to be sent back to the 
coordinator. Computed during
> line too long (94 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/schedule-state.h@134
PS1, Line 134: ///   pointers to corresponding FragmentExecParamsPBs created in 
the ExecParamsPB.
> line too long (91 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/schedule-state.h@210
PS1, Line 210: return per_backend_schedule_states_;
> line too long (97 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/schedule-state.cc
File be/src/scheduling/schedule-state.cc:

http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/schedule-state.cc@163
PS1, Line 163: count_map.insert(
> line too long (98 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/schedule-state.cc@204
PS1, Line 204: const BackendScheduleState& bp = elem.second;
> line too long (93 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/scheduler.h
File be/src/scheduling/scheduler.h:

http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/scheduler.h@355
PS1, Line 355:   /// Computes execution parameters for all backends assigned in 
the query and always one
> line too long (93 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/scheduler.h@413
PS1, Line 413:   const std::vector& scan_ids,
> line too long (93 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/scheduler.cc
File be/src/scheduling/scheduler.cc:

http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/scheduler.cc@224
PS1, Line 224:   FragmentScheduleState* src_params =
> line too long (92 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/scheduler.cc@828
PS1, Line 828:   coord_min_reservation = 
coord_be_state.exec_params->min_mem_reservation_bytes();
> line too long (92 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/scheduler.cc@842
PS1, Line 842:   }
> line too long (96 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/scheduler.cc@853
PS1, Line 853:<< ") ";
> line too long (98 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/16122/1/be/src/scheduling/scheduler.cc@857
PS1, Line 857:  TUnit::UNIT)
> line too long (94 > 90)
Done



--
To view, visit http://gerrit.cloudera.org:8080/16122

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I57069c4a426f3e697df7e2a07754d063bdea26f7
Gerrit-Change-Number: 16122
Gerrit-PatchSet: 2
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Comment-Date: Tue, 30 Jun 2020 17

[Impala-ASF-CR] IMPALA-9692 (part 4): Rename QuerySchedule to ScheduleState

2020-06-30 Thread Thomas Tauber-Marshall (Code Review)
Hello Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/16122

to look at the new patch set (#2).

Change subject: IMPALA-9692 (part 4): Rename QuerySchedule to ScheduleState
..

IMPALA-9692 (part 4): Rename QuerySchedule to ScheduleState

This is the final patch in the refactor of QuerySchedule for the
single admission controller work. It was kept separate for ease of
reviewing.

This patch renames QuerySchedule and its related classes
(FInstanceExecParams, BackendExecParams, and FragmentExecParams) to
use the name 'ScheduleState', reflecting the fact that they are used
as containers for intermediate info about a query during scheduling.

The messages that are included in the QuerySchedulePB struct retain
the 'ExecParams' name, reflecting the fact that they are used by the
coordinator to start execution.

Change-Id: I57069c4a426f3e697df7e2a07754d063bdea26f7
---
M .clang-format
M be/src/scheduling/CMakeLists.txt
M be/src/scheduling/admission-controller-test.cc
M be/src/scheduling/admission-controller.cc
M be/src/scheduling/admission-controller.h
R be/src/scheduling/schedule-state.cc
R be/src/scheduling/schedule-state.h
M be/src/scheduling/scheduler-test-util.h
M be/src/scheduling/scheduler.cc
M be/src/scheduling/scheduler.h
10 files changed, 616 insertions(+), 611 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/22/16122/2
--
To view, visit http://gerrit.cloudera.org:8080/16122
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I57069c4a426f3e697df7e2a07754d063bdea26f7
Gerrit-Change-Number: 16122
Gerrit-PatchSet: 2
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis

2020-06-30 Thread Shant Hovsepian (Code Review)
Shant Hovsepian has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16112 )

Change subject: IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis
..


Patch Set 10: Code-Review+1

(2 comments)

LGTM, nice approach of keeping the groupingExprs_ in the select stmt.

http://gerrit.cloudera.org:8080/#/c/16112/10/fe/src/main/cup/sql-parser.cup
File fe/src/main/cup/sql-parser.cup:

http://gerrit.cloudera.org:8080/#/c/16112/10/fe/src/main/cup/sql-parser.cup@3076
PS10, Line 3076: // * GROUP BY a, b, c WITH CUBE - non-standard CUBE syntax 
supported by some systems
Add example of GROUP BY GROUPING SETS ?


http://gerrit.cloudera.org:8080/#/c/16112/10/fe/src/main/java/org/apache/impala/analysis/GroupByClause.java
File fe/src/main/java/org/apache/impala/analysis/GroupByClause.java:

http://gerrit.cloudera.org:8080/#/c/16112/10/fe/src/main/java/org/apache/impala/analysis/GroupByClause.java@121
PS10, Line 121:   private void addGroupingID(long id, List groupingExprs) 
throws AnalysisException {
Nit but might be cleaner to use the JDK BitSet class especially down the line 
when filling out the multiagg info.



--
To view, visit http://gerrit.cloudera.org:8080/16112
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I178e45de94d736630c97ae1ec4a92423cd74621f
Gerrit-Change-Number: 16112
Gerrit-PatchSet: 10
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Shant Hovsepian 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 16:52:03 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9540 Test that Impala Shell no longer sends duplicate "Host" headers in http mode.

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15752 )

Change subject: IMPALA-9540 Test that Impala Shell no longer sends duplicate 
"Host" headers in http mode.
..


Patch Set 6: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/15752
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I82996015d0205923e854dac8bb88604778684c46
Gerrit-Change-Number: 15752
Gerrit-PatchSet: 6
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 30 Jun 2020 16:45:13 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9540 Test that Impala Shell no longer sends duplicate "Host" headers in http mode.

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15752 )

Change subject: IMPALA-9540 Test that Impala Shell no longer sends duplicate 
"Host" headers in http mode.
..


Patch Set 6:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6081/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/15752
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I82996015d0205923e854dac8bb88604778684c46
Gerrit-Change-Number: 15752
Gerrit-PatchSet: 6
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 30 Jun 2020 16:45:14 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9540 Test that Impala Shell no longer sends duplicate "Host" headers in http mode.

2020-06-30 Thread Andrew Sherman (Code Review)
Andrew Sherman has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15752 )

Change subject: IMPALA-9540 Test that Impala Shell no longer sends duplicate 
"Host" headers in http mode.
..


Patch Set 5: Code-Review+2

Bring forward +2
I rebased and ran all tests


--
To view, visit http://gerrit.cloudera.org:8080/15752
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I82996015d0205923e854dac8bb88604778684c46
Gerrit-Change-Number: 15752
Gerrit-PatchSet: 5
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 30 Jun 2020 16:42:08 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16112 )

Change subject: IMPALA-9897: GROUPING SETS/CUBE/ROLLUP parsing and analysis
..


Patch Set 10:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6080/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/16112
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I178e45de94d736630c97ae1ec4a92423cd74621f
Gerrit-Change-Number: 16112
Gerrit-PatchSet: 10
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Shant Hovsepian 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 15:59:16 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5444: Asynchronous code generation

2020-06-30 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15105 )

Change subject: IMPALA-5444: Asynchronous code generation
..


Patch Set 43: Code-Review+2

(5 comments)

I have a few optional comments about the tests.
Great work!

http://gerrit.cloudera.org:8080/#/c/15105/43/tests/query_test/test_async_codegen.py
File tests/query_test/test_async_codegen.py:

http://gerrit.cloudera.org:8080/#/c/15105/43/tests/query_test/test_async_codegen.py@83
PS43, Line 83: cls.ImpalaTestMatrix.add_dimension(create_exec_option_dimension(
             :     cluster_sizes=[1],
             :     disable_codegen_options=[False],
             :     batch_sizes=[0],
             :     disable_codegen_rows_threshold_options=[0],
             :     debug_action_options=[cls.DEBUG_ACTION_CODEGEN_FINISH_BEFORE_EXEC_START,
             :         cls.DEBUG_ACTION_CODEGEN_FINISH_DURING_EXEC,
             :         cls.DEBUG_ACTION_EXEC_FINISH_BEFORE_CODEGEN]))
maybe it could be simpler to run the query with the 3 different debug actions 
in test_async_codegen instead of creating an extra dimension
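
For illustration only, the idea of looping over the debug actions inside the test could look roughly like the sketch below. `run_query` and the action strings used in the example call are hypothetical stand-ins, not the helpers or DEBUG_ACTION_* constants from the patch; only the `debug_action` query option name comes from Impala.

```python
def run_with_debug_actions(run_query, query, debug_actions):
    """Run `query` once per debug action and collect the results keyed by action.

    `run_query` is a hypothetical callable taking (query, exec_options); in the real
    test it would be whatever helper the test class already uses to execute queries.
    """
    results = {}
    for action in debug_actions:
        # Build a fresh options dict for every run so the runs stay independent.
        exec_options = {"debug_action": action}
        results[action] = run_query(query, exec_options)
    return results


if __name__ == "__main__":
    # Placeholder action names, used here only to show the call shape.
    actions = ["placeholder_action_1", "placeholder_action_2", "placeholder_action_3"]

    # A fake runner that simply echoes the debug action it was given.
    def fake_runner(query, exec_options):
        return exec_options["debug_action"]

    print(run_with_debug_actions(fake_runner, "select 1", actions))
```

Whether this is simpler than an extra test dimension depends on how the per-action results need to be reported; a loop yields a single test case, while a dimension yields one case per action.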


http://gerrit.cloudera.org:8080/#/c/15105/43/tests/query_test/test_async_codegen.py@136
PS43, Line 136:   def __find_event_sequence(self, profile):
              :     # The lines corresponding to the events in the event sequence.
              :     events = []
              :
              :     # The number of leading whitespace in the lines containing the events, used to
              :     # detect the line after the last event.
              :     indent_len = None
              :
              :     # Set to true when encountering the header before the first event. This means the
              :     # following lines contain the events.
              :     found_events_start = False
              :
              :     for line in profile.split('\n'):
              :       if found_events_start:
              :         leading_whitespace = len(line) - len(line.lstrip())
              :         if indent_len is None:
              :           # This was the first event. We store the indentation of the events.
              :           indent_len = leading_whitespace
              :         elif leading_whitespace < indent_len:
              :           # We've reached the line after the events, stop the iteration.
              :           break
              :
              :         # If we reach here we are processing a line containing an event.
              :         events.append(self.__extract_event_name(line))
              :
              :       elif 'Fragment Instance Lifecycle Event Timeline' in line:
              :         found_events_start = True
              :
              :     return events
              :
              :   def __extract_event_name(self, line):
              :     start = line.index('-') + 2  # There is a space after the dash.
              :     end = line.index(':')
              :     return line[start:end]
This could be moved to common code, e.g. 
https://github.com/apache/impala/blob/master/tests/common/test_result_verifier.py

We do similar things at other places, e.g. 
https://github.com/apache/impala/blob/2e07d0c07febf1d1ee9324708d543b792fb45b00/tests/query_test/test_observability.py#L404,
 but I prefer your version.
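
As a rough sketch of what "moved to common code" might mean, the quoted methods could become a module-level helper like the one below. The function names and the sample profile text are assumptions made for illustration; only the parsing logic and the 'Fragment Instance Lifecycle Event Timeline' marker come from the quoted patch.

```python
EVENT_TIMELINE_MARKER = 'Fragment Instance Lifecycle Event Timeline'


def extract_event_name(line):
    """Return the text between '- ' and the first ':' of an event line."""
    start = line.index('-') + 2  # There is a space after the dash.
    end = line.index(':')
    return line[start:end]


def find_event_sequence(profile):
    """Return the event names listed under the first event timeline in `profile`."""
    events = []
    indent_len = None  # Indentation of the event lines, learned from the first one.
    found_events_start = False

    for line in profile.split('\n'):
        if found_events_start:
            leading_whitespace = len(line) - len(line.lstrip())
            if indent_len is None:
                # First event line: remember how deeply events are indented.
                indent_len = leading_whitespace
            elif leading_whitespace < indent_len:
                # A less-indented line ends the timeline block.
                break
            events.append(extract_event_name(line))
        elif EVENT_TIMELINE_MARKER in line:
            found_events_start = True

    return events


if __name__ == "__main__":
    # Made-up profile fragment; real profiles contain many more sections.
    sample = "\n".join([
        "    Fragment Instance Lifecycle Event Timeline: 3ms",
        "       - Prepare Finished: 1ms (1ms)",
        "       - Open Finished: 2ms (1ms)",
        "    Other Section:",
    ])
    print(find_event_sequence(sample))
```

As in the quoted version, the helper assumes every line inside the timeline block contains both a dash and a colon; a malformed line would raise ValueError from `str.index`.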


http://gerrit.cloudera.org:8080/#/c/15105/43/tests/query_test/test_queries.py
File tests/query_test/test_queries.py:

http://gerrit.cloudera.org:8080/#/c/15105/43/tests/query_test/test_queries.py@75
PS43, Line 75: parquet
nit: Parquet


http://gerrit.cloudera.org:8080/#/c/15105/43/tests/query_test/test_queries.py@80
PS43, Line 80: beeswax
I would prefer hs2, as beeswax may be deprecated in the not too distant future.


http://gerrit.cloudera.org:8080/#/c/15105/43/tests/query_test/test_query_mem_limit.py
File tests/query_test/test_query_mem_limit.py:

http://gerrit.cloudera.org:8080/#/c/15105/43/tests/query_test/test_query_mem_limit.py@30
PS43, Line 30: add_exec_option_dimension,
Did something change in this file?



--
To view, visit http://gerrit.cloudera.org:8080/15105
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia7cbfa7c6734dcf03641629429057d6a4194aa6b
Gerrit-Change-Number: 15105
Gerrit-PatchSet: 43
Gerrit-Owner: Daniel Becker 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Jun 2020 15:13:37 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9744: Treat corrupt table stats as missing to avoid bad plans

2020-06-30 Thread Qifan Chen (Code Review)
Qifan Chen has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16098 )

Change subject: IMPALA-9744: Treat corrupt table stats as missing to avoid bad 
plans
..


Patch Set 11:

(4 comments)

http://gerrit.cloudera.org:8080/#/c/16098/10/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
File fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java:

http://gerrit.cloudera.org:8080/#/c/16098/10/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java@a1164
PS10, Line 1164:
> I thought that getNumClusteringCols.size() == number of partition columns f
Yes, by a single-partitioned table I mean a non-partitioned table. We used the 
term a lot at my previous job.


http://gerrit.cloudera.org:8080/#/c/16098/10/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java@1179
PS10, Line 1179: // If all partitions have good stats, return the total row 
count, contributed
   : // by all of them, as the row count for the table.
> > So to summarize, the goal here (or at least original intention of this JI
Yes, I agree that treating the missing and corrupt stats the same is a good 
idea, in the context of providing a good/useful row count (RC).

My first impression of the original logic, at line 1179,

if (numPartitionsWithNumRows_ > 0) return partitionNumRows_;

is that it can seriously under-estimate the RC when only one partition has the 
good stats. This is addressed by the fix.
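
To make the under-estimation concrete, here is a small sketch with made-up numbers. It assumes -1 marks a partition with missing or corrupt stats and that the stricter rule returns -1 (unknown) unless every partition has good stats; the function names are hypothetical and this is not the actual HdfsScanNode code.

```python
UNKNOWN = -1  # Assumed marker for missing or corrupt row-count stats.


def row_count_any_partition(partition_rows):
    """Older behaviour as described above: sum whatever partitions have stats."""
    known = [n for n in partition_rows if n >= 0]
    return sum(known) if known else UNKNOWN


def row_count_all_partitions(partition_rows):
    """Stricter rule: trust the total only if every partition has good stats."""
    if partition_rows and all(n >= 0 for n in partition_rows):
        return sum(partition_rows)
    return UNKNOWN


if __name__ == "__main__":
    # Four partitions, only the first one has usable stats.
    rows = [1000, UNKNOWN, UNKNOWN, UNKNOWN]
    print(row_count_any_partition(rows))   # counts a single partition only
    print(row_count_all_partitions(rows))  # reports the row count as unknown
```

With only one of four partitions contributing, the first function reports 1000 rows for the whole table, which is the kind of under-estimate described above.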


http://gerrit.cloudera.org:8080/#/c/16098/10/tests/metadata/test_compute_stats.py
File tests/metadata/test_compute_stats.py:

http://gerrit.cloudera.org:8080/#/c/16098/10/tests/metadata/test_compute_stats.py@199
PS10, Line 199: int_col int,
> Isn't it the load data local inpath query that is setting the stats to a co
Yes, the bad stats were created by the loading part. Sorry, I did not state it 
clearly in my comment above, and I missed a point, as follows.

If we create the table with 'create table like' in Impala, the testing table 
itself becomes an Impala one, regardless of being internal or external. See 
https://docs.cloudera.com/documentation/enterprise/6/6.3/topics/impala_tables.html#tables.

Normally, one needs to create a table "natively" in an engine to create a 
"native" table. Thus, if the objective is to test against a Hive table, the 
current version does the job. 

We could add a test case as you suggested to test the impala table.


http://gerrit.cloudera.org:8080/#/c/16098/10/tests/metadata/test_compute_stats.py@215
PS10, Line 215: # Make the table visible in Impala.
> It's set to true by default in Hive already: https://github.com/apache/hive
Maybe we add a comment here? The rationale is to call out the condition needed to 
reproduce the bad stats in the test, regardless of the default setting in Hive.



--
To view, visit http://gerrit.cloudera.org:8080/16098
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9f4c64616ff7c0b6d5a48f2b5331325feeff3576
Gerrit-Change-Number: 16098
Gerrit-PatchSet: 11
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 15:04:42 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6692: Trigger sort node run before hitting memory limit.

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15963 )

Change subject: IMPALA-6692: Trigger sort node run before hitting memory limit.
..


Patch Set 15:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6465/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15963
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2a0ba7c4bae4f1d300d4d9d7f594f63ced06a240
Gerrit-Change-Number: 15963
Gerrit-PatchSet: 15
Gerrit-Owner: Riza Suminto 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: David Rorke 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 14:45:01 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() method

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16121 )

Change subject: IMPALA-9907: Fix NullPointerException in 
ParallelFileMetadataLoader's load() method
..


Patch Set 4:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6079/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16121
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4322a18a1f07e4106789591b80d25105c565a2b6
Gerrit-Change-Number: 16121
Gerrit-PatchSet: 4
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Jun 2020 14:33:18 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() method

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16121 )

Change subject: IMPALA-9907: Fix NullPointerException in 
ParallelFileMetadataLoader's load() method
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16121
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4322a18a1f07e4106789591b80d25105c565a2b6
Gerrit-Change-Number: 16121
Gerrit-PatchSet: 4
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Jun 2020 14:33:17 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6692: Trigger sort node run before hitting memory limit.

2020-06-30 Thread Riza Suminto (Code Review)
Riza Suminto has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15963 )

Change subject: IMPALA-6692: Trigger sort node run before hitting memory limit.
..


Patch Set 15:

(8 comments)

Thank you, Csaba!

http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG@10
PS14, Line 10: batches are added to current unsorted run or memory limit is hit,
> missing 'are'?
Done


http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG@11
PS14, Line 11: sorter will immediately start the run. If the latter case 
happens,
> nit: happens
Done


http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG@13
PS14, Line 13: new unsorted run object, and continue to add the next row 
batches, and
> missing 'to'
Done


http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG@15
PS14, Line 15:
> nit: tries
Done


http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG@23
PS14, Line 23:
> nit: speeds up
Done


http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG@26
PS14, Line 26: 'sort_
> nit: exceeded
Done


http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG@32
PS14, Line 32: ssion
> nit: fit
Done


http://gerrit.cloudera.org:8080/#/c/15963/14/be/src/runtime/sorter.cc
File be/src/runtime/sorter.cc:

http://gerrit.cloudera.org:8080/#/c/15963/14/be/src/runtime/sorter.cc@952
PS14, Line 952: 3
> I didn't catch this one - I think that it should be also VLOG(3)
Done



--
To view, visit http://gerrit.cloudera.org:8080/15963
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2a0ba7c4bae4f1d300d4d9d7f594f63ced06a240
Gerrit-Change-Number: 15963
Gerrit-PatchSet: 15
Gerrit-Owner: Riza Suminto 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: David Rorke 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 14:18:47 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6692: Trigger sort node run before hitting memory limit.

2020-06-30 Thread Riza Suminto (Code Review)
Hello David Rorke, Tim Armstrong, Csaba Ringhofer, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/15963

to look at the new patch set (#15).

Change subject: IMPALA-6692: Trigger sort node run before hitting memory limit.
..

IMPALA-6692: Trigger sort node run before hitting memory limit.

The sorter node works by adding row batches to a sort run. After all
batches are added to the current unsorted run, or the memory limit is
hit, the sorter immediately starts sorting the run. In the latter case,
the sorter spills the sorted run to disk after the sort completes,
creates a new unsorted run object, continues to add the next row
batches, and so on.

This algorithm tries to fit as many rows as possible into memory before
it starts sorting. However, in the case of a partitioned sort with a
large number of row batches, fitting too many rows into memory makes the
sort slow and blocks the sorter node for a long time before it can
release some memory and continue accepting the next row batch from the
exchange node. One slow sorter node can block the exchange node from
sending row batches to other sorter nodes that are free.

This patch speeds up the decision to start the sort, without waiting
for it to hit the memory limit first, by capping the intermediate
quicksort run at a lower memory limit, determined by the query option
'sort_run_bytes_limit'. If the total used reservation of the quicksort
has exceeded sort_run_bytes_limit, the current unsorted_run_ is wrapped
up, sorted, and then spilled, thus overlapping the next sort run with
the spill from the previous sort run.

To reduce regressions in cases where the total input size of the sort
node might fit fully into available memory, sort_run_bytes_limit is not
enforced for the first sort run. However, it stays limited by
sort_run_bytes_limit if planner estimates hint that a spill will
inevitably happen.
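
The decision described in the paragraphs above can be sketched as follows. This is a simplified model, not the Sorter implementation: the function and parameter names are hypothetical, "memory limit hit" is modelled as a plain byte comparison, and a non-positive limit is assumed to mean "not limited".

```python
def should_close_run(used_bytes, mem_limit_bytes, sort_run_bytes_limit,
                     is_first_run, spill_expected):
    """Decide whether the current unsorted run should be sorted now.

    used_bytes           -- reservation currently used by the unsorted run
    mem_limit_bytes      -- hard memory limit for the sorter (simplified model)
    sort_run_bytes_limit -- soft cap per run; <= 0 is assumed to mean unlimited
    is_first_run         -- True while the first sort run is being filled
    spill_expected       -- True if planner estimates hint that a spill is inevitable
    """
    if used_bytes >= mem_limit_bytes:
        # Pre-existing behaviour: the run ends when the memory limit is hit.
        return True
    if sort_run_bytes_limit <= 0:
        # No soft cap configured.
        return False
    # The cap is skipped for the first run unless a spill is expected anyway.
    enforce_cap = (not is_first_run) or spill_expected
    return enforce_cap and used_bytes > sort_run_bytes_limit


if __name__ == "__main__":
    MB = 1024 * 1024
    # Second run, 600 MB used, 512 MB cap, 4 GB memory limit.
    print(should_close_run(600 * MB, 4096 * MB, 512 * MB,
                           is_first_run=False, spill_expected=False))
```

Skipping the cap on the first run lets an input that fits in memory be sorted in a single run without any spill.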

We also add a new summary counter 'AddBatchTime' that summarizes how
much time is spent in Sorter::AddBatch. The max of 'AddBatchTime'
indicates the longest time spent in Sorter::AddBatch, presumably busy
doing an intermediate sort.

Testing:
- Add new e2e test TestQueryFullSort::test_multiple_sort_run_bytes_limits
- Run core tests
- Run data loading of the 3 largest TPC-DS fact tables at 300GB scale into
  a real cluster using 5 backends and a 4GB mem_limit.
  sort_run_bytes_limit is varied between unspecified (not limited) and
  512 MB. The performance results are summarized in the following table.

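+---------------+---------+--------------+-----------------------+-----------------------+
| Insert table  | #Rows   | Avg          | no limit              | 512 MB limit          |
|               |         | SortDataSize +--------+--------------+--------+--------------+
|               |         | per Node     | Query  | Max          | Query  | Max          |
|               |         |              | Time   | AddBatchTime | Time   | AddBatchTime |
+---------------+---------+--------------+--------+--------------+--------+--------------+
| store_sales   | 864.00M | 15.29 GB     | 30m18s | 53s311ms     | 20m    | 5s634ms      |
+---------------+---------+--------------+--------+--------------+--------+--------------+
| catalog_sales | 431.97M | 11.34 GB     | 23m24s | 31s212ms     | 15m27s | 3s603ms      |
+---------------+---------+--------------+--------+--------------+--------+--------------+
| web_sales     | 216.01M | 5.67 GB      | 8m16s  | 29s250ms     | 6m41s  | 3s856ms      |
+---------------+---------+--------------+--------+--------------+--------+--------------+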

Change-Id: I2a0ba7c4bae4f1d300d4d9d7f594f63ced06a240
---
M be/src/exec/sort-node.cc
M be/src/exec/sort-node.h
M be/src/runtime/coordinator-backend-state.cc
M be/src/runtime/query-state.cc
M be/src/runtime/query-state.h
M be/src/runtime/sorter.cc
M be/src/runtime/sorter.h
M be/src/service/query-options-test.cc
M be/src/service/query-options.cc
M be/src/service/query-options.h
M common/thrift/ImpalaInternalService.thrift
M common/thrift/ImpalaService.thrift
M common/thrift/PlanNodes.thrift
M fe/src/main/java/org/apache/impala/planner/SortNode.java
M tests/query_test/test_sort.py
15 files changed, 225 insertions(+), 11 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/63/15963/15
--
To view, visit http://gerrit.cloudera.org:8080/15963
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I2a0ba7c4bae4f1d300d4d9d7f594f63ced06a240
Gerrit-Change-Number: 15963
Gerrit-PatchSet: 15
Gerrit-Owner: Riza Suminto 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: David Rorke 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9859: Full ACID Milestone 4: Part 1 Reading modified tables (primitive types)

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16082 )

Change subject: IMPALA-9859: Full ACID Milestone 4: Part 1 Reading modified 
tables (primitive types)
..


Patch Set 9: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/16082
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I15c8feabf40be1658f3dd46883f5a1b2aa5d0659
Gerrit-Change-Number: 16082
Gerrit-PatchSet: 9
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Jun 2020 14:02:31 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() method

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16121 )

Change subject: IMPALA-9907: Fix NullPointerException in 
ParallelFileMetadataLoader's load() method
..


Patch Set 3: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/6077/


-- 
To view, visit http://gerrit.cloudera.org:8080/16121
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4322a18a1f07e4106789591b80d25105c565a2b6
Gerrit-Change-Number: 16121
Gerrit-PatchSet: 3
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Jun 2020 13:32:40 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9890 (Part 1): Add more TPCDS queries to Impala's test suite

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16119 )

Change subject: IMPALA-9890 (Part 1): Add more TPCDS queries to Impala's test 
suite
..


Patch Set 5: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/16119
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib5f260e75a3803aabe9ccef271ba94036f96e5cf
Gerrit-Change-Number: 16119
Gerrit-PatchSet: 5
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 13:06:32 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9890 (Part 1): Add more TPCDS queries to Impala's test suite

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/16119 )

Change subject: IMPALA-9890 (Part 1): Add more TPCDS queries to Impala's test 
suite
..

IMPALA-9890 (Part 1): Add more TPCDS queries to Impala's test suite

This patch adds the following 12 TPCDS queries to the class
TestTpcdsDecimalV2Query: Q26, Q30, Q31, Q47, Q48, Q57, Q58, Q59, Q63,
Q83, Q85, and Q89. All the queries except Q31 are added to the class
TestTpcdsQuery as well; Q31 is excluded there because Impala returns one
fewer row than expected for TestTpcdsQuery::test_tpcds_q31(), which
requires further investigation.

To verify whether or not the returned result set from Impala for a given
query is correct, we compare the result set with that produced by the
HiveServer2 (HS2) in Impala's mini-cluster. We could execute SQL
statements in HS2 via Beeline, HS2's command line shell, which could be
launched by the following command.

beeline -u "jdbc:hive2://localhost:11050/default"

We note that among these 12 queries, the execution of Q31, Q58, and Q83
results in the error "Counters limit exceeded" from TEZ. To work around
this problem, for these 3 queries we have to execute the following
statement before running them to increase the number of counters, which
is set to 120 by default.

set tez.counters.max=1200

On the other hand, the table of 'reason' is referenced by Q85. This
table was not referenced by any TPCDS query before this patch and thus
was not created. In this regard, in this patch we also modify
tpcds_schema_template.sql to create this additional table along with its
data.

Testing:
- Verified that this patch passes the exhaustive tests in the DEBUG
  build.

Change-Id: Ib5f260e75a3803aabe9ccef271ba94036f96e5cf
Reviewed-on: http://gerrit.cloudera.org:8080/16119
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M testdata/datasets/tpcds/tpcds_schema_template.sql
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q26.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q30.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q31.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q47.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q48.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q57.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q58.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q59.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q63.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q83.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q85.test
A testdata/workloads/tpcds/queries/tpcds-decimal_v2-q89.test
A testdata/workloads/tpcds/queries/tpcds-q26.test
A testdata/workloads/tpcds/queries/tpcds-q30.test
A testdata/workloads/tpcds/queries/tpcds-q47.test
A testdata/workloads/tpcds/queries/tpcds-q48.test
A testdata/workloads/tpcds/queries/tpcds-q57.test
A testdata/workloads/tpcds/queries/tpcds-q58.test
A testdata/workloads/tpcds/queries/tpcds-q59.test
A testdata/workloads/tpcds/queries/tpcds-q63.test
A testdata/workloads/tpcds/queries/tpcds-q83.test
A testdata/workloads/tpcds/queries/tpcds-q85.test
A testdata/workloads/tpcds/queries/tpcds-q89.test
M tests/query_test/test_tpcds_queries.py
M tests/util/parse_util.py
26 files changed, 2,622 insertions(+), 1 deletion(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/16119
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Ib5f260e75a3803aabe9ccef271ba94036f96e5cf
Gerrit-Change-Number: 16119
Gerrit-PatchSet: 6
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() method

2020-06-30 Thread Zoltan Borok-Nagy (Code Review)
Zoltan Borok-Nagy has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16121 )

Change subject: IMPALA-9907: Fix NullPointerException in 
ParallelFileMetadataLoader's load() method
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/16121/1/fe/src/main/java/org/apache/impala/catalog/ParallelFileMetadataLoader.java
File fe/src/main/java/org/apache/impala/catalog/ParallelFileMetadataLoader.java:

http://gerrit.cloudera.org:8080/#/c/16121/1/fe/src/main/java/org/apache/impala/catalog/ParallelFileMetadataLoader.java@166
PS1, Line 166: loader,
 : pool.submit(() -> { loader.load(); return null; })));
> nit, can be moved to earlier line.
Done



--
To view, visit http://gerrit.cloudera.org:8080/16121
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4322a18a1f07e4106789591b80d25105c565a2b6
Gerrit-Change-Number: 16121
Gerrit-PatchSet: 1
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Jun 2020 12:57:44 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9569: Fix progress bar and live summary to show info of the retried query

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/16096 )

Change subject: IMPALA-9569: Fix progress bar and live_summary to show info of 
the retried query
..

IMPALA-9569: Fix progress bar and live_summary to show info of the retried query

Impala-shell periodically calls GetExecSummary() while the query is
queued or running. If the query is being retried, GetExecSummary()
should return the TExecSummary of the retried query, so that the progress
bar and live_summary reflect the most recent state.

This patch also modifies get_summary() to return retry information in
error_logs of TExecSummary. Impala-shell and other clients can print the
info right after the query starts being retried. Modified impala-shell
to print the retried query link when the retried query is running.
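The behaviour described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: QueryState, Summary and get_exec_summary are hypothetical names invented for this example and do not mirror the actual code in impala-server.cc or impala_shell.py.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class QueryState:
    """Hypothetical, simplified view of a query tracked by the server."""
    query_id: str
    progress_pct: int
    retried_as: Optional["QueryState"] = None


@dataclass
class Summary:
    """Hypothetical stand-in for the summary returned to the client."""
    query_id: str
    progress_pct: int
    error_logs: List[str] = field(default_factory=list)


def get_exec_summary(query: QueryState) -> Summary:
    """Return the summary of the retried query if a retry exists."""
    if query.retried_as is None:
        return Summary(query.query_id, query.progress_pct)
    retry = query.retried_as
    # Retry information travels in error_logs so clients can print it.
    note = f"Retrying query using query id: {retry.query_id}"
    return Summary(retry.query_id, retry.progress_pct, error_logs=[note])


original = QueryState("9444fe7f0df0da28:29134b08", 10)
before = get_exec_summary(original)
original.retried_as = QueryState("5748d9a3ccc28ba8:a75e2fab", 50)
after = get_exec_summary(original)
print(before.query_id, after.query_id, after.error_logs)
```

A client polling get_exec_summary() with the original query handle would therefore start seeing the progress of the retried query without having to change handles.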

Example output when the retried query is running:
Query: select count(*) from functional.alltypes where bool_col = sleep(60)
Query submitted at: 2020-06-18 22:08:49 (Coordinator: http://quanlong-OptiPlex-BJ:25000)
Query progress can be monitored at: http://quanlong-OptiPlex-BJ:25000/query_plan?query_id=9444fe7f0df0da28:29134b08
Failed due to unreachable impalad(s): quanlong-OptiPlex-BJ:22001

Retrying query using query id: 5748d9a3ccc28ba8:a75e2fab
Retried query link: http://quanlong-OptiPlex-BJ:25000/query_plan?query_id=5748d9a3ccc28ba8:a75e2fab
[###   ] 50%

Tests:
- Manually verify the progress bar and live_summary work when the query
  is being retried.
- Add tests in test_query_retries.py to validate the get_summary()
  results.

Change-Id: I8f96919f00e0b64d589efd15b6b5ec82fb725d56
Reviewed-on: http://gerrit.cloudera.org:8080/16096
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
M shell/impala_shell.py
M tests/common/impala_connection.py
M tests/custom_cluster/test_query_retries.py
5 files changed, 120 insertions(+), 28 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/16096
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I8f96919f00e0b64d589efd15b6b5ec82fb725d56
Gerrit-Change-Number: 16096
Gerrit-PatchSet: 8
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Thomas Tauber-Marshall 


[Impala-ASF-CR] IMPALA-9569: Fix progress bar and live summary to show info of the retried query

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16096 )

Change subject: IMPALA-9569: Fix progress bar and live_summary to show info of 
the retried query
..


Patch Set 7: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/16096
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8f96919f00e0b64d589efd15b6b5ec82fb725d56
Gerrit-Change-Number: 16096
Gerrit-PatchSet: 7
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Comment-Date: Tue, 30 Jun 2020 12:11:22 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9859: Full ACID Milestone 4: Part 1 Reading modified tables (primitive types)

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16082 )

Change subject: IMPALA-9859: Full ACID Milestone 4: Part 1 Reading modified 
tables (primitive types)
..


Patch Set 9:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6464/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16082
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I15c8feabf40be1658f3dd46883f5a1b2aa5d0659
Gerrit-Change-Number: 16082
Gerrit-PatchSet: 9
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Jun 2020 09:28:56 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9859: Full ACID Milestone 4: Part 1 Reading modified tables (primitive types)

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16082 )

Change subject: IMPALA-9859: Full ACID Milestone 4: Part 1 Reading modified 
tables (primitive types)
..


Patch Set 9:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6078/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/16082
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I15c8feabf40be1658f3dd46883f5a1b2aa5d0659
Gerrit-Change-Number: 16082
Gerrit-PatchSet: 9
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Jun 2020 09:02:53 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9859: Full ACID Milestone 4: Part 1 Reading modified tables (primitive types)

2020-06-30 Thread Zoltan Borok-Nagy (Code Review)
Zoltan Borok-Nagy has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16082 )

Change subject: IMPALA-9859: Full ACID Milestone 4: Part 1 Reading modified 
tables (primitive types)
..


Patch Set 9: Code-Review+1

PS9 is a rebase. Carrying +1.


--
To view, visit http://gerrit.cloudera.org:8080/16082
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I15c8feabf40be1658f3dd46883f5a1b2aa5d0659
Gerrit-Change-Number: 16082
Gerrit-PatchSet: 9
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Jun 2020 09:02:37 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9859: Full ACID Milestone 4: Part 1 Reading modified tables (primitive types)

2020-06-30 Thread Zoltan Borok-Nagy (Code Review)
Hello Aman Sinha, Tim Armstrong, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/16082

to look at the new patch set (#9).

Change subject: IMPALA-9859: Full ACID Milestone 4: Part 1 Reading modified 
tables (primitive types)
..

IMPALA-9859: Full ACID Milestone 4: Part 1 Reading modified tables (primitive 
types)

Hive ACID supports row-level DELETE and UPDATE operations on a table.
It achieves this by assigning a unique row-id to each row and
maintaining two sets of files in a table. The first set is in the
base/delta directories and contains the INSERTed rows. The second set
is in the delete-delta directories and contains the DELETEd rows.

(UPDATE operations are implemented via DELETE+INSERT.)

In the filesystem it looks like e.g.:
 * full_acid/delta_001_001_/_0
 * full_acid/delta_002_002_/_0
 * full_acid/delete_delta_003_003_/_0

During scanning we need to return the INSERTed rows minus the DELETEd rows.
This patch implements that by creating an ANTI JOIN between the INSERT and
DELETE sets. It is a planner-only modification. Every HDFS SCAN
that scans a full ACID table that also has deleted rows is converted
to two HDFS SCANs, one for the INSERT deltas and one for the DELETE
deltas. A LEFT ANTI HASH JOIN with BROADCAST distribution mode is then
created above them.
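
As a rough illustration of the scan semantics described above, the following Python sketch models the INSERT and DELETE sets as in-memory lists and applies a left anti join, with the delete set acting as the build side. The row representation and the function name are assumptions made for this example; they are not taken from the patch.

```python
from typing import Hashable, Iterable, List, Tuple

# Hypothetical row shape: (row_id, payload). A plain hashable value stands
# in for the real ACID row id here.
Row = Tuple[Hashable, dict]


def left_anti_join(insert_rows: Iterable[Row],
                   deleted_row_ids: Iterable[Hashable]) -> List[Row]:
    """Return the inserted rows whose row id is absent from the delete set."""
    # Build side: hash the delete set once.
    build = set(deleted_row_ids)
    # Probe side: stream the inserted rows and keep the non-matching ones.
    return [row for row in insert_rows if row[0] not in build]


inserted = [(1, {"v": "a"}), (2, {"v": "b"}), (3, {"v": "c"})]
deleted = [2]
visible = left_anti_join(inserted, deleted)
print(visible)
```

Hashing the delete set and probing with the inserted rows is the reason a small delete set suits a broadcast join: every node scanning insert deltas can hold the whole build side in memory.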

Later we can add support for other distribution modes if performance
requires it. E.g. if there are too many deleted rows then we are probably
better off with PARTITIONED distribution mode. We could estimate the
number of deleted rows by sampling the delete delta files.
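
The sampling idea could look roughly like the sketch below. Everything in it is hypothetical: the function names, the count_rows callback and the broadcast_row_limit threshold are assumptions made for illustration, and nothing like this is part of the current patch.

```python
import random
from typing import Callable, Sequence


def estimate_deleted_rows(delete_delta_files: Sequence[str],
                          count_rows: Callable[[str], int],
                          sample_size: int,
                          seed: int = 0) -> int:
    """Estimate the total number of deleted rows from a sample of files."""
    if sample_size < 1:
        raise ValueError("sample_size must be at least 1")
    if not delete_delta_files:
        return 0
    k = min(sample_size, len(delete_delta_files))
    sample = random.Random(seed).sample(list(delete_delta_files), k)
    sampled_rows = sum(count_rows(path) for path in sample)
    # Scale the per-file average of the sample up to the full file count.
    return round(sampled_rows / k * len(delete_delta_files))


def choose_distribution_mode(estimated_deleted_rows: int,
                             broadcast_row_limit: int) -> str:
    """Pick BROADCAST for small delete sets, PARTITIONED otherwise."""
    if estimated_deleted_rows <= broadcast_row_limit:
        return "BROADCAST"
    return "PARTITIONED"


files = ["dd_1", "dd_2", "dd_3", "dd_4"]
estimate = estimate_deleted_rows(files, lambda path: 10, sample_size=2)
print(estimate, choose_distribution_mode(estimate, broadcast_row_limit=100))
```

The estimate is only as good as the sample: if the delete delta files differ widely in size, a row-count-per-byte estimate weighted by file size would probably be more robust than the per-file average used here.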

The current patch only works for primitive types. I.e. we cannot select
nested data if the table has deleted rows.

Testing:
 * added planner test
 * added e2e tests

Change-Id: I15c8feabf40be1658f3dd46883f5a1b2aa5d0659
---
M fe/src/main/java/org/apache/impala/catalog/FeFsPartition.java
M fe/src/main/java/org/apache/impala/catalog/HdfsPartition.java
M fe/src/main/java/org/apache/impala/catalog/local/LocalFsPartition.java
M fe/src/main/java/org/apache/impala/planner/HashJoinNode.java
M fe/src/main/java/org/apache/impala/planner/JoinNode.java
M fe/src/main/java/org/apache/impala/planner/SingleNodePlanner.java
M fe/src/main/java/org/apache/impala/util/AcidUtils.java
M fe/src/test/java/org/apache/impala/planner/PlannerTest.java
M fe/src/test/java/org/apache/impala/util/AcidUtilsTest.java
M testdata/datasets/functional/functional_schema_template.sql
M testdata/datasets/functional/schema_constraints.csv
A testdata/workloads/functional-planner/queries/PlannerTest/acid-scans.test
M testdata/workloads/functional-query/queries/QueryTest/acid-negative.test
A testdata/workloads/functional-query/queries/QueryTest/full-acid-scans.test
M tests/query_test/test_acid.py
15 files changed, 1,148 insertions(+), 89 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/82/16082/9
--
To view, visit http://gerrit.cloudera.org:8080/16082
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I15c8feabf40be1658f3dd46883f5a1b2aa5d0659
Gerrit-Change-Number: 16082
Gerrit-PatchSet: 9
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Zoltan Borok-Nagy 


[Impala-ASF-CR] IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() method

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16121 )

Change subject: IMPALA-9907: Fix NullPointerException in 
ParallelFileMetadataLoader's load() method
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6463/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16121
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4322a18a1f07e4106789591b80d25105c565a2b6
Gerrit-Change-Number: 16121
Gerrit-PatchSet: 2
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Jun 2020 08:44:53 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() method

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16121 )

Change subject: IMPALA-9907: Fix NullPointerException in 
ParallelFileMetadataLoader's load() method
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6077/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16121
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4322a18a1f07e4106789591b80d25105c565a2b6
Gerrit-Change-Number: 16121
Gerrit-PatchSet: 3
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Jun 2020 08:28:57 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() method

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16121 )

Change subject: IMPALA-9907: Fix NullPointerException in 
ParallelFileMetadataLoader's load() method
..


Patch Set 3: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16121
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4322a18a1f07e4106789591b80d25105c565a2b6
Gerrit-Change-Number: 16121
Gerrit-PatchSet: 3
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Jun 2020 08:28:56 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() method

2020-06-30 Thread Zoltan Borok-Nagy (Code Review)
Zoltan Borok-Nagy has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16121 )

Change subject: IMPALA-9907: Fix NullPointerException in 
ParallelFileMetadataLoader's load() method
..


Patch Set 2: Code-Review+2

Carrying +2


--
To view, visit http://gerrit.cloudera.org:8080/16121
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4322a18a1f07e4106789591b80d25105c565a2b6
Gerrit-Change-Number: 16121
Gerrit-PatchSet: 2
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Jun 2020 08:28:39 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() method

2020-06-30 Thread Zoltan Borok-Nagy (Code Review)
Hello Vihang Karajgaonkar, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/16121

to look at the new patch set (#2).

Change subject: IMPALA-9907: Fix NullPointerException in 
ParallelFileMetadataLoader's load() method
..

IMPALA-9907: Fix NullPointerException in ParallelFileMetadataLoader's load() 
method

ParallelFileMetadataLoader.loaders_ became a Map, but at one place we
still treat it as a List:

  loaders_.get(i).getPartDir()

'i' is an integer, while the key type of loaders_ is Path.

Unfortunately the code still compiled, because Map.get() accepts any
Object as the key. The lookup returned null, so this statement threw a
NullPointerException. I changed the logic to avoid the NPE.
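
The failure mode can be reproduced outside of Impala. The sketch below is a Python analogue, not the Java code from the patch: a dict keyed by path strings is queried with an integer index, the lookup quietly yields None, and the error only surfaces when the result is dereferenced. The Loader class and the paths are hypothetical.

```python
class Loader:
    """Hypothetical stand-in for a per-partition file metadata loader."""

    def __init__(self, part_dir: str) -> None:
        self.part_dir = part_dir

    def get_part_dir(self) -> str:
        return self.part_dir


# The container used to be a list and is now a mapping keyed by path.
loaders = {
    "/warehouse/t/p=1": Loader("/warehouse/t/p=1"),
    "/warehouse/t/p=2": Loader("/warehouse/t/p=2"),
}

# Buggy pattern: an integer index is used as a key. dict.get() returns None
# for a missing key, much as Java's Map.get(Object) returns null.
stale = loaders.get(0)

try:
    stale.get_part_dir()
    failed = False
except AttributeError:
    # Python's counterpart of the NullPointerException.
    failed = True

# One way to avoid the problem: iterate over the entries instead of indexing.
part_dirs = [loader.get_part_dir() for loader in loaders.values()]
print(failed, part_dirs)
```

Iterating over the mapping is only one possible remedy and is shown as an assumption; the commit message does not spell out how the logic was changed.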

Testing:

  * Tested manually on an ACID table with deleted rows
  * I didn't add an automated test because I'm about to extend the ACID
functionalities, hence getting rid of these exceptions
  * The other case when we can hit this NPE is when we get an
IOException in the file metadata loader's load() method

Change-Id: I4322a18a1f07e4106789591b80d25105c565a2b6
---
M fe/src/main/java/org/apache/impala/catalog/ParallelFileMetadataLoader.java
1 file changed, 7 insertions(+), 4 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/21/16121/2
--
To view, visit http://gerrit.cloudera.org:8080/16121
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I4322a18a1f07e4106789591b80d25105c565a2b6
Gerrit-Change-Number: 16121
Gerrit-PatchSet: 2
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vihang Karajgaonkar 


[Impala-ASF-CR] IMPALA-6692: Trigger sort node run before hitting memory limit.

2020-06-30 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15963 )

Change subject: IMPALA-6692: Trigger sort node run before hitting memory limit.
..


Patch Set 14: Code-Review+1

(8 comments)

http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG@10
PS14, Line 10: batches added to current unsorted run or memory limit is hit, 
sorter
missing 'are'?


http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG@11
PS14, Line 11: will immediately start the run. If the latter case happen, 
sorter will
nit: happens


http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG@13
PS14, Line 13: run object, and continue add the next row batches, and so on.
missing 'to'


http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG@15
PS14, Line 15: try
nit: tries


http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG@23
PS14, Line 23: speedup
nit: speeds up


http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG@26
PS14, Line 26: exceed
nit: exceeded


http://gerrit.cloudera.org:8080/#/c/15963/14//COMMIT_MSG@32
PS14, Line 32: fitted
nit: fit


http://gerrit.cloudera.org:8080/#/c/15963/14/be/src/runtime/sorter.cc
File be/src/runtime/sorter.cc:

http://gerrit.cloudera.org:8080/#/c/15963/14/be/src/runtime/sorter.cc@952
PS14, Line 952: 2
I didn't catch this one - I think that it should also be VLOG(3)



--
To view, visit http://gerrit.cloudera.org:8080/15963
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2a0ba7c4bae4f1d300d4d9d7f594f63ced06a240
Gerrit-Change-Number: 15963
Gerrit-PatchSet: 14
Gerrit-Owner: Riza Suminto 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: David Rorke 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 08:19:11 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9890 (Part 1): Add more TPCDS queries to Impala's test suite

2020-06-30 Thread Quanlong Huang (Code Review)
Quanlong Huang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16119 )

Change subject: IMPALA-9890 (Part 1): Add more TPCDS queries to Impala's test 
suite
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16119
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib5f260e75a3803aabe9ccef271ba94036f96e5cf
Gerrit-Change-Number: 16119
Gerrit-PatchSet: 4
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 07:56:48 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9890 (Part 1): Add more TPCDS queries to Impala's test suite

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16119 )

Change subject: IMPALA-9890 (Part 1): Add more TPCDS queries to Impala's test 
suite
..


Patch Set 5:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6076/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16119
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib5f260e75a3803aabe9ccef271ba94036f96e5cf
Gerrit-Change-Number: 16119
Gerrit-PatchSet: 5
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 07:57:39 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9890 (Part 1): Add more TPCDS queries to Impala's test suite

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16119 )

Change subject: IMPALA-9890 (Part 1): Add more TPCDS queries to Impala's test 
suite
..


Patch Set 5: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16119
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib5f260e75a3803aabe9ccef271ba94036f96e5cf
Gerrit-Change-Number: 16119
Gerrit-PatchSet: 5
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Jun 2020 07:57:38 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9531: Dropped support for dateless timestamps

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15866 )

Change subject: IMPALA-9531: Dropped support for dateless timestamps
..


Patch Set 8:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6462/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15866
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I48c49bf027cc4b917849b3d58518facba372b322
Gerrit-Change-Number: 15866
Gerrit-PatchSet: 8
Gerrit-Owner: Adam Tamas 
Gerrit-Reviewer: Adam Tamas 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 30 Jun 2020 07:29:26 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9569: Fix progress bar and live summary to show info of the retried query

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16096 )

Change subject: IMPALA-9569: Fix progress bar and live_summary to show info of 
the retried query
..


Patch Set 7:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6075/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16096
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8f96919f00e0b64d589efd15b6b5ec82fb725d56
Gerrit-Change-Number: 16096
Gerrit-PatchSet: 7
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Comment-Date: Tue, 30 Jun 2020 07:08:45 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9569: Fix progress bar and live summary to show info of the retried query

2020-06-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16096 )

Change subject: IMPALA-9569: Fix progress bar and live_summary to show info of 
the retried query
..


Patch Set 7: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16096
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8f96919f00e0b64d589efd15b6b5ec82fb725d56
Gerrit-Change-Number: 16096
Gerrit-PatchSet: 7
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Comment-Date: Tue, 30 Jun 2020 07:08:44 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9531: Dropped support for dateless timestamps

2020-06-30 Thread Adam Tamas (Code Review)
Adam Tamas has uploaded a new patch set (#8). ( 
http://gerrit.cloudera.org:8080/15866 )

Change subject: IMPALA-9531: Dropped support for dateless timestamps
..

IMPALA-9531: Dropped support for dateless timestamps

Removed support for dateless timestamps.
When casting with a format that doesn't contain a date part, we get an
error during tokenization of the format. If the input string doesn't
contain a date part, the result is NULL.

Examples:
select cast('01:02:59' as timestamp);
This will come back as a NULL value.

select to_timestamp('01:01:01', 'HH:mm:ss');
select cast('01:02:59' as timestamp format 'HH12:MI:SS');
select cast('12 AM' as timestamp FORMAT 'AM.HH12');
These will come back with parsing errors.

Casting from a table will generate similar results.
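
The two outcomes described above (NULL for a dateless input, an error for a dateless format) can be mimicked with a small Python sketch. It is an analogue only: cast_to_timestamp, check_format and the DATE_TOKENS tuple are hypothetical and far simpler than Impala's tokenizers, which recognise many more tokens.

```python
from datetime import datetime
from typing import Optional

# Hypothetical, deliberately tiny set of date tokens for this example.
DATE_TOKENS = ("yyyy", "MM", "dd")


def cast_to_timestamp(text: str) -> Optional[datetime]:
    """Default-format cast: a string without a date part yields None."""
    try:
        return datetime.strptime(text, "%Y-%m-%d %H:%M:%S")
    except ValueError:
        return None


def check_format(fmt: str) -> None:
    """Reject a format string that contains no date token at all."""
    if not any(token in fmt for token in DATE_TOKENS):
        raise ValueError(f"format {fmt!r} has no date part")


dateless = cast_to_timestamp("01:02:59")
full = cast_to_timestamp("2020-06-30 01:02:59")

try:
    check_format("HH:mm:ss")
    format_rejected = False
except ValueError:
    format_rejected = True

print(dateless, full, format_rejected)
```

The split mirrors the commit message: a bad input value is a data problem and maps to NULL, while a bad format string is a query problem and is reported as an error.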

Testing:
Modified the previous tests related to dateless timestamps.
Added tests that read from tables which still contain dateless
timestamps, and covered the timestamp-to-string path when no date tokens
are requested in the output string.

Change-Id: I48c49bf027cc4b917849b3d58518facba372b322
---
M be/src/benchmarks/convert-timestamp-benchmark.cc
M be/src/benchmarks/parse-timestamp-benchmark.cc
M be/src/exec/text-converter.inline.h
M be/src/exprs/cast-functions-ir.cc
M be/src/exprs/expr-test.cc
M be/src/exprs/scalar-expr-evaluator.cc
M be/src/exprs/timestamp-functions-ir.cc
M be/src/exprs/timestamp-functions.cc
M be/src/exprs/timestamp-functions.h
M be/src/runtime/date-parse-util.cc
M be/src/runtime/date-test.cc
M be/src/runtime/datetime-iso-sql-format-tokenizer.cc
M be/src/runtime/datetime-parser-common.cc
M be/src/runtime/datetime-parser-common.h
M be/src/runtime/datetime-simple-date-format-parser.cc
M be/src/runtime/datetime-simple-date-format-parser.h
M be/src/runtime/timestamp-parse-util.cc
M be/src/runtime/timestamp-test.cc
M be/src/runtime/timestamp-value.h
M bin/rat_exclude_files.txt
M common/function-registry/impala_functions.py
M fe/src/test/java/org/apache/impala/analysis/AnalyzeKuduDDLTest.java
M testdata/data/README
A testdata/data/dateless_timestamps.parq
A testdata/data/dateless_timestamps.txt
M testdata/data/lazy_timestamp.csv
M testdata/workloads/functional-query/queries/QueryTest/date.test
A testdata/workloads/functional-query/queries/QueryTest/dateless_timestamp_parquet.test
A testdata/workloads/functional-query/queries/QueryTest/dateless_timestamp_text.test
M testdata/workloads/functional-query/queries/QueryTest/exprs.test
M testdata/workloads/functional-query/queries/QueryTest/select-lazy-timestamp.test
M tests/data_errors/test_data_errors.py
M tests/query_test/test_cast_with_format.py
M tests/query_test/test_scanners.py
34 files changed, 278 insertions(+), 231 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/66/15866/8
--
To view, visit http://gerrit.cloudera.org:8080/15866
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I48c49bf027cc4b917849b3d58518facba372b322
Gerrit-Change-Number: 15866
Gerrit-PatchSet: 8
Gerrit-Owner: Adam Tamas 
Gerrit-Reviewer: Adam Tamas 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins