[Impala-ASF-CR] IMPALA-10992 Planner changes for estimate peak memory

2022-03-21 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18178 )

Change subject: IMPALA-10992 Planner changes for estimate peak memory
..


Patch Set 30: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/18178
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I75cf17290be2c64fd4b732a5505bdac31869712a
Gerrit-Change-Number: 18178
Gerrit-PatchSet: 30
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Mon, 21 Mar 2022 15:35:10 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10992 Planner changes for estimate peak memory

2022-03-10 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has removed a vote on this change.

Change subject: IMPALA-10992 Planner changes for estimate peak memory
..


Removed Verified-1 by Impala Public Jenkins 
--
To view, visit http://gerrit.cloudera.org:8080/18178
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: deleteVote
Gerrit-Change-Id: I75cf17290be2c64fd4b732a5505bdac31869712a
Gerrit-Change-Number: 18178
Gerrit-PatchSet: 23
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-10992 Planner changes for estimate peak memory

2022-03-10 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18178 )

Change subject: IMPALA-10992 Planner changes for estimate peak memory
..


Patch Set 23: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/18178
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I75cf17290be2c64fd4b732a5505bdac31869712a
Gerrit-Change-Number: 18178
Gerrit-PatchSet: 23
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Thu, 10 Mar 2022 17:24:09 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10992 Planner changes for estimate peak memory

2022-03-08 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18178 )

Change subject: IMPALA-10992 Planner changes for estimate peak memory
..


Patch Set 19: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/18178
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I75cf17290be2c64fd4b732a5505bdac31869712a
Gerrit-Change-Number: 18178
Gerrit-PatchSet: 19
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Tue, 08 Mar 2022 17:10:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10992 Planner changes for estimate peak memory

2022-03-04 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18178 )

Change subject: IMPALA-10992 Planner changes for estimate peak memory
..


Patch Set 17: Code-Review+1

Looks good to me. Can you check with Kurt if he wants to do another review, 
otherwise I can give a +2.
Also, just curious if running all tests with 2 group sets caused a significant 
increase in core or exhaustive test run time. If it is quite significant, we 
might want to rethink that strategy


--
To view, visit http://gerrit.cloudera.org:8080/18178
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I75cf17290be2c64fd4b732a5505bdac31869712a
Gerrit-Change-Number: 18178
Gerrit-PatchSet: 17
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Sat, 05 Mar 2022 02:45:13 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10999 Flakiness in TestAsyncLoadData.test async load

2022-03-04 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18268 )

Change subject: IMPALA-10999 Flakiness in TestAsyncLoadData.test_async_load
..


Patch Set 3: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/18268
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic2ac954b0494b7413ce0ec405718fcc354dba9e0
Gerrit-Change-Number: 18268
Gerrit-PatchSet: 3
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Comment-Date: Fri, 04 Mar 2022 22:25:29 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10992 Planner changes for estimate peak memory

2022-03-04 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18178 )

Change subject: IMPALA-10992 Planner changes for estimate peak memory
..


Patch Set 16:

(6 comments)

Just a few more nits, everything else looks good

http://gerrit.cloudera.org:8080/#/c/18178/16/common/thrift/Frontend.thrift
File common/thrift/Frontend.thrift:

http://gerrit.cloudera.org:8080/#/c/18178/16/common/thrift/Frontend.thrift@734
PS16, Line 734: threshold
nit: wondering if it makes sense to just call it max_mem_limit.
This would make the variable name self explanatory


http://gerrit.cloudera.org:8080/#/c/18178/13/fe/src/main/java/org/apache/impala/service/Frontend.java
File fe/src/main/java/org/apache/impala/service/Frontend.java:

http://gerrit.cloudera.org:8080/#/c/18178/13/fe/src/main/java/org/apache/impala/service/Frontend.java@1799
PS13, Line 1799: ExecutorGrou
> query_exec_request.query_ctx.request_pool is set after we have decided the
got it, thanks for the explanation


http://gerrit.cloudera.org:8080/#/c/18178/13/fe/src/main/java/org/apache/impala/service/Frontend.java@1937
PS13, Line 1937: req.query_exec_request != nu
> This is just a preventive check.
Can this be converted into a Preconditions.check then?


http://gerrit.cloudera.org:8080/#/c/18178/16/fe/src/main/java/org/apache/impala/service/Frontend.java
File fe/src/main/java/org/apache/impala/service/Frontend.java:

http://gerrit.cloudera.org:8080/#/c/18178/16/fe/src/main/java/org/apache/impala/service/Frontend.java@1756
PS16, Line 1756: A group set is considered useless if its name is not a suffix
   :* of 'request_pool'.
nit: mentioned the default executor group case as an exception


http://gerrit.cloudera.org:8080/#/c/18178/16/fe/src/main/java/org/apache/impala/service/Frontend.java@1784
PS16, Line 1784: 64*MEGABYTE
nit: wondering if this is low enough to always be rejected


http://gerrit.cloudera.org:8080/#/c/18178/16/fe/src/main/java/org/apache/impala/service/Frontend.java@1787
PS16, Line 1787: "large"
nit: would be good to also retain the name if one exists. Otherwise this would 
be set as a pool name which might get confusing if something else is expected



--
To view, visit http://gerrit.cloudera.org:8080/18178
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I75cf17290be2c64fd4b732a5505bdac31869712a
Gerrit-Change-Number: 18178
Gerrit-PatchSet: 16
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 04 Mar 2022 18:41:35 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10999 Flakiness in TestAsyncLoadData.test async load

2022-03-03 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18268 )

Change subject: IMPALA-10999 Flakiness in TestAsyncLoadData.test_async_load
..


Patch Set 2:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/18268/2/tests/metadata/test_load.py
File tests/metadata/test_load.py:

http://gerrit.cloudera.org:8080/#/c/18268/2/tests/metadata/test_load.py@131
PS2, Line 131: test_async_load
nit: can you add a comment on what this test is verifying? Is the intent to 
just do a sanity check or to also verify that 2 different behaviors are 
expected from sync and async mode?


http://gerrit.cloudera.org:8080/#/c/18268/2/tests/metadata/test_load.py@199
PS2, Line 199: exec_end_state == running_state or exec_end_state == 
finished_state
was the original intent of this test to ensure different behaviour? that is, 
sync => finished state, async => running state ?
Since sync can now return either, do we need to check any other metric or 
behavior to verify the difference between the two?



--
To view, visit http://gerrit.cloudera.org:8080/18268
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic2ac954b0494b7413ce0ec405718fcc354dba9e0
Gerrit-Change-Number: 18268
Gerrit-PatchSet: 2
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Comment-Date: Thu, 03 Mar 2022 19:35:36 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10992 Planner changes for estimate peak memory

2022-02-25 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18178 )

Change subject: IMPALA-10992 Planner changes for estimate peak memory
..


Patch Set 13:

(16 comments)

http://gerrit.cloudera.org:8080/#/c/18178/13//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/18178/13//COMMIT_MSG@9
PS13, Line 9: executor groups
nit: multiple executor group sets.

Change mentions of executor group to executor group sets where applicable in 
the commit message. This is to stay consistent with the distinction between 
executor groups vs group sets


http://gerrit.cloudera.org:8080/#/c/18178/13//COMMIT_MSG@74
PS13, Line 74:  Almost all FE and BE tests are now run in the artificial two
 : executor setup except a few where a specific cluster 
configuration
 : is desirable;
Please see my comment in Frontend.java about how we can ensure re-planning 
code-path gets tested while ensuring the actual group set gets used.


http://gerrit.cloudera.org:8080/#/c/18178/13/common/thrift/Frontend.thrift
File common/thrift/Frontend.thrift:

http://gerrit.cloudera.org:8080/#/c/18178/13/common/thrift/Frontend.thrift@729
PS13, Line 729: // The optional threshold for queries to run in an executor 
group
nit: can you provide more context here as to what this threshold signifies, 
like which resource is represents and how it helps the planner make decisions.


http://gerrit.cloudera.org:8080/#/c/18178/13/fe/src/main/java/org/apache/impala/service/Frontend.java
File fe/src/main/java/org/apache/impala/service/Frontend.java:

http://gerrit.cloudera.org:8080/#/c/18178/13/fe/src/main/java/org/apache/impala/service/Frontend.java@265
PS13, Line 265: ///
  : // BEGIN: Data members used by auto-scaling
nit: would it make sense to put this inside a separate
State class?


http://gerrit.cloudera.org:8080/#/c/18178/13/fe/src/main/java/org/apache/impala/service/Frontend.java@284
PS13, Line 284: // The stmt table cache to be used by all iterations of 
auto-scaling compilation.
nit: mention when it is set and when can it be reset


http://gerrit.cloudera.org:8080/#/c/18178/13/fe/src/main/java/org/apache/impala/service/Frontend.java@335
PS13, Line 335: auto-scaling
nit: not sure auto-scaling is the right term here, since we are not scaling 
anything but rather assigning a query to the right group. Open to suggestions 
for more semantically appropriate terms.


http://gerrit.cloudera.org:8080/#/c/18178/13/fe/src/main/java/org/apache/impala/service/Frontend.java@1757
PS13, Line 1757: removes useless groups from it and fills the
   :* threshold field
nit: would be good to explain why a group can be classified as useless and how 
the threshold is evaluated


http://gerrit.cloudera.org:8080/#/c/18178/13/fe/src/main/java/org/apache/impala/service/Frontend.java@1776
PS13, Line 1776:  ExecutorMembershipSnapshot cluster = 
ExecutorMembershipSnapshot.getCluster();
   : int num_nodes = cluster.numExecutors();
   : // Form a two-executor group testing environment so 
that we can exercise
   : // auto-scaling logic (see getTExecRequest() in 
Frontend.java).
   : TExecutorGroupSet r = new TExecutorGroupSet(num_nodes, 
num_nodes, "regular");
   : r.setThreshold(64*MEGABYTE);
   : result.add(r);
   : TExecutorGroupSet l = new TExecutorGroupSet(num_nodes, 
num_nodes, "large");
   : l.setThreshold(Long.MAX_VALUE);
   : result.add(l);
if we want to emulate a 2 exec group set configuration, what if we only add a 
dummy executor group set which will always be rejected (i think its the 
'regular' one in this case) and the other one as the actual group set 'e = 
executorGroupSets.get(0)'. Otherwise this can get very confusing if the someone 
is running a test where test_replan is true by default (and is not aware of 
this implication) on a different configuration but the plan always gets created 
for a 3 node  group set. This can especially be true for custom cluster tests.


http://gerrit.cloudera.org:8080/#/c/18178/13/fe/src/main/java/org/apache/impala/service/Frontend.java@1788
PS13, Line 1788: executorGroupSets.get(0)
nit: can just use 'e' here too.


http://gerrit.cloudera.org:8080/#/c/18178/13/fe/src/main/java/org/apache/impala/service/Frontend.java@1799
PS13, Line 1799: request_pool
if we use query_exec_request.query_ctx.request_pool instead of 
queryOptions.getRequest_pool(), that should have the resolved request pool that 
always has the "root." prefix


http://gerrit.cloudera.org:8080/#/c/18178/13/fe/src/main/java/org/apache/impala/service/Frontend.java@1816
PS13, Line 1816: throw new AnalysisException("Request pool '" + request_pool
   :   + "' does not name

[Impala-ASF-CR] IMPALA-10992 Planner changes for estimate peak memory - v1

2022-02-24 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18143 )

Change subject: IMPALA-10992 Planner changes for estimate peak memory - v1
..


Patch Set 29:

(20 comments)

http://gerrit.cloudera.org:8080/#/c/18143/29//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/18143/29//COMMIT_MSG@9
PS29, Line 9: a set of executor
: groups
nit: multiple executor group sets.

Change mentions of executor group to executor group sets where applicable in 
the commit message. This is to stay consistent with the distinction between 
executor groups vs group sets


http://gerrit.cloudera.org:8080/#/c/18143/29//COMMIT_MSG@25
PS29, Line 25: To facilitate testing, the patch imposes an artificial two 
executor
 : group setup in FE. This setup is enabled when 'enable_replan' is 
set
 : to 3 (testing) or RuntimeEnv.INSTANCE.isTestEnv() is true as in 
most
 : frontend tests. The artificial two executor groups are 
configured as
 : follows.
Does this mean all tests are run in this configuration? All planner tests and 
end2end pytest are currently based on a minicluster or a custom cluster to 
emulate an actual cluster.

Please see my comment in Frontend.java about how we can ensure re-planning 
code-path gets tested while ensuring the actual group set gets used.


http://gerrit.cloudera.org:8080/#/c/18143/29/common/thrift/Frontend.thrift
File common/thrift/Frontend.thrift:

http://gerrit.cloudera.org:8080/#/c/18143/29/common/thrift/Frontend.thrift@729
PS29, Line 729: The optional threshold for queries to run in an executor group
nit: can you provide more context here as to what this threshold signifies, 
like which resource is represents and how it helps the planner make decisions.


http://gerrit.cloudera.org:8080/#/c/18143/29/common/thrift/Query.thrift
File common/thrift/Query.thrift:

http://gerrit.cloudera.org:8080/#/c/18143/29/common/thrift/Query.thrift@585
PS29, Line 585: TEnableReplanMode.TESTING
shouldn't this be set to ON by default as per the comment?


http://gerrit.cloudera.org:8080/#/c/18143/29/fe/src/main/java/org/apache/impala/service/Frontend.java
File fe/src/main/java/org/apache/impala/service/Frontend.java:

http://gerrit.cloudera.org:8080/#/c/18143/29/fe/src/main/java/org/apache/impala/service/Frontend.java@267
PS29, Line 267: ///
  : // BEGIN: Data members used by auto-scaling
nit: would it make sense to put this inside a separate
State class?


http://gerrit.cloudera.org:8080/#/c/18143/29/fe/src/main/java/org/apache/impala/service/Frontend.java@286
PS29, Line 286: // The stmt table cache to be used by all iterations of 
auto-scaling compilation.
nit: mention when it is set and when can it be reset


http://gerrit.cloudera.org:8080/#/c/18143/29/fe/src/main/java/org/apache/impala/service/Frontend.java@339
PS29, Line 339: auto-scaling
nit: not sure auto-scaling is the right term here, since we are not scaling 
anything but rather assigning a query to the right group. Open to suggestions 
for more semantically appropriate terms.


http://gerrit.cloudera.org:8080/#/c/18143/29/fe/src/main/java/org/apache/impala/service/Frontend.java@1763
PS29, Line 1763: setupThresholdsForGroupSets
nit: add a comment about what it does


http://gerrit.cloudera.org:8080/#/c/18143/29/fe/src/main/java/org/apache/impala/service/Frontend.java@1780
PS29, Line 1780: enable_replan == TEnableReplanMode.TESTING
nit: also might be worth logging this while planning


http://gerrit.cloudera.org:8080/#/c/18143/29/fe/src/main/java/org/apache/impala/service/Frontend.java@1787
PS29, Line 1787: TExecutorGroupSet l = new TExecutorGroupSet(3, 3, 
"large");
   : l.setThreshold(Long.MAX_VALUE);
   : result.add(l);
if we want to emulate a 2 exec group set configuration, what if we only add a 
dummy executor group set which will always be rejected (i think its the 
'regular' one in this case) and the other one as the actual group set 'e = 
executorGroupSets.get(0)'. Otherwise this can get very confusing if the someone 
is running a test (and is not aware of this) on a different configuration but 
the plan always gets created for a 3 node  group set.


http://gerrit.cloudera.org:8080/#/c/18143/29/fe/src/main/java/org/apache/impala/service/Frontend.java@1792
PS29, Line 1792: executorGroupSets.get(0)
nit: can just use 'e' here too.


http://gerrit.cloudera.org:8080/#/c/18143/29/fe/src/main/java/org/apache/impala/service/Frontend.java@1801
PS29, Line 1801: request_pool
if we use query_exec_request.query_ctx.request_pool instead of 
queryOptions.getRequest_pool(), that should have the resolved request pool that 
always has the "root." prefix


http://gerrit.cloudera.org:8080/#/c/18143/29/fe/src/main/java/org/apache/impala/service/Frontend.java@1802
PS29, Line 1802: if (request_pool != nu

[Impala-ASF-CR] IMPALA-11063: Add metrics to expose state of each executor group set

2022-01-12 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18142 )

Change subject: IMPALA-11063: Add metrics to expose state of each executor 
group set
..


Patch Set 2:

(8 comments)

http://gerrit.cloudera.org:8080/#/c/18142/1/be/src/scheduling/cluster-membership-mgr.h
File be/src/scheduling/cluster-membership-mgr.h:

http://gerrit.cloudera.org:8080/#/c/18142/1/be/src/scheduling/cluster-membership-mgr.h@248
PS1, Line 248: aggregated_group_set_m
> nit. May name it as overall_group_set_metrics_ to keep it consistent with p
Done


http://gerrit.cloudera.org:8080/#/c/18142/1/be/src/scheduling/cluster-membership-mgr.cc
File be/src/scheduling/cluster-membership-mgr.cc:

http://gerrit.cloudera.org:8080/#/c/18142/1/be/src/scheduling/cluster-membership-mgr.cc@64
PS1, Line 64: static const string LIVE_EXEC_GROUP_KEY_FORMAT(
> Nit: This is not incorrect, but "cluster-membership.executor-groups.total.$
Done


http://gerrit.cloudera.org:8080/#/c/18142/1/be/src/scheduling/cluster-membership-mgr.cc@89
PS1, Line 89: MetricGroup* metric_grp = 
metrics->GetOrCreateChildGroup("cluster-membership");
:   aggregated_group_set_metrics_.total_live_executor_groups_ =
:   metric_grp->AddCounter(LIVE_EXEC_GROUP_KEY, 0);
:   aggregated_group_set_metrics_.total_healthy_executor_groups_ =
:   metric_grp->AddCounter(HEALTHY_EXEC_GROUP_KEY, 0);
> nit.these lines of code can be moved to a method GroupSetMetrics::Add(Metri
The keys and inputs to AddCounter() are different for 'overall' vs 
'per_group_set' so the special casing for both would end up with the same 
structure as InitMetrics() here.


http://gerrit.cloudera.org:8080/#/c/18142/1/be/src/scheduling/cluster-membership-mgr.cc@606
PS1, Line 606: ngPiece name(grou
> I wonder if there is a method ExecutorGroup::isLive(). If so, it can be use
group.NumHosts() > 0 is the only simple check we need unlike IsHealthy() which 
needs to know the 'min_size' of the exec group as well (where 'min size' is the 
minimum number of executors in the group to be considered healthy)


http://gerrit.cloudera.org:8080/#/c/18142/1/be/src/scheduling/cluster-membership-mgr.cc@611
PS1, Line 611:
 :   if (group.NumHosts() > 0) {
> nit. duplicated lines with the THEN branch.
Good point. Done.


http://gerrit.cloudera.org:8080/#/c/18142/1/tests/custom_cluster/test_executor_groups.py
File tests/custom_cluster/test_executor_groups.py:

http://gerrit.cloudera.org:8080/#/c/18142/1/tests/custom_cluster/test_executor_groups.py@134
PS1, Line 134: metric_name = "cluster-membership.executor-groups.total"
> This is where the code I marked with a Nit is actually useful
Done


http://gerrit.cloudera.org:8080/#/c/18142/1/tests/custom_cluster/test_executor_groups.py@685
PS1, Line 685: c
> flake8: E251 unexpected spaces around keyword / parameter equals
Done


http://gerrit.cloudera.org:8080/#/c/18142/1/tests/custom_cluster/test_executor_groups.py@694
PS1, Line 694: _
> flake8: E251 unexpected spaces around keyword / parameter equals
Done



--
To view, visit http://gerrit.cloudera.org:8080/18142
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib39f940de830ef6302785aee30eeb847fa5deeba
Gerrit-Change-Number: 18142
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Thu, 13 Jan 2022 01:31:03 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11063: Add metrics to expose state of each executor group set

2022-01-12 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Qifan Chen, Wenzhe Zhou, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18142

to look at the new patch set (#2).

Change subject: IMPALA-11063: Add metrics to expose state of each executor 
group set
..

IMPALA-11063: Add metrics to expose state of each executor group set

This adds metrics for each executor group set that expose the number
of executor groups, the number of healthy executor groups and the
total number of backends associated with that group set.

Testing:
Added an e2e test to verify metrics are updated correctly.

Change-Id: Ib39f940de830ef6302785aee30eeb847fa5deeba
---
M be/src/scheduling/cluster-membership-mgr.cc
M be/src/scheduling/cluster-membership-mgr.h
M common/thrift/metrics.json
M tests/custom_cluster/test_executor_groups.py
4 files changed, 172 insertions(+), 29 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/42/18142/2
--
To view, visit http://gerrit.cloudera.org:8080/18142
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ib39f940de830ef6302785aee30eeb847fa5deeba
Gerrit-Change-Number: 18142
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-11063: Add metrics to expose state of each executor group set

2022-01-11 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/18142


Change subject: IMPALA-11063: Add metrics to expose state of each executor 
group set
..

IMPALA-11063: Add metrics to expose state of each executor group set

This adds metrics for each executor group set that expose the number
of executor groups, the number of healthy executor groups and the
total number of backends associated with that group set.

Testing:
Added an e2e test to verify metrics are updated correctly.

Change-Id: Ib39f940de830ef6302785aee30eeb847fa5deeba
---
M be/src/scheduling/cluster-membership-mgr.cc
M be/src/scheduling/cluster-membership-mgr.h
M common/thrift/metrics.json
M tests/custom_cluster/test_executor_groups.py
4 files changed, 162 insertions(+), 27 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/42/18142/1
--
To view, visit http://gerrit.cloudera.org:8080/18142
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ib39f940de830ef6302785aee30eeb847fa5deeba
Gerrit-Change-Number: 18142
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 


[Impala-ASF-CR] IMPALA-11068: Add tuning flag to reduce scanner thread launch.

2022-01-07 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18126 )

Change subject: IMPALA-11068: Add tuning flag to reduce scanner thread launch.
..


Patch Set 1:

Should we look into making the constant COMPRESSED_TEXT_COMPRESSION_RATIO 
tunable in hdfs-scan-node.cc instead?


--
To view, visit http://gerrit.cloudera.org:8080/18126
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I03cadf1230eed00d69f2890c82476c6861e37466
Gerrit-Change-Number: 18126
Gerrit-PatchSet: 1
Gerrit-Owner: Riza Suminto 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Comment-Date: Sat, 08 Jan 2022 03:26:27 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11033: Add support for specifying multiple executor group sets

2022-01-07 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18093 )

Change subject: IMPALA-11033: Add support for specifying multiple executor 
group sets
..


Patch Set 9: Code-Review+2

Saw 3 tests fail in the GVO, fixed the one that was related, others are 
unrelated flaky tests. IMPALA-10983, IMPALA-10927


--
To view, visit http://gerrit.cloudera.org:8080/18093
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
Gerrit-Change-Number: 18093
Gerrit-PatchSet: 9
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Sat, 08 Jan 2022 02:46:52 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11033: Add support for specifying multiple executor group sets

2022-01-07 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Qifan Chen, Kurt Deschler, Wenzhe Zhou, Impala Public 
Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18093

to look at the new patch set (#9).

Change subject: IMPALA-11033: Add support for specifying multiple executor 
group sets
..

IMPALA-11033: Add support for specifying multiple executor group sets

This patch introduces the concept of executor group sets. Each group
set specifies an executor group name prefix and an expected group
size (the number of executors in each group). Every executor group
that is a part of this set will have the same prefix which will
also be equivalent to the resource pool name that it maps to.
These sets are specified via a startup flag
'expected_executor_group_sets' which is a comma separated list in
the format :.

Testing:
- Added unit tests

Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
---
M be/src/runtime/exec-env.cc
M be/src/scheduling/CMakeLists.txt
M be/src/scheduling/cluster-membership-mgr-test.cc
M be/src/scheduling/cluster-membership-mgr.cc
M be/src/scheduling/cluster-membership-mgr.h
M be/src/service/impala-hs2-server.cc
M be/src/service/impala-server.cc
M common/thrift/Frontend.thrift
M fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java
M fe/src/test/java/org/apache/impala/planner/ClusterSizeTest.java
M fe/src/test/java/org/apache/impala/planner/PlannerTestBase.java
M tests/custom_cluster/test_executor_groups.py
12 files changed, 450 insertions(+), 67 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/93/18093/9
--
To view, visit http://gerrit.cloudera.org:8080/18093
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
Gerrit-Change-Number: 18093
Gerrit-PatchSet: 9
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-11033: Add support for specifying multiple executor group sets

2022-01-07 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18093 )

Change subject: IMPALA-11033: Add support for specifying multiple executor 
group sets
..


Patch Set 8: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/18093
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
Gerrit-Change-Number: 18093
Gerrit-PatchSet: 8
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 07 Jan 2022 18:09:26 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11033: Add support for specifying multiple executor group sets

2021-12-22 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18093 )

Change subject: IMPALA-11033: Add support for specifying multiple executor 
group sets
..


Patch Set 5:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/18093/5/be/src/scheduling/cluster-membership-mgr-test.cc
File be/src/scheduling/cluster-membership-mgr-test.cc:

http://gerrit.cloudera.org:8080/#/c/18093/5/be/src/scheduling/cluster-membership-mgr-test.cc@659
PS5, Line 659: EXPECT_EQ(update_req.exec_group_sets.size(), 1);
 : EXPECT_EQ(update_req.exec_group_sets[0].curr_num_executors, 
1);
 : 
EXPECT_EQ(update_req.exec_group_sets[0].expected_num_executors, 20);
 : 
EXPECT_EQ(update_req.exec_group_sets[0].exec_group_name_prefix, "");
> not really. I think adding some extra assertions in this test is sufficient
Those methods are part of ExecutorMembershipSnapshot which is
a class in java and cannot be used here in a c++ unit test.


http://gerrit.cloudera.org:8080/#/c/18093/5/be/src/scheduling/cluster-membership-mgr.h
File be/src/scheduling/cluster-membership-mgr.h:

http://gerrit.cloudera.org:8080/#/c/18093/5/be/src/scheduling/cluster-membership-mgr.h@222
PS5, Line 222: expected_exec_group_sets
> In this patch we have two objects defined: TExecutorGroupSets and ExpectedE
agreed, changing both to TExecutorGroupSets.

to provide more context, my initial attempt for this patch had used 
TExecutorGroupSets in both places and the only reason i switched to using 
standard lib classes was to avoid including a larger header file 
(gen-cpp/Frontend_types.h) in cluster-membership-mgr.h since 
cluster-membership-mgr is itself included in a bunch of places and could 
potentially inflate build times



--
To view, visit http://gerrit.cloudera.org:8080/18093
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
Gerrit-Change-Number: 18093
Gerrit-PatchSet: 5
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Wed, 22 Dec 2021 21:14:28 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11033: Add support for specifying multiple executor group sets

2021-12-22 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Qifan Chen, Kurt Deschler, Wenzhe Zhou, Impala Public 
Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18093

to look at the new patch set (#7).

Change subject: IMPALA-11033: Add support for specifying multiple executor 
group sets
..

IMPALA-11033: Add support for specifying multiple executor group sets

This patch introduces the concept of executor group sets. Each group
set specifies an executor group name prefix and an expected group
size (the number of executors in each group). Every executor group
that is a part of this set will have the same prefix which will
also be equivalent to the resource pool name that it maps to.
These sets are specified via a startup flag
'expected_executor_group_sets' which is a comma separated list in
the format :.

Testing:
- Added unit tests

Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
---
M be/src/runtime/exec-env.cc
M be/src/scheduling/CMakeLists.txt
M be/src/scheduling/cluster-membership-mgr-test.cc
M be/src/scheduling/cluster-membership-mgr.cc
M be/src/scheduling/cluster-membership-mgr.h
M be/src/service/impala-hs2-server.cc
M be/src/service/impala-server.cc
M common/thrift/Frontend.thrift
M fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java
M fe/src/test/java/org/apache/impala/planner/ClusterSizeTest.java
M fe/src/test/java/org/apache/impala/planner/PlannerTestBase.java
11 files changed, 447 insertions(+), 64 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/93/18093/7
--
To view, visit http://gerrit.cloudera.org:8080/18093
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
Gerrit-Change-Number: 18093
Gerrit-PatchSet: 7
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-11033: Add support for specifying multiple executor group sets

2021-12-21 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18093 )

Change subject: IMPALA-11033: Add support for specifying multiple executor 
group sets
..


Patch Set 5:

(10 comments)

http://gerrit.cloudera.org:8080/#/c/18093/5//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/18093/5//COMMIT_MSG@11
PS5, Line 11: size(the number of executors in each group). Every executor group
> Nit: need space after size
Done


http://gerrit.cloudera.org:8080/#/c/18093/5/be/src/scheduling/cluster-membership-mgr-test.cc
File be/src/scheduling/cluster-membership-mgr-test.cc:

http://gerrit.cloudera.org:8080/#/c/18093/5/be/src/scheduling/cluster-membership-mgr-test.cc@573
PS5, Line 573: first
> nit. Can we add 'group_set_name' and 'expected_size' to ExpectedExecGroupSe
just to clarify, the ExpectedExecGroupSets is actually a typedef of 
vector> which is where the 'first' and 'second' accessors 
come from (std::pair<>).
(https://gerrit.cloudera.org/#/c/18093/5/be/src/scheduling/cluster-membership-mgr.h)

To be able to name the member variables I would have to create a new struct 
definition. Let me know if you would strongly prefer to do the alternative.


http://gerrit.cloudera.org:8080/#/c/18093/5/be/src/scheduling/cluster-membership-mgr-test.cc@588
PS5, Line 588:   // increasing order
> nit. of expected group size.
Done


http://gerrit.cloudera.org:8080/#/c/18093/5/be/src/scheduling/cluster-membership-mgr-test.cc@615
PS5, Line 615: 4
> nit. duplicated case number (@587).
Done


http://gerrit.cloudera.org:8080/#/c/18093/5/be/src/scheduling/cluster-membership-mgr-test.cc@631
PS5, Line 631:  Invalid input with no colon separator
> nit title is not right.
Done


http://gerrit.cloudera.org:8080/#/c/18093/5/be/src/scheduling/cluster-membership-mgr-test.cc@659
PS5, Line 659: EXPECT_EQ(update_req.exec_group_sets.size(), 1);
 : EXPECT_EQ(update_req.exec_group_sets[0].curr_num_executors, 
1);
 : 
EXPECT_EQ(update_req.exec_group_sets[0].expected_num_executors, 20);
 : 
EXPECT_EQ(update_req.exec_group_sets[0].exec_group_name_prefix, "");
> Can we add some assertions on methods numExecutors() and contains() for all
just to clarify, do you want me to add separate JUnit tests that mimic these 
cases?


http://gerrit.cloudera.org:8080/#/c/18093/5/be/src/scheduling/cluster-membership-mgr.cc
File be/src/scheduling/cluster-membership-mgr.cc:

http://gerrit.cloudera.org:8080/#/c/18093/5/be/src/scheduling/cluster-membership-mgr.cc@622
PS5, Line 622:   DCHECK(!prefix.empty() || exec_group_sets.size() == 1)
> Should this be
exec_group_sets.size() == 1 is there to ensure that if prefix is empty then it 
should only contain a single exec_group_set that is added in L613


http://gerrit.cloudera.org:8080/#/c/18093/5/be/src/scheduling/cluster-membership-mgr.cc@629
PS5, Line 629: if (!it.second.IsHealthy()) continue;
> Is this interesting enough to warrant a log message?
if a group is not healthy it, we have visibility over it as it will show up on 
the metrics so I felt logging a warning here might be an overkill.
this is why the way matching_exec_groups_found is incremented, it always 
accounts for a group regardless of whether it is healthy or not, so an 
unhealthy group will not be logged below


http://gerrit.cloudera.org:8080/#/c/18093/5/fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java
File fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java:

http://gerrit.cloudera.org:8080/#/c/18093/5/fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java@97
PS5, Line 97:  String host = address.getHostname();
> Return false when exec_group_sets.size() > 0?
exec_group_sets will always be non empty for any kind of update. If using 
'default' exec group, the addresses will be populated and we should return 
those, but if using non-default exec groups this 'address' will be empty and 
false will be returned


http://gerrit.cloudera.org:8080/#/c/18093/5/fe/src/test/java/org/apache/impala/planner/ClusterSizeTest.java
File fe/src/test/java/org/apache/impala/planner/ClusterSizeTest.java:

http://gerrit.cloudera.org:8080/#/c/18093/5/fe/src/test/java/org/apache/impala/planner/ClusterSizeTest.java@61
PS5, Line 61: int num
> nit. Probably it is better to use two arguments (int expected_num_executors
Done



--
To view, visit http://gerrit.cloudera.org:8080/18093
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
Gerrit-Change-Number: 18093
Gerrit-PatchSet: 5
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comme

[Impala-ASF-CR] IMPALA-11033: Add support for specifying multiple executor group sets

2021-12-21 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Qifan Chen, Kurt Deschler, Wenzhe Zhou, Impala Public 
Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18093

to look at the new patch set (#6).

Change subject: IMPALA-11033: Add support for specifying multiple executor 
group sets
..

IMPALA-11033: Add support for specifying multiple executor group sets

This patch introduces the concept of executor group sets. Each group
set specifies an executor group name prefix and an expected group
size (the number of executors in each group). Every executor group
that is a part of this set will have the same prefix which will
also be equivalent to the resource pool name that it maps to.
These sets are specified via a startup flag
'expected_executor_group_sets' which is a comma separated list in
the format :.

Testing:
- Added unit tests

Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
---
M be/src/runtime/exec-env.cc
M be/src/scheduling/CMakeLists.txt
M be/src/scheduling/cluster-membership-mgr-test.cc
M be/src/scheduling/cluster-membership-mgr.cc
M be/src/scheduling/cluster-membership-mgr.h
M be/src/service/impala-hs2-server.cc
M be/src/service/impala-server.cc
M common/thrift/Frontend.thrift
M fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java
M fe/src/test/java/org/apache/impala/planner/ClusterSizeTest.java
M fe/src/test/java/org/apache/impala/planner/PlannerTestBase.java
11 files changed, 447 insertions(+), 63 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/93/18093/6
--
To view, visit http://gerrit.cloudera.org:8080/18093
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
Gerrit-Change-Number: 18093
Gerrit-PatchSet: 6
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-11033: Add support for specifying multiple executor group sets

2021-12-17 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18093 )

Change subject: IMPALA-11033: Add support for specifying multiple executor 
group sets
..


Patch Set 5:

(14 comments)

http://gerrit.cloudera.org:8080/#/c/18093/3/be/src/scheduling/cluster-membership-mgr-test.cc
File be/src/scheduling/cluster-membership-mgr-test.cc:

http://gerrit.cloudera.org:8080/#/c/18093/3/be/src/scheduling/cluster-membership-mgr-test.cc@554
PS3, Line 554: /// This tests various valid and invalid cases that the parsing 
logic in
> Add a description comment
Done


http://gerrit.cloudera.org:8080/#/c/18093/3/be/src/scheduling/cluster-membership-mgr-test.cc@638
PS3, Line 638: }
> Add a description of the test
Done


http://gerrit.cloudera.org:8080/#/c/18093/3/be/src/scheduling/cluster-membership-mgr-test.cc@644
PS3, Line 644:   gflags::FlagSaver saver;
> This is my problem but when I see 'expected' in a test I think it means a v
Done


http://gerrit.cloudera.org:8080/#/c/18093/3/be/src/scheduling/cluster-membership-mgr-test.cc@664
PS3, Line 664:   }
> This sets the flag for all subsequent tests?
forgot to remove that, removed


http://gerrit.cloudera.org:8080/#/c/18093/3/be/src/scheduling/cluster-membership-mgr-test.cc@682
PS3, Line 682: ExecutorGroup exec_group("foo-group1", 1);
> Nit: with more executors?
Done


http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr.cc
File be/src/scheduling/cluster-membership-mgr.cc:

http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr.cc@685
PS2, Line 685:  parsed_group_prefixes.insert(group_prefix);
> Since FE will use ExecutorMembershipSnapshot.exec_group_sets to generate se
If we are planning to base the threshold on the resource pool configs then that 
information can change dynamically whereas an update to FE only happens when 
there is a change in the cluster state. Considering that, it would be better to 
fetch the latest resource config while planning and letting the planner arrange 
the list as it sees fit.


http://gerrit.cloudera.org:8080/#/c/18093/3/be/src/scheduling/cluster-membership-mgr.cc
File be/src/scheduling/cluster-membership-mgr.cc:

http://gerrit.cloudera.org:8080/#/c/18093/3/be/src/scheduling/cluster-membership-mgr.cc@585
PS3, Line 585: /// include the hostnames and IP addresses in the update to the 
frontend. For non-default
> Nit: addresses in
Done


http://gerrit.cloudera.org:8080/#/c/18093/3/be/src/scheduling/cluster-membership-mgr.cc@586
PS3, Line 586: /// executor groups, we assume that we will read data remotely 
and will only send the
> Nit: will read
Done


http://gerrit.cloudera.org:8080/#/c/18093/3/be/src/scheduling/cluster-membership-mgr.cc@588
PS3, Line 588: /// specified we apply the aforementioned steps for each group 
set.
> Nit: steps for
Done


http://gerrit.cloudera.org:8080/#/c/18093/3/be/src/scheduling/cluster-membership-mgr.cc@646
PS3, Line 646:   LOG(WARNING) << "Some executor groups either do not match 
expected group sets or "
> I wonder if this should be a WARNING level log?
Done


http://gerrit.cloudera.org:8080/#/c/18093/3/be/src/service/impala-server.cc
File be/src/service/impala-server.cc:

http://gerrit.cloudera.org:8080/#/c/18093/3/be/src/service/impala-server.cc@291
PS3, Line 291: "'expected_executor_group_sets' which is a more expressive 
way of specifying "
> Maybe explain here how this overrides the value from num_expected_executors
Done


http://gerrit.cloudera.org:8080/#/c/18093/3/be/src/service/impala-server.cc@297
PS3, Line 297: "For eg. “prefix1:10”, this set will include executor groups 
named like "
> Is there any warning printed in this case?
yes, as per your previous suggestion, this will now be logged at warning level 
instead of info level


http://gerrit.cloudera.org:8080/#/c/18093/2/fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java
File fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java:

http://gerrit.cloudera.org:8080/#/c/18093/2/fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java@71
PS2, Line 71: exec_group_sets
> I think the only missing piece in TExecutorGroupSet is the threshold. If FE
The resource config file is on the local file system which is loaded, parsed 
and cached into memory. Only changes to the local file trigger a change to the 
cached config. So any number of fetches for the configs should be quite fast.


http://gerrit.cloudera.org:8080/#/c/18093/3/fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java
File fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java:

http://gerrit.cloudera.org:8080/#/c/18093/3/fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java@71
PS3, Line 71:   private final List exec_group_sets_;
> Should this be exec_group_sets_ (i.e with a trailing underscore) ?
Done



--
To view, visit h

[Impala-ASF-CR] IMPALA-11033: Add support for specifying multiple executor group sets

2021-12-17 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Qifan Chen, Kurt Deschler, Wenzhe Zhou, Impala Public 
Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18093

to look at the new patch set (#5).

Change subject: IMPALA-11033: Add support for specifying multiple executor 
group sets
..

IMPALA-11033: Add support for specifying multiple executor group sets

This patch introduces the concept of executor group sets. Each group
set specifies an executor group name prefix and an expected group
size(the number of executors in each group). Every executor group
that is a part of this set will have the same prefix which will
also be equivalent to the resource pool name that it maps to.
These sets are specified via a startup flag
'expected_executor_group_sets' which is a comma separated list in
the format :.

Testing:
- Added unit tests

Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
---
M be/src/runtime/exec-env.cc
M be/src/scheduling/CMakeLists.txt
M be/src/scheduling/cluster-membership-mgr-test.cc
M be/src/scheduling/cluster-membership-mgr.cc
M be/src/scheduling/cluster-membership-mgr.h
M be/src/service/impala-hs2-server.cc
M be/src/service/impala-server.cc
M common/thrift/Frontend.thrift
M fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java
M fe/src/test/java/org/apache/impala/planner/ClusterSizeTest.java
M fe/src/test/java/org/apache/impala/planner/PlannerTestBase.java
11 files changed, 439 insertions(+), 58 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/93/18093/5
--
To view, visit http://gerrit.cloudera.org:8080/18093
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
Gerrit-Change-Number: 18093
Gerrit-PatchSet: 5
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-11033: Add support for specifying multiple executor group sets

2021-12-17 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Qifan Chen, Kurt Deschler, Wenzhe Zhou, Impala Public 
Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18093

to look at the new patch set (#4).

Change subject: IMPALA-11033: Add support for specifying multiple executor 
group sets
..

IMPALA-11033: Add support for specifying multiple executor group sets

This patch introduces the concept of executor group sets. Each group
set specifies an executor group name prefix and an expected group
size. Every executor group that is a part of this set will have the
same prefix which will also be equivalent to the resource pool name
that it maps to. These sets are specified via a startup flag
'expected_executor_group_sets' which is a comma separated list in
the format :.

Testing:
- Added unit tests

Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
---
M be/src/runtime/exec-env.cc
M be/src/scheduling/CMakeLists.txt
M be/src/scheduling/cluster-membership-mgr-test.cc
M be/src/scheduling/cluster-membership-mgr.cc
M be/src/scheduling/cluster-membership-mgr.h
M be/src/service/impala-hs2-server.cc
M be/src/service/impala-server.cc
M common/thrift/Frontend.thrift
M fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java
M fe/src/test/java/org/apache/impala/planner/ClusterSizeTest.java
M fe/src/test/java/org/apache/impala/planner/PlannerTestBase.java
11 files changed, 439 insertions(+), 58 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/93/18093/4
--
To view, visit http://gerrit.cloudera.org:8080/18093
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
Gerrit-Change-Number: 18093
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-11033: Add support for specifying multiple executor group sets

2021-12-15 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18093 )

Change subject: IMPALA-11033: Add support for specifying multiple executor 
group sets
..


Patch Set 3:

(14 comments)

http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/runtime/exec-env.cc
File be/src/runtime/exec-env.cc:

http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/runtime/exec-env.cc@178
PS2, Line 178: const ClusterMembershipMgr::ExpectedExecGroupSets& 
expected_exec_group_sets,
> typedef for ExecGroupSets
Done


http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr-test.cc
File be/src/scheduling/cluster-membership-mgr-test.cc:

http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr-test.cc@556
PS2, Line 556:   ClusterMembershipMgr::ExpectedExecGroupSets 
expected_exec_group_sets;
> use ExecGroupSets typedef here.
Done


http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr-test.cc@753
PS2, Line 753: EXPECT_EQ(update_req.exec_group_sets[0].curr_num_executors, 
1);
> negative test for duplicate names or invalid parameters?
The inputs for PopulateExecutorMembershipRequest are from the cluster snapshot 
and the ExpectedExecGroupSets which are already parsed and error checked, so 
none of those checks added for it.
However adding a test for duplicate check for TestPopulateExpectedExecGroupSets


http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr.h
File be/src/scheduling/cluster-membership-mgr.h:

http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr.h@173
PS2, Line 173:   /// Returns a pointer to the static empty group reserved for 
scheduling coord only
> use ExecGroupSets typedef
Done


http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr.h@218
PS2, Line 218:
> It will be nice if the current #executors is stored in this data structure.
That details is extracted from the cluster snapshot and populated in 
PopulateExecutorMembershipRequest


http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr.h@221
PS2, Line 221:   /// Parses the --expected_executor_group_sets startup flag and 
populates
> use ExecGroupSets typedef
Done


http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr.h@229
PS2, Line 229:
> use ExecGroupSets typedef
Done


http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr.h@229
PS2, Line 229:
> I wonder if we need to use 'expected' in the name. To me, the concept of a
Done


http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr.h@285
PS2, Line 285: /// The frontend uses cluster membership information to 
determine whether it expects the
> use ExecGroupSets typedef
Done


http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr.cc
File be/src/scheduling/cluster-membership-mgr.cc:

http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr.cc@685
PS2, Line 685:  parsed_group_prefixes.insert(group_prefix);
> How do we deal with a common situation where a small number of exec groups
For the first part of your comment: FE is sent the max size of an executor 
group among all groups in the set, so it would plan based off of that and does 
not need to consider individual groups in the set separately. (See 
PopulateExecutorMembershipRequest for how the update to FE is staged)


For the second part: This method is only used to parse the startup flag and 
does not have cluster's current state. The reason I chose to sort here by 
expected size is because in general multiple exec group sets would have a 
non-trivial difference in their sizes and a min executor count for a group 
which usually is set to 90% to be considered healthy will prevent situations 
where exec group sets switch places in this order when an update is sent to FE.
For, eg

set1 -> expected size 10 -> min executors to consider healthy: 9
set2 -> 20  -> min executors to consider healthy: 18


http://gerrit.cloudera.org:8080/#/c/18093/2/be/src/scheduling/cluster-membership-mgr.cc@688
PS2, Line 688:   Substitute("Invalid executor group set format: $0", 
group.ToString()));
> could either DCHECK first.second!=second.second or make strong order to han
added check for group prefix duplicates. However, allowing duplicates on the 
expected number as there can be a use case (though unlikely, for eg, 2 group 
sets with same group size but different defaults for resource configs like 
mt_dop).
Let me know if you think we should be more stringent on these allowances.


http://gerrit.cloudera.org:8080/#/c/18093/2/common/thrift/Frontend.thrift
File common/thrift/Frontend.thrift:

http://gerrit.cloudera.org:8080/#/c/18093/2/common/thrift/Frontend.thrift@716
PS2, Line 716: cur
> max or current?
updated the des

[Impala-ASF-CR] IMPALA-11033: Add support for specifying multiple executor group sets

2021-12-15 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Qifan Chen, Kurt Deschler, Wenzhe Zhou, Impala Public 
Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18093

to look at the new patch set (#3).

Change subject: IMPALA-11033: Add support for specifying multiple executor 
group sets
..

IMPALA-11033: Add support for specifying multiple executor group sets

This patch introduces the concept of executor group sets. Each group
set specifies an executor group name prefix and an expected group
size. Every executor group that is a part of this set will have the
same prefix which will also be equivalent to the resource pool name
that it maps to. These sets are specified via a startup flag
'expected_executor_group_sets' which is a comma separated list in
the format :.

Testing:
- Added unit tests

Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
---
M be/src/runtime/exec-env.cc
M be/src/scheduling/CMakeLists.txt
M be/src/scheduling/cluster-membership-mgr-test.cc
M be/src/scheduling/cluster-membership-mgr.cc
M be/src/scheduling/cluster-membership-mgr.h
M be/src/service/impala-hs2-server.cc
M be/src/service/impala-server.cc
M common/thrift/Frontend.thrift
M fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java
M fe/src/test/java/org/apache/impala/planner/ClusterSizeTest.java
M fe/src/test/java/org/apache/impala/planner/PlannerTestBase.java
11 files changed, 431 insertions(+), 56 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/93/18093/3
--
To view, visit http://gerrit.cloudera.org:8080/18093
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
Gerrit-Change-Number: 18093
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-11054: Support resource pool polling for frontend

2021-12-15 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18078 )

Change subject: IMPALA-11054: Support resource pool polling for frontend
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/18078
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia78b1a0574f6b8ad4df5bb0fc9533f218b486e6b
Gerrit-Change-Number: 18078
Gerrit-PatchSet: 4
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Wed, 15 Dec 2021 18:34:41 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11033: Add support for specifying multiple executor group sets

2021-12-13 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Qifan Chen, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18093

to look at the new patch set (#2).

Change subject: IMPALA-11033: Add support for specifying multiple executor 
group sets
..

IMPALA-11033: Add support for specifying multiple executor group sets

This patch introduces the concept of executor group sets. Each group
set specifies an executor group name prefix and an expected group
size. Every executor group that is a part of this set will have the
same prefix which will also be equivalent to the resource pool name
that it maps to. These sets are specified via a startup flag
'expected_executor_group_sets' which is a comma separated list in
the format :.

Testing:
- Added unit tests

Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
---
M be/src/runtime/exec-env.cc
M be/src/scheduling/CMakeLists.txt
M be/src/scheduling/cluster-membership-mgr-test.cc
M be/src/scheduling/cluster-membership-mgr.cc
M be/src/scheduling/cluster-membership-mgr.h
M be/src/service/impala-hs2-server.cc
M be/src/service/impala-server.cc
M common/thrift/Frontend.thrift
M fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java
M fe/src/test/java/org/apache/impala/planner/ClusterSizeTest.java
M fe/src/test/java/org/apache/impala/planner/PlannerTestBase.java
11 files changed, 411 insertions(+), 56 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/93/18093/2
--
To view, visit http://gerrit.cloudera.org:8080/18093
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
Gerrit-Change-Number: 18093
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 


[Impala-ASF-CR] IMPALA-11054: Support resource pool polling for frontend

2021-12-13 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18078 )

Change subject: IMPALA-11054: Support resource pool polling for frontend
..


Patch Set 3: Code-Review+1

Looks good to me, deferring +2 to Qifan in case he has any more comments


--
To view, visit http://gerrit.cloudera.org:8080/18078
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia78b1a0574f6b8ad4df5bb0fc9533f218b486e6b
Gerrit-Change-Number: 18078
Gerrit-PatchSet: 3
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Mon, 13 Dec 2021 23:20:49 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11054: Support resource pool polling for frontend

2021-12-13 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18078 )

Change subject: IMPALA-11054: Support resource pool polling for frontend
..


Patch Set 2:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/18078/2//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/18078/2//COMMIT_MSG@9
PS2, Line 9: To support different sizes of resource pools, Planner will need to 
poll
   : resource pool which is only done by backend currently.
nit: can remove this context since its not relevant to the patch directly


http://gerrit.cloudera.org:8080/#/c/18078/2/fe/src/main/java/org/apache/impala/util/RequestPoolService.java
File fe/src/main/java/org/apache/impala/util/RequestPoolService.java:

http://gerrit.cloudera.org:8080/#/c/18078/2/fe/src/main/java/org/apache/impala/util/RequestPoolService.java@345
PS2, Line 345:   if (LOG.isTraceEnabled()) {
 : LOG.trace("resolveRequestPool(pool={}, user={}): 
resolved_pool={}, has_access={}",
 : resolvePoolParams.getRequested_pool(), 
resolvePoolParams.getUser(),
 : result.resolved_pool, result.has_access);
 :   }
maybe log this at the end right before return so that its logged in all cases 
and not only when there is a successful resolution. (similar to how it was 
logged every time before this patch/0



--
To view, visit http://gerrit.cloudera.org:8080/18078
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia78b1a0574f6b8ad4df5bb0fc9533f218b486e6b
Gerrit-Change-Number: 18078
Gerrit-PatchSet: 2
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Comment-Date: Mon, 13 Dec 2021 17:41:56 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11033: Add support for specifying multiple executor group sets

2021-12-13 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/18093


Change subject: IMPALA-11033: Add support for specifying multiple executor 
group sets
..

IMPALA-11033: Add support for specifying multiple executor group sets

This patch introduces the concept of executor group sets. Each group
set specifies an executor group name prefix and an expected group
size. Every executor group that is a part of this set will have the
same prefix which will also be equivalent to the resource pool name
that it maps to. These sets are specified via a startup flag
'expected_executor_group_sets' which is a comma separated list in
the format :.

Testing:
- Added unit tests

Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
---
M be/src/runtime/exec-env.cc
M be/src/scheduling/CMakeLists.txt
M be/src/scheduling/cluster-membership-mgr-test.cc
M be/src/scheduling/cluster-membership-mgr.cc
M be/src/scheduling/cluster-membership-mgr.h
M be/src/service/impala-hs2-server.cc
M be/src/service/impala-server.cc
M common/thrift/Frontend.thrift
M fe/src/main/java/org/apache/impala/util/ExecutorMembershipSnapshot.java
9 files changed, 395 insertions(+), 54 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/93/18093/1
--
To view, visit http://gerrit.cloudera.org:8080/18093
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I9e0a3a5fe2b1f0b7507b7c096b7a3c373bc2e684
Gerrit-Change-Number: 18093
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 


[Impala-ASF-CR] IMPALA-10764: hide /logs link in webui if --logtostderr=true

2021-12-01 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18062 )

Change subject: IMPALA-10764: hide /logs link in webui if --logtostderr=true
..


Patch Set 1: Code-Review+2

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18062/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/18062/1//COMMIT_MSG@10
PS1, Line 10: --redirect_stdout_stderr=false
> The default for redirect_stdout_stderr is true
got it, thanks for the explanation



--
To view, visit http://gerrit.cloudera.org:8080/18062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I65234213f32902caa1f4368181b49f012a4dbcb3
Gerrit-Change-Number: 18062
Gerrit-PatchSet: 1
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Thu, 02 Dec 2021 01:01:15 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10764: hide /logs link in webui if --logtostderr=true

2021-12-01 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18062 )

Change subject: IMPALA-10764: hide /logs link in webui if --logtostderr=true
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18062/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/18062/1//COMMIT_MSG@10
PS1, Line 10: --redirect_stdout_stderr=false
is this also required to ensure no logs are generated? if yes, should we add a 
check on it too while removing the page?



--
To view, visit http://gerrit.cloudera.org:8080/18062
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I65234213f32902caa1f4368181b49f012a4dbcb3
Gerrit-Change-Number: 18062
Gerrit-PatchSet: 1
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 01 Dec 2021 18:29:34 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10970: Fix criterion for classifying coordinator only query

2021-11-30 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/17937 )

Change subject: IMPALA-10970: Fix criterion for classifying coordinator only 
query
..

IMPALA-10970: Fix criterion for classifying coordinator only query

This patch fixes a bug in the criterion which decided whether a query
can be considered as a coordinator only query. It did not consider
the possibility of parallel plans and ended up mis-classifying some
queries as coordinator only queries.

This classification was used during scheduling when dedicated
coordinators and executor groups are used and allowed coordinator
queries to be scheduled only on the coordinator even in the absence
of healthy executor groups.

As a result of this bug, queries classified wrongly ended up with
error code: NO_REGISTERED_BACKENDS.

Testing:
- Add new mt_dop test case for functional_query and pass
- Ran and passed custom_cluster/test_coordinators, test_executor_groups

Change-Id: Icaaf1f1ba7a976122b4d37bd675e6d8181dc8700
Reviewed-on: http://gerrit.cloudera.org:8080/17937
Tested-by: Impala Public Jenkins 
Reviewed-by: Bikramjeet Vig 
---
M be/src/scheduling/scheduler.cc
M common/thrift/Query.thrift
M testdata/workloads/functional-query/queries/QueryTest/mt-dop.test
3 files changed, 129 insertions(+), 2 deletions(-)

Approvals:
  Impala Public Jenkins: Verified
  Bikramjeet Vig: Looks good to me, approved

--
To view, visit http://gerrit.cloudera.org:8080/17937
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Icaaf1f1ba7a976122b4d37bd675e6d8181dc8700
Gerrit-Change-Number: 17937
Gerrit-PatchSet: 5
Gerrit-Owner: guojingfeng 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: guojingfeng 


[Impala-ASF-CR] IMPALA-10970: Fix criterion for classifying coordinator only query

2021-11-30 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17937 )

Change subject: IMPALA-10970: Fix criterion for classifying coordinator only 
query
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/17937
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Icaaf1f1ba7a976122b4d37bd675e6d8181dc8700
Gerrit-Change-Number: 17937
Gerrit-PatchSet: 4
Gerrit-Owner: guojingfeng 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: guojingfeng 
Gerrit-Comment-Date: Tue, 30 Nov 2021 20:12:20 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11035: Make x-forwarded-for http header case-insensitive

2021-11-23 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18048 )

Change subject: IMPALA-11035: Make x-forwarded-for http header case-insensitive
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/18048
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id9c4070a4a2d5ad9decb186a9219957d8c26a7d7
Gerrit-Change-Number: 18048
Gerrit-PatchSet: 2
Gerrit-Owner: Abhishek Rawat 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 24 Nov 2021 03:33:26 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11035: Make x-forwarded-for http header case-insensitive

2021-11-23 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18048 )

Change subject: IMPALA-11035: Make x-forwarded-for http header case-insensitive
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18048/1/fe/src/test/java/org/apache/impala/customcluster/LdapHS2Test.java
File fe/src/test/java/org/apache/impala/customcluster/LdapHS2Test.java:

http://gerrit.cloudera.org:8080/#/c/18048/1/fe/src/test/java/org/apache/impala/customcluster/LdapHS2Test.java@396
PS1, Line 396: "X-Forwarded-For"
nit: maybe just add a separate simple test like Case2 above (mentioning that it 
checks specifically for case-insensitivity) to make it obvious what we are 
testing



--
To view, visit http://gerrit.cloudera.org:8080/18048
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id9c4070a4a2d5ad9decb186a9219957d8c26a7d7
Gerrit-Change-Number: 18048
Gerrit-PatchSet: 1
Gerrit-Owner: Abhishek Rawat 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 23 Nov 2021 21:04:57 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10970: Fix criterion for classifying coordinator only query

2021-11-18 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17937 )

Change subject: IMPALA-10970: Fix criterion for classifying coordinator only 
query
..


Patch Set 3:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/17937/3/be/src/scheduling/scheduler.h
File be/src/scheduling/scheduler.h:

http://gerrit.cloudera.org:8080/#/c/17937/3/be/src/scheduling/scheduler.h@77
PS3, Line 77:   /// If there are only one TPlanExecInfo, single unpartitioned 
fragment is considered
:   /// coordinator only.
:   /// If there are multiple TPlanExecInfo, we should check all of 
them is co-located in
:   /// coordinator node. If all other TPlanExecInfo has only one 
fragments and it's
:   /// partition type is unpartitioned, we should also take this 
Query is a coordinator
:   /// only. (e.g. JOIN BUILD sink (MT_DOP>0) is co-located with 
its pre order ancestor
:   /// : JOIN_NODE) Check IMPALA-4224 for more detail.
nit: I think the original comment is good enough, this is adding extra context 
for an edge case which is an implementation detail and does not need to me 
mentioned in the method comment


http://gerrit.cloudera.org:8080/#/c/17937/3/be/src/scheduling/scheduler.cc
File be/src/scheduling/scheduler.cc:

http://gerrit.cloudera.org:8080/#/c/17937/3/be/src/scheduling/scheduler.cc@824
PS3, Line 824: if (exec_request.plan_exec_info.size() > 1) {
 : /// There are intermediate join build side plan that 
co-located with join node,
 : /// don't take the query as coordinator only.
 : return false;
 :   }
we can instead do something like,
num_parallel_plans = exec_request.plan_exec_info.size();
then at the end check for num_parallel_plans == 1, this way it makes the code 
self explanatory and helps us avoid adding comments. In addition to this, to 
further improve readability, you can add a better description to 
'plan_exec_info' member in struct TQueryExecRequest in Query.thrift file as 
follows:
'The first plan in the list materializes the query result and subsequent plans 
materialize the build sides of joins. Each plan appears before its dependencies 
in the list.'


http://gerrit.cloudera.org:8080/#/c/17937/3/testdata/workloads/functional-query/queries/QueryTest/mt-dop.test
File testdata/workloads/functional-query/queries/QueryTest/mt-dop.test:

http://gerrit.cloudera.org:8080/#/c/17937/3/testdata/workloads/functional-query/queries/QueryTest/mt-dop.test@30
PS3, Line 30: # TIMPALA-128: Coordinator only query determination error.
: # From IMPALMA-4224 we execute join build in a separate 
fragments, the root
: # plan contains two subplan, one of it contains join node while 
the other
: # contains join builder as its root, the separate join build 
fragment is
: # co-located with join node.
: # This test case will fail due to outdated coordinator only 
determination
: # logic which only take only the first subplan for concern. 
: # To ensure the inline left table placed at join probe side, we 
add
: # STRAIGHT_JOIN hint to disable join inversion optimization.
How about?
IMPALA-10970: Verify that a query plan containing 2 parallel plans with the 
first one having a single unpartitioned fragment is correctly classified as not 
being a coordinator only query. This query create a plan with the 
aforementioned condition and has a join build fragment that will be co-located 
with the root sink fragment(IMPALMA-4224). This ensures that the query fails to 
schedule and eventually times out in the queue if incorrectly classified as a 
coordinator only query.
To ensure the inline left table placed at join probe side, we add STRAIGHT_JOIN 
hint to disable join inversion optimization.



--
To view, visit http://gerrit.cloudera.org:8080/17937
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Icaaf1f1ba7a976122b4d37bd675e6d8181dc8700
Gerrit-Change-Number: 17937
Gerrit-PatchSet: 3
Gerrit-Owner: guojingfeng 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: guojingfeng 
Gerrit-Comment-Date: Fri, 19 Nov 2021 00:43:04 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10970: Update coordinator only judgment logic to adapt separate join build execution

2021-11-17 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17937 )

Change subject: IMPALA-10970: Update coordinator only judgment logic to adapt 
separate join build execution
..


Patch Set 1:

Just wanted to check if you got a chance to work on the next iteration of this 
patch


--
To view, visit http://gerrit.cloudera.org:8080/17937
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Icaaf1f1ba7a976122b4d37bd675e6d8181dc8700
Gerrit-Change-Number: 17937
Gerrit-PatchSet: 1
Gerrit-Owner: guojingfeng 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 17 Nov 2021 17:44:19 +
Gerrit-HasComments: No


[Impala-ASF-CR] [WIP] IMPALA-10992 Planner changes for estimate peak memory

2021-11-16 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17994 )

Change subject: [WIP] IMPALA-10992 Planner changes for estimate peak memory
..


Patch Set 11:

> I was thinking that the default and large group can be partitioned
 > among two disjoint sets of nodes in a cluster. Thus each should
 > have their own host names/ ip addresses for the scans (in
 > particular local reads) to work properly.
 >
 > The update on these two groups is contained in 
 > TUpdateExecutorMembershipRequest
 > (https://gerrit.cloudera.org/#/c/17994/11/common/thrift/Frontend.thrift).
 >  It should still be one message.

When using executor groups (i.e., non-default executor group name) ip addresses 
and hostnames are not sent in the updates (as the comment above 
PopulateExecutorMembershipRequest suggests). So all reads are assumed as remote 
reads and those ip addresses and hostnames will never be sent.


--
To view, visit http://gerrit.cloudera.org:8080/17994
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1dca6933f7db3d9e00b20c93b38310b0e77a09eb
Gerrit-Change-Number: 17994
Gerrit-PatchSet: 11
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Comment-Date: Tue, 16 Nov 2021 21:08:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] [WIP] IMPALA-10992 Planner changes for estimate peak memory

2021-11-15 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17994 )

Change subject: [WIP] IMPALA-10992 Planner changes for estimate peak memory
..


Patch Set 11:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/17994/11/be/src/service/impala-server.h
File be/src/service/impala-server.h:

http://gerrit.cloudera.org:8080/#/c/17994/11/be/src/service/impala-server.h@525
PS11, Line 525: LARGE_EXECUTOR_GROUP_NAME
I went through the patch briefly and it seems like the patch is based around 
the idea that there can be 2 sets of executor group types in the "default" 
mode. ‘default’ here is a bit of a misnomer. The “DEFAULT_EXECUTOR_GROUP_NAME” 
is used when we are not using the new concept of multiple executor groups at 
all. This is the legacy mode where there is a single cluster and all executors 
belong to that. So we dont need to specify a different 
“LARGE_EXECUTOR_GROUP_NAME” name since a large executor group can only exist 
when using “executor groups”. The same concept carries over to the changes in 
cluster-membership-mgr and ExecutorMembershipSnapshot. We dont need to send 
separate updates for large and regular executor groups because when we are 
using executor-groups, fields like hostnames and ip_addresses are not passed 
on. We only need ‘num_executors’. Therefore we can avoid keeping 2 sets of  
hostnames and ip_addresses and only need another set of num_executors. This 
will greatly simplify the update code.



--
To view, visit http://gerrit.cloudera.org:8080/17994
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1dca6933f7db3d9e00b20c93b38310b0e77a09eb
Gerrit-Change-Number: 17994
Gerrit-PatchSet: 11
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Comment-Date: Tue, 16 Nov 2021 02:16:18 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10943: Add test to verify support for multiple resource and executor pools

2021-11-09 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has removed a vote on this change.

Change subject: IMPALA-10943: Add test to verify support for multiple resource 
and executor pools
..


Removed Verified-1 by Impala Public Jenkins 
--
To view, visit http://gerrit.cloudera.org:8080/17891
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: deleteVote
Gerrit-Change-Id: If76d386d8de5730da937674ddd9a69aa1aa1355e
Gerrit-Change-Number: 17891
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-10943: Add test to verify support for multiple resource and executor pools

2021-11-09 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17891 )

Change subject: IMPALA-10943: Add test to verify support for multiple resource 
and executor pools
..


Patch Set 4:

Hit another flaky test IMPALA-11012


--
To view, visit http://gerrit.cloudera.org:8080/17891
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If76d386d8de5730da937674ddd9a69aa1aa1355e
Gerrit-Change-Number: 17891
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Tue, 09 Nov 2021 20:42:31 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10943: Add test to verify support for multiple resource and executor pools

2021-11-08 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has removed a vote on this change.

Change subject: IMPALA-10943: Add test to verify support for multiple resource 
and executor pools
..


Removed Verified-1 by Impala Public Jenkins 
--
To view, visit http://gerrit.cloudera.org:8080/17891
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: deleteVote
Gerrit-Change-Id: If76d386d8de5730da937674ddd9a69aa1aa1355e
Gerrit-Change-Number: 17891
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-10970: Update coordinator only judgment logic to adapt separate join build execution

2021-11-04 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17937 )

Change subject: IMPALA-10970: Update coordinator only judgment logic to adapt 
separate join build execution
..


Patch Set 1:

(4 comments)

http://gerrit.cloudera.org:8080/#/c/17937/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/17937/1//COMMIT_MSG@7
PS1, Line 7: Update coordinator only judgment logic to adapt separate j
Fix criterion for classifying coordinator only query


http://gerrit.cloudera.org:8080/#/c/17937/1//COMMIT_MSG@9
PS1, Line 9: IMPALA-8830 introduce new executor group assignment for 
coordinator only
   : queries which assign all coordinator queries to a 'empty' executor 
group
   : that we got the ability to execute trivil queries regardless of the
   : presence of any healthy executor groups.
   :
   : IsCoordinatorOnlyQuery is introduced to determine query is 
coordinator
   : only or not. Before this change, IsCoordinatorOnlyQuery only take 
the
   : first TPlanExecInfo of TQueryExecRequest for concern.
   :
   : This is not enough since IMPALA-4224 execute separate join builds
   : fragments for parallel join execution. With the new join execution 
mode,
   : join builder is executed in separated plan if MT_DOP > 0. This 
means we
   : will have 2 TPlanExecInfo for a single join node, join builder is
   : co-located with join node but at separate TPlanExecInfo. So 
without this
   : patch the following query will fail in admission because we will 
take
   : it as coordinator only query. In debug build it will hit the 
DECHECK at
   : /be/src/scheduling/scheduler.cc#L162, in release build it will
   : fail with: "Scheduling for executor group: empty group
   : (using coordinator only) with 0 executors".
   :
   : This patch update the judgment logic to take all of TPlanExecInfo 
for
   : concern.
nit: This patch fixes a bug in the criterion which decided whether a query can 
be considered as a coordinator only query. It did not consider the possibility 
of parallel plans and ended up mis-classifying some queries as coordinator only 
queries. This classification was used during scheduling when dedicated 
coordinators and executor groups are used and allowed coordinator queries to be 
scheduled only on the coordinator even in the absence of healthy executor 
groups. As a result of this bug, queries classified wrongly ended up being 
queued and timed-out waiting.


http://gerrit.cloudera.org:8080/#/c/17937/1/be/src/scheduling/scheduler.cc
File be/src/scheduling/scheduler.cc:

http://gerrit.cloudera.org:8080/#/c/17937/1/be/src/scheduling/scheduler.cc@a824
PS1, Line 824:
Thanks for catching this bug.
A stronger check here would be to ensure that there is a single element in the 
plan_exec_info list. Can you please add a check for plan_exec_info.size() == 1.

To give you more context the only way there can be more plans in plan_exec_info 
is if they produce intermediate join build sides in a parallel plan. Since in 
those cases they cannot be small coordinator only queries it would make sense 
to check for that condition as well.


http://gerrit.cloudera.org:8080/#/c/17937/1/testdata/workloads/functional-query/queries/QueryTest/mt-dop.test
File testdata/workloads/functional-query/queries/QueryTest/mt-dop.test:

http://gerrit.cloudera.org:8080/#/c/17937/1/testdata/workloads/functional-query/queries/QueryTest/mt-dop.test@39
PS1, Line 39: SELECT
: STRAIGHT_JOIN t1.ts,count(t2.l_orderkey)
: FROM
: ( SELECT '20210831' AS ts
:  UNION ALL SELECT '20210901' AS ts
:  UNION ALL SELECT '20210902' AS ts
:  UNION ALL SELECT '20210903' AS ts ) t1
: CROSS JOIN tpch.lineitem t2
: GROUP BY t1.ts
: ORDER BY t1.ts;
can you use the following query instead to bring down the runtime. (using 
tpch.lineitem takes ~ 7s vs functional.alltypes which takes ~ 0.17s)

SELECT STRAIGHT_JOIN t2.int_col
FROM
( SELECT '20210831' AS ts
 UNION ALL SELECT '20210901' AS ts
 UNION ALL SELECT '20210902' AS ts
 UNION ALL SELECT '20210903' AS ts ) t1
CROSS JOIN functional.alltypes t2
LIMIT 100



--
To view, visit http://gerrit.cloudera.org:8080/17937
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Icaaf1f1ba7a976122b4d37bd675e6d8181dc8700
Gerrit-Change-Number: 17937
Gerrit-PatchSet: 1
Gerrit-Owner: guojingfeng 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Thu, 04 Nov 2021 20:00:24 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10943: Add test to verify support for multiple resource and executor pools

2021-11-02 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17891 )

Change subject: IMPALA-10943: Add test to verify support for multiple resource 
and executor pools
..


Patch Set 3:

Hit another unrelated failure IMPALA-10886


--
To view, visit http://gerrit.cloudera.org:8080/17891
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If76d386d8de5730da937674ddd9a69aa1aa1355e
Gerrit-Change-Number: 17891
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Tue, 02 Nov 2021 20:07:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10943: Add test to verify support for multiple resource and executor pools

2021-11-01 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17891 )

Change subject: IMPALA-10943: Add test to verify support for multiple resource 
and executor pools
..


Patch Set 3:

unrelated flaky tests failed in last GVO, starting another one.


--
To view, visit http://gerrit.cloudera.org:8080/17891
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If76d386d8de5730da937674ddd9a69aa1aa1355e
Gerrit-Change-Number: 17891
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Mon, 01 Nov 2021 22:56:36 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10943: Add test to verify support for multiple resource and executor pools

2021-11-01 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has removed a vote on this change.

Change subject: IMPALA-10943: Add test to verify support for multiple resource 
and executor pools
..


Removed Verified-1 by Impala Public Jenkins 
--
To view, visit http://gerrit.cloudera.org:8080/17891
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: deleteVote
Gerrit-Change-Id: If76d386d8de5730da937674ddd9a69aa1aa1355e
Gerrit-Change-Number: 17891
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-10943: Add test to verify support for multiple resource and executor pools

2021-10-29 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17891 )

Change subject: IMPALA-10943: Add test to verify support for multiple resource 
and executor pools
..


Patch Set 2: Code-Review+2

(4 comments)

Carrying over Andrew's +2

http://gerrit.cloudera.org:8080/#/c/17891/1/tests/custom_cluster/test_executor_groups.py
File tests/custom_cluster/test_executor_groups.py:

http://gerrit.cloudera.org:8080/#/c/17891/1/tests/custom_cluster/test_executor_groups.py@610
PS1, Line 610:
> flake8: E122 continuation line missing indentation or outdented
Done


http://gerrit.cloudera.org:8080/#/c/17891/1/tests/custom_cluster/test_executor_groups.py@617
PS1, Line 617: "
> flake8: E251 unexpected spaces around keyword / parameter equals
Done


http://gerrit.cloudera.org:8080/#/c/17891/1/tests/custom_cluster/test_executor_groups.py@620
PS1, Line 620: "
> flake8: E251 unexpected spaces around keyword / parameter equals
Done


http://gerrit.cloudera.org:8080/#/c/17891/1/tests/custom_cluster/test_executor_groups.py@650
PS1, Line 650: s
> flake8: F841 local variable 'handle_long_running_queue2' is assigned to but
Done



--
To view, visit http://gerrit.cloudera.org:8080/17891
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If76d386d8de5730da937674ddd9a69aa1aa1355e
Gerrit-Change-Number: 17891
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Fri, 29 Oct 2021 23:08:54 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10943: Add test to verify support for multiple resource and executor pools

2021-10-29 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Abhishek Rawat, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/17891

to look at the new patch set (#2).

Change subject: IMPALA-10943: Add test to verify support for multiple resource 
and executor pools
..

IMPALA-10943: Add test to verify support for multiple resource and
executor pools

This patch adds a test to verify that admission control accounting
works when using multiple coordinators and multiple executor groups
mapped to different resource pools and having different sizes.

Change-Id: If76d386d8de5730da937674ddd9a69aa1aa1355e
---
M tests/custom_cluster/test_executor_groups.py
1 file changed, 86 insertions(+), 7 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/91/17891/2
--
To view, visit http://gerrit.cloudera.org:8080/17891
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: If76d386d8de5730da937674ddd9a69aa1aa1355e
Gerrit-Change-Number: 17891
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-10973: Do not schedule empty scan nodes to coordinator

2021-10-19 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17954 )

Change subject: IMPALA-10973: Do not schedule empty scan nodes to coordinator
..


Patch Set 3:

@Csaba It would good to have a test case that verified this behavior


--
To view, visit http://gerrit.cloudera.org:8080/17954
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie31df3861aad2e3e91cab621ff122a4f721905ef
Gerrit-Change-Number: 17954
Gerrit-PatchSet: 3
Gerrit-Owner: Csaba Ringhofer 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 19 Oct 2021 21:49:00 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10973: Do not schedule empty scan nodes to coordinator

2021-10-19 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17954 )

Change subject: IMPALA-10973: Do not schedule empty scan nodes to coordinator
..


Patch Set 2:

Had started GVO but then noticed that a GVO had already succeeded. So instead 
will just submit the changes


--
To view, visit http://gerrit.cloudera.org:8080/17954
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie31df3861aad2e3e91cab621ff122a4f721905ef
Gerrit-Change-Number: 17954
Gerrit-PatchSet: 2
Gerrit-Owner: Csaba Ringhofer 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 19 Oct 2021 19:59:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10973: Do not schedule empty scan nodes to coordinator

2021-10-19 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/17954 )

Change subject: IMPALA-10973: Do not schedule empty scan nodes to coordinator
..

IMPALA-10973: Do not schedule empty scan nodes to coordinator

Until now fragments with scan nodes that have no scan ranges were
scheduled to the coordinator, even if it is an exclusive coordinator.

This could possibly lead to a lot of work to be scheduled to the
coordinator. This patch changes the logic to choose a random executor
instead.

Change-Id: Ie31df3861aad2e3e91cab621ff122a4f721905ef
Reviewed-on: http://gerrit.cloudera.org:8080/17954
Tested-by: Impala Public Jenkins 
Reviewed-by: Abhishek Rawat 
Reviewed-by: Bikramjeet Vig 
---
M be/src/scheduling/scheduler.cc
M be/src/scheduling/scheduler.h
2 files changed, 17 insertions(+), 14 deletions(-)

Approvals:
  Impala Public Jenkins: Verified
  Abhishek Rawat: Looks good to me, but someone else must approve
  Bikramjeet Vig: Looks good to me, approved

--
To view, visit http://gerrit.cloudera.org:8080/17954
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Ie31df3861aad2e3e91cab621ff122a4f721905ef
Gerrit-Change-Number: 17954
Gerrit-PatchSet: 3
Gerrit-Owner: Csaba Ringhofer 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-10973: Do not schedule empty scan nodes to coordinator

2021-10-19 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17954 )

Change subject: IMPALA-10973: Do not schedule empty scan nodes to coordinator
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/17954
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie31df3861aad2e3e91cab621ff122a4f721905ef
Gerrit-Change-Number: 17954
Gerrit-PatchSet: 2
Gerrit-Owner: Csaba Ringhofer 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 19 Oct 2021 19:57:43 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10950: Update expr-benchmark.cc

2021-10-08 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17894 )

Change subject: IMPALA-10950: Update expr-benchmark.cc
..


Patch Set 6: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/17894
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5b17434d85e32a58622bffb64a697b062a8bf43f
Gerrit-Change-Number: 17894
Gerrit-PatchSet: 6
Gerrit-Owner: Riza Suminto 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Comment-Date: Fri, 08 Oct 2021 20:13:43 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10950: Update expr-benchmark.cc

2021-10-06 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17894 )

Change subject: IMPALA-10950: Update expr-benchmark.cc
..


Patch Set 5:

(5 comments)

http://gerrit.cloudera.org:8080/#/c/17894/3/be/src/benchmarks/expr-benchmark.cc
File be/src/benchmarks/expr-benchmark.cc:

http://gerrit.cloudera.org:8080/#/c/17894/3/be/src/benchmarks/expr-benchmark.cc@108
PS3, Line 108: FragmentState* fragment_state = state->obj_pool()->Add(
> You're absolutely right! We explicitly set ENABLE_EXPR_REWRITES=0 in patch
can you add context here about the kind of plan generated? maybe just the 
textual plan itself?


http://gerrit.cloudera.org:8080/#/c/17894/4/be/src/benchmarks/expr-benchmark.cc
File be/src/benchmarks/expr-benchmark.cc:

http://gerrit.cloudera.org:8080/#/c/17894/4/be/src/benchmarks/expr-benchmark.cc@78
PS4, Line 78: ity class
nit: how about EnableCodegen(true)


http://gerrit.cloudera.org:8080/#/c/17894/4/be/src/benchmarks/expr-benchmark.cc@629
PS4, Line 629: col",
was the previous case not valid?


http://gerrit.cloudera.org:8080/#/c/17894/5/be/src/benchmarks/expr-benchmark.cc
File be/src/benchmarks/expr-benchmark.cc:

http://gerrit.cloudera.org:8080/#/c/17894/5/be/src/benchmarks/expr-benchmark.cc@104
PS5, Line 104: new RuntimeState(query_ctx, &exec_env_);
you can also allocate this from pool_ so that it can be cleaned eventually


http://gerrit.cloudera.org:8080/#/c/17894/5/be/src/benchmarks/expr-benchmark.cc@171
PS5, Line 171: test_data
does this ever get deleted?



--
To view, visit http://gerrit.cloudera.org:8080/17894
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5b17434d85e32a58622bffb64a697b062a8bf43f
Gerrit-Change-Number: 17894
Gerrit-PatchSet: 5
Gerrit-Owner: Riza Suminto 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Comment-Date: Thu, 07 Oct 2021 01:35:20 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10950: Update expr-benchmark.cc

2021-10-05 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17894 )

Change subject: IMPALA-10950: Update expr-benchmark.cc
..


Patch Set 3:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/17894/3/be/src/benchmarks/expr-benchmark.cc
File be/src/benchmarks/expr-benchmark.cc:

http://gerrit.cloudera.org:8080/#/c/17894/3/be/src/benchmarks/expr-benchmark.cc@108
PS3, Line 108: const TQueryExecRequest& query_request = 
request.query_exec_request;
> This one seems to be a bigger obstacle for benchmarking expression.
you can use this to query option to disable re-write: ENABLE_EXPR_REWRITES



--
To view, visit http://gerrit.cloudera.org:8080/17894
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5b17434d85e32a58622bffb64a697b062a8bf43f
Gerrit-Change-Number: 17894
Gerrit-PatchSet: 3
Gerrit-Owner: Riza Suminto 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Comment-Date: Tue, 05 Oct 2021 19:42:11 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10942: Fix memory leak in admission controller

2021-10-05 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17893 )

Change subject: IMPALA-10942: Fix memory leak in admission controller
..


Patch Set 4:

GVO failed earlier due to flaky test. Re-running


--
To view, visit http://gerrit.cloudera.org:8080/17893
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1df0de0e658f6dcb5a044d108d0943aaa3dea0a1
Gerrit-Change-Number: 17893
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: David Rorke 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: abeltian 
Gerrit-Comment-Date: Tue, 05 Oct 2021 18:18:45 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10942: Fix memory leak in admission controller

2021-10-05 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has removed a vote on this change.

Change subject: IMPALA-10942: Fix memory leak in admission controller
..


Removed Verified-1 by Impala Public Jenkins 
--
To view, visit http://gerrit.cloudera.org:8080/17893
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: deleteVote
Gerrit-Change-Id: I1df0de0e658f6dcb5a044d108d0943aaa3dea0a1
Gerrit-Change-Number: 17893
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: David Rorke 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: abeltian 


[Impala-ASF-CR] IMPALA-10950: Update expr-benchmark.cc

2021-10-05 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17894 )

Change subject: IMPALA-10950: Update expr-benchmark.cc
..


Patch Set 3:

(8 comments)

http://gerrit.cloudera.org:8080/#/c/17894/3//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/17894/3//COMMIT_MSG@9
PS3, Line 9: Query planner has move the scalar expression's thrift definition 
from
   : 'fragments[0].output_sink.output_exprs' to
   : 'fragments[0].plan.nodes[0].union_node.const_expr_lists[0]'.
which change made this switch? curious as to what the context there was


http://gerrit.cloudera.org:8080/#/c/17894/3//COMMIT_MSG@16
PS3, Line 16: some expressions
: in the benchmark now will crash without codegen due to missing 
symbols
why are they now crashing if the benchmark earlier used to run fine without 
codegen?


http://gerrit.cloudera.org:8080/#/c/17894/3/be/src/benchmarks/expr-benchmark.cc
File be/src/benchmarks/expr-benchmark.cc:

http://gerrit.cloudera.org:8080/#/c/17894/3/be/src/benchmarks/expr-benchmark.cc@108
PS3, Line 108: const TQueryExecRequest& query_request = 
request.query_exec_request;
nit: might be worth adding context here about the kind of plan created for a 
constant query


http://gerrit.cloudera.org:8080/#/c/17894/3/be/src/benchmarks/expr-benchmark.cc@119
PS3, Line 119: // Uncomment the following lines for debugging.
 : // union_node.printTo(cout);
 : // cout << endl << " Need Codegen: " << 
test->fragment_state->ScalarExprNeedsCodegen()
 : //  << endl << endl;
do these need to be removed?


http://gerrit.cloudera.org:8080/#/c/17894/3/be/src/benchmarks/expr-benchmark.cc@134
PS3, Line 134: codegen->EnableOptimizations(false);
why are we disabling this?


http://gerrit.cloudera.org:8080/#/c/17894/3/be/src/benchmarks/expr-benchmark.cc@141
PS3, Line 141:   void ReleaseTestData() {
 : test_data_.clear();
 : mem_pool_.FreeAll();
 :   }
nit: you can probably just add this to the destructor instead of a separate 
function


http://gerrit.cloudera.org:8080/#/c/17894/3/be/src/benchmarks/expr-benchmark.cc@183
PS3, Line 183: #define BENCHMARK(name, stmt) \
 :   suite->AddBenchmark(name, BenchmarkQueryFn, 
GenerateBenchmarkExprs(stmt, true))
we should probably do both with and without codegen and also keep the benchmark 
results for both, will be good to have historical data.


http://gerrit.cloudera.org:8080/#/c/17894/3/be/src/benchmarks/expr-benchmark.cc@234
PS3, Line 234: // Cast: FunctionRate  
Comparison
 : // 
--
 : // int_to_int 824
  1X
 : //int_to_bool 878
  1.066X
 : //  int_to_double   775.4
  0.941X
 : //  int_to_string   32.47
0.03941X
 : //  double_to_boolean   823.5
 0.9994X
 : //   double_to_bigint   775.4
  0.941X
 : //   double_to_string   4.682   
0.005682X
 : //  string_to_int   402.6
 0.4886X
 : //string_to_float   145.8
 0.1769X
 : //string_to_timestamp   83.76
 0.1017X
all of these values are for benchmark without codegen, can you update these too?



--
To view, visit http://gerrit.cloudera.org:8080/17894
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5b17434d85e32a58622bffb64a697b062a8bf43f
Gerrit-Change-Number: 17894
Gerrit-PatchSet: 3
Gerrit-Owner: Riza Suminto 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Comment-Date: Tue, 05 Oct 2021 07:01:06 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10942: Fix memory leak in admission controller

2021-10-04 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17893 )

Change subject: IMPALA-10942: Fix memory leak in admission controller
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/17893
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1df0de0e658f6dcb5a044d108d0943aaa3dea0a1
Gerrit-Change-Number: 17893
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: David Rorke 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: abeltian 
Gerrit-Comment-Date: Mon, 04 Oct 2021 16:41:35 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10942: Fix memory leak in admission controller

2021-10-04 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17893 )

Change subject: IMPALA-10942: Fix memory leak in admission controller
..


Patch Set 3: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/17893
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1df0de0e658f6dcb5a044d108d0943aaa3dea0a1
Gerrit-Change-Number: 17893
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: David Rorke 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: abeltian 
Gerrit-Comment-Date: Mon, 04 Oct 2021 16:41:17 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10942: Fix memory leak in admission controller

2021-10-04 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, abeltian, Riza Suminto, David Rorke, Impala Public 
Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/17893

to look at the new patch set (#3).

Change subject: IMPALA-10942: Fix memory leak in admission controller
..

IMPALA-10942: Fix memory leak in admission controller

This patch fixes a memory leak in the admission controller where
objects tracking state required for queuing were being retained
despite the query being admitted or rejected immediately.

Change-Id: I1df0de0e658f6dcb5a044d108d0943aaa3dea0a1
---
M be/src/scheduling/admission-control-service.cc
M be/src/scheduling/admission-controller.cc
M be/src/scheduling/admission-controller.h
M be/src/scheduling/local-admission-control-client.cc
4 files changed, 6 insertions(+), 6 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/93/17893/3
--
To view, visit http://gerrit.cloudera.org:8080/17893
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I1df0de0e658f6dcb5a044d108d0943aaa3dea0a1
Gerrit-Change-Number: 17893
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: David Rorke 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: abeltian 


[Impala-ASF-CR] IMPALA-10942: Fix memory leak in admission controller

2021-10-01 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/17893


Change subject: IMPALA-10942: Fix memory leak in admission controller
..

IMPALA-10942: Fix memory leak in admission controller

This patch fixes a memory leak in the admission controller where
objects tracking state required for queuing where being retained
despite the query being admitted or rejected immediately.

Change-Id: I1df0de0e658f6dcb5a044d108d0943aaa3dea0a1
---
M be/src/scheduling/admission-control-service.cc
M be/src/scheduling/admission-controller.cc
M be/src/scheduling/admission-controller.h
M be/src/scheduling/local-admission-control-client.cc
4 files changed, 6 insertions(+), 6 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/93/17893/2
--
To view, visit http://gerrit.cloudera.org:8080/17893
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I1df0de0e658f6dcb5a044d108d0943aaa3dea0a1
Gerrit-Change-Number: 17893
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 


[Impala-ASF-CR] IMPALA-10943: Add test to verify support for multiple resource and executor pools

2021-09-30 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/17891


Change subject: IMPALA-10943: Add test to verify support for multiple resource 
and executor pools
..

IMPALA-10943: Add test to verify support for multiple resource and
executor pools

This patch adds a test to verify that admission control accounting
works when using multiple coordinators and multiple executor groups
mapped to different resource pools and having different sizes.

Change-Id: If76d386d8de5730da937674ddd9a69aa1aa1355e
---
M tests/custom_cluster/test_executor_groups.py
1 file changed, 86 insertions(+), 7 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/91/17891/1
--
To view, visit http://gerrit.cloudera.org:8080/17891
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: If76d386d8de5730da937674ddd9a69aa1aa1355e
Gerrit-Change-Number: 17891
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 


[Impala-ASF-CR] IMPALA-9930 (part 2): Introduce new admission control rpc service

2021-09-30 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16412 )

Change subject: IMPALA-9930 (part 2): Introduce new admission control rpc 
service
..


Patch Set 14:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/16412/14/be/src/scheduling/admission-controller.cc
File be/src/scheduling/admission-controller.cc:

http://gerrit.cloudera.org:8080/#/c/16412/14/be/src/scheduling/admission-controller.cc@1155
PS14, Line 1155: if (!queued) queue_nodes_.erase(request.query_id);
> We can get a memory leak here.
You are absolutely right! I have created a ticket for this IMPALA-10942 and 
will get a fix for it in soon.
Thanks a lot for catching it.



--
To view, visit http://gerrit.cloudera.org:8080/16412
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I594fc593a27b24b6952e381a9bc1a9a5c6b757ae
Gerrit-Change-Number: 16412
Gerrit-PatchSet: 14
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: abeltian 
Gerrit-Comment-Date: Thu, 30 Sep 2021 20:14:43 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10784 (part 3): Prepare to publish impala-shell on PyPi

2021-09-15 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17826 )

Change subject: IMPALA-10784 (part 3): Prepare to publish impala-shell on PyPi
..


Patch Set 3: Code-Review+1


--
To view, visit http://gerrit.cloudera.org:8080/17826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I378404e2407396d4de3bb0eea4d49a9c5bb4e46a
Gerrit-Change-Number: 17826
Gerrit-PatchSet: 3
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Attila Jeges 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Wed, 15 Sep 2021 18:52:04 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9976 IMPALA-10866: Add recovery mechanism to admission service and fix consistency between coord failure detection and registration

2021-08-31 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17332 )

Change subject: IMPALA-9976 IMPALA-10866: Add recovery mechanism to admission 
service and fix consistency between coord failure detection and registration
..


Patch Set 2:

(19 comments)

http://gerrit.cloudera.org:8080/#/c/17332/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/17332/1//COMMIT_MSG@9
PS1, Line 9:
> I wrote my own version of the commit message to see that I understand the c
Pretty much. The only thing I would add is that if a backend is marked as down 
by the statestore, it will also have to register(send full admission state) 
again with the admissiond to be able to be serviced


http://gerrit.cloudera.org:8080/#/c/17332/1//COMMIT_MSG@12
PS1, Line 12: - Leverages the admission heartbeat mechanism to signal the
> Nit" should this be "No RPCs are serviced from a coordinator until it has s
Done


http://gerrit.cloudera.org:8080/#/c/17332/1/be/src/scheduling/admission-control-client.cc
File be/src/scheduling/admission-control-client.cc:

http://gerrit.cloudera.org:8080/#/c/17332/1/be/src/scheduling/admission-control-client.cc@32
PS1, Line 32: "Re-submit for admission due to a possible admission service 
restart or network "
> Is there a reason it is only a "possible" restart?
as this can also be due to a generic network error that prevents the Admit RPC 
to go through.
Added this to the error msg. Open to suggestion for a better message text


http://gerrit.cloudera.org:8080/#/c/17332/1/be/src/scheduling/admission-control-service.cc
File be/src/scheduling/admission-control-service.cc:

http://gerrit.cloudera.org:8080/#/c/17332/1/be/src/scheduling/admission-control-service.cc@304
PS1, Line 304: LOG(INFO) << "Received heartbeat from unrecognized 
coord_id=" << req->coord_id();
> Maybe WARNING is too high, we do expect to see this after a restart, maybe
Just a log line at the INFO level should suffice in that case


http://gerrit.cloudera.org:8080/#/c/17332/1/be/src/scheduling/admission-control-service.cc@332
PS1, Line 332:   bool registered_on_ac_service =
> Is it possible that the admissionstate we have just received is more up-to-
I have made both the RebuildAdmissionState and  
CancelQueriesOnFailedCoordinators atomic operations so there should be no 
inconsistency anymore. Also if the coord is already registered it would return 
an OK status. In that case as well there should be no inconsistency as all RPCs 
after the first successful call to RebuildAdmissionState would be serviced and 
ideally any subsequent calls for RebuildAdmissionState should just be previous 
retries.


http://gerrit.cloudera.org:8080/#/c/17332/1/be/src/scheduling/admission-control-service.cc@449
PS1, Line 449: lock_guard l(admission_state->lock);
> The size of known_coord_ids_ is interesting, maybe add it to the message.
Done


http://gerrit.cloudera.org:8080/#/c/17332/1/be/src/scheduling/admission-controller.cc
File be/src/scheduling/admission-controller.cc:

http://gerrit.cloudera.org:8080/#/c/17332/1/be/src/scheduling/admission-controller.cc@2103
PS1, Line 2103:   DCHECK(
> line too long (93 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/17332/1/be/src/scheduling/admission-controller.cc@2104
PS1, Line 2104:   num_backends_to_release_.find(state->query_id()) == 
num_backends_to_release_.end());
> line too long (92 > 90)
Done


http://gerrit.cloudera.org:8080/#/c/17332/1/be/src/scheduling/remote-admission-control-client.h
File be/src/scheduling/remote-admission-control-client.h:

http://gerrit.cloudera.org:8080/#/c/17332/1/be/src/scheduling/remote-admission-control-client.h@54
PS1, Line 54:   /// TODO: add info on what this does? here or in the class 
comments
> Yes it seems like these methods don't have descriptions
will add these in the next update


http://gerrit.cloudera.org:8080/#/c/17332/1/be/src/scheduling/remote-admission-control-client.cc
File be/src/scheduling/remote-admission-control-client.cc:

http://gerrit.cloudera.org:8080/#/c/17332/1/be/src/scheduling/remote-admission-control-client.cc@131
PS1, Line 131: // retry_admission can be false if 
AttemptAdmissionAndWait succeeded but a
> I think this was already set to true?
This might not be set to true if the RPC succeeded but the impala-server 
initiated a ResetPendingAdmit. This can happen in the following case:
- the RPCs (both admit and getQueryStatus) succeeded on a previous instance of 
the admissiond
- The coordinator re-registered with an ongoing (old or a new restarted) 
instance of the admissiond.


http://gerrit.cloudera.org:8080/#/c/17332/1/be/src/scheduling/remote-admission-control-client.cc@158
PS1, Line 158:   while (admit_rpc_status.IsNetworkError()
> Nit: "admissiond"
Done


http://gerrit.cloudera.org:8080/#/c/17332/1/common/protobuf/admission_control_service.proto
File common/protobuf/admission_control_service.proto:

http://gerrit.cl

[Impala-ASF-CR] IMPALA-9976 IMPALA-10866: Add recovery mechanism to admission service and fix consistency between coord failure detection and registration

2021-08-31 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/17332

to look at the new patch set (#2).

Change subject: IMPALA-9976 IMPALA-10866: Add recovery mechanism to admission 
service and fix consistency between coord failure detection and registration
..

IMPALA-9976 IMPALA-10866: Add recovery mechanism to admission service
and fix consistency between coord failure detection and registration

Major changes:
IMPALA-9976:
- Leverages the admission heartbeat mechanism to signal the
coordinator to send its complete admission state
- No RPCs are serviced by a coordinator until it has sent its complete
admission state. This is to prevent making admission decisions till
admission service has built its view of the cluster
- The complete admission state consists of the states of all queries
that have successfully been admitted, that is, received a valid
schedule from the admission controller and have marked its admission
as complete (for remote admission it means its pending admit status
has transitioned from true to false)
- This helps prevent sending incomplete/inconsistent state to the
admission controller
- Queries that have not started admission get a chance to send their
request to the new service
- Queries that are queued restart the admission process by sending
the request again. This re-try is now also marked in the query profile
- Other RPCs like ReleaseBackend, ReleaseQuery, CancelQuery that
don't get serviced (till initial admission state is sent) can result
in inconsistent state. This state will be rectified in the admission
heartbeats
- AdmitQuery and GetQueryStatus just retry again if they notice a
network failure(assuming admissiond might be down/restarting) or
received the error message that they cannot be serviced yet
admissiond is waiting on initial state from this coordinator)

IMPALA-10866:
- Made sure that admission state removal on failure detection and
admission state rebuilding on coordinator registration are atomic
operations.
- Leverage statestore's membership view to detect failure and
allow coordinator registration.

Limitations:
- Rebuilding the state can not ensure that queued queries will
maintain their spot in the queue.
- Queries can be admitted before all coordinators get a chance to
send their state. This can result in a brief period of over-admission
We cannot rely completely on the statestore membership update and
wait for all coordinators there to send admission state because
that membership is also dynamic which makes it difficult to decide
when to assume that the admission state is complete.
- The functionalities for coordinator failure detection and
registration rely completely on the statestore.

Testing:
- Added end to end tests

Change-Id: I8ad3ef9b9e2496c484833d6326ce914c851e02fd
---
M be/src/runtime/coordinator-backend-resource-state.cc
M be/src/runtime/coordinator-backend-state.h
M be/src/runtime/coordinator.cc
M be/src/runtime/coordinator.h
M be/src/scheduling/admission-control-client.cc
M be/src/scheduling/admission-control-client.h
M be/src/scheduling/admission-control-service.cc
M be/src/scheduling/admission-control-service.h
M be/src/scheduling/admission-controller-test.cc
M be/src/scheduling/admission-controller.cc
M be/src/scheduling/admission-controller.h
M be/src/scheduling/admissiond-env.cc
M be/src/scheduling/local-admission-control-client.cc
M be/src/scheduling/local-admission-control-client.h
M be/src/scheduling/remote-admission-control-client.cc
M be/src/scheduling/remote-admission-control-client.h
M be/src/scheduling/schedule-state.cc
M be/src/service/client-request-state.cc
M be/src/service/client-request-state.h
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
M common/protobuf/admission_control_service.proto
M common/thrift/generate_error_codes.py
M tests/custom_cluster/test_admission_controller.py
24 files changed, 756 insertions(+), 93 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/32/17332/2
--
To view, visit http://gerrit.cloudera.org:8080/17332
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8ad3ef9b9e2496c484833d6326ce914c851e02fd
Gerrit-Change-Number: 17332
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-10784 (part 2): Fix retaining cookies for impala-shell

2021-08-20 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17796 )

Change subject: IMPALA-10784 (part 2): Fix retaining cookies for impala-shell
..


Patch Set 3: Code-Review+1

(1 comment)

http://gerrit.cloudera.org:8080/#/c/17796/3//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/17796/3//COMMIT_MSG@11
PS3, Line 11: Pythonic coding
Adding Attila to verify changes are in line with python best practices



--
To view, visit http://gerrit.cloudera.org:8080/17796
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I65432b952929c1c96a081bb87fd4a096624d711b
Gerrit-Change-Number: 17796
Gerrit-PatchSet: 3
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Attila Jeges 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 20 Aug 2021 22:04:09 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10874: Upgrade impyla to the latest version

2021-08-19 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17795 )

Change subject: IMPALA-10874: Upgrade impyla to the latest version
..


Patch Set 1: Code-Review+1

Looks good to me. @Attila will the version changes in requirements.txt for 
dependent libs like sasl or bitarray affect other dependencies?


--
To view, visit http://gerrit.cloudera.org:8080/17795
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I990e5cdde4e98d6ab3581fe48f53a5d0590ce492
Gerrit-Change-Number: 17795
Gerrit-PatchSet: 1
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Attila Jeges 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Thu, 19 Aug 2021 23:05:16 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10720: Add versioning to admission heartbeats

2021-08-16 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17524 )

Change subject: IMPALA-10720: Add versioning to admission heartbeats
..


Patch Set 3: Code-Review+2

Carrying forward +2 from Andrew


--
To view, visit http://gerrit.cloudera.org:8080/17524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1338211dc0bca67f8fde93a1b05c0780be583d5d
Gerrit-Change-Number: 17524
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Tue, 17 Aug 2021 00:59:11 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10720: Add versioning to admission heartbeats

2021-08-16 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17524 )

Change subject: IMPALA-10720: Add versioning to admission heartbeats
..


Patch Set 3:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/17524/2/be/src/scheduling/admission-control-service.h
File be/src/scheduling/admission-control-service.h:

http://gerrit.cloudera.org:8080/#/c/17524/2/be/src/scheduling/admission-control-service.h@139
PS2, Line 139:   /// from it. NOTE: Can contain stale data from coordinators 
that have restarted.
> Nit: "from"
Done


http://gerrit.cloudera.org:8080/#/c/17524/2/common/protobuf/admission_control_service.proto
File common/protobuf/admission_control_service.proto:

http://gerrit.cloudera.org:8080/#/c/17524/2/common/protobuf/admission_control_service.proto@274
PS2, Line 274:   /// resources. Stale heartbeat messages are ignored.
> Nit: "heartbeat"
Done



--
To view, visit http://gerrit.cloudera.org:8080/17524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1338211dc0bca67f8fde93a1b05c0780be583d5d
Gerrit-Change-Number: 17524
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Tue, 17 Aug 2021 00:59:02 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10720: Add versioning to admission heartbeats

2021-08-16 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/17524

to look at the new patch set (#3).

Change subject: IMPALA-10720: Add versioning to admission heartbeats
..

IMPALA-10720: Add versioning to admission heartbeats

This patch adds version numbers to admission heartbeats which
allows the admission service to ignore stale (out of order)
heartbeats from coordinators.

Change-Id: I1338211dc0bca67f8fde93a1b05c0780be583d5d
---
M be/src/scheduling/admission-control-service.cc
M be/src/scheduling/admission-control-service.h
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
M common/protobuf/admission_control_service.proto
5 files changed, 39 insertions(+), 2 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/24/17524/3
--
To view, visit http://gerrit.cloudera.org:8080/17524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I1338211dc0bca67f8fde93a1b05c0780be583d5d
Gerrit-Change-Number: 17524
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-10783: Fixed flakiness in run and verify query cancellation test

2021-08-16 Thread Bikramjeet Vig (Code Review)
Hello Csaba Ringhofer,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/17768

to look at the new patch set (#2).

Change subject: IMPALA-10783: Fixed flakiness in 
run_and_verify_query_cancellation_test
..

IMPALA-10783: Fixed flakiness in run_and_verify_query_cancellation_test

The issue was that after the impala-shell is started in a seperate
process and an error is encountered then the process lingers on
and a long running query can hold on to resources and potentially
affect other tests running on the impala cluster.
This patch just makes sure that the impala-shell process is killed
regardless of any errors encountered.

Change-Id: I9f6d22d639921051cde5675fae1845bedb61c8cc
---
M tests/shell/test_shell_commandline.py
1 file changed, 8 insertions(+), 6 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/68/17768/2
--
To view, visit http://gerrit.cloudera.org:8080/17768
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I9f6d22d639921051cde5675fae1845bedb61c8cc
Gerrit-Change-Number: 17768
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 


[Impala-ASF-CR] IMPALA-10783: Fixed flakiness in run and verify query cancellation test

2021-08-16 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17768 )

Change subject: IMPALA-10783: Fixed flakiness in 
run_and_verify_query_cancellation_test
..


Patch Set 2:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/17768/1/tests/shell/test_shell_commandline.py
File tests/shell/test_shell_commandline.py:

http://gerrit.cloudera.org:8080/#/c/17768/1/tests/shell/test_shell_commandline.py@387
PS1, Line 387: try:
> Having this line in the try block doesn't seem useful to me, as the finally
Done


http://gerrit.cloudera.org:8080/#/c/17768/1/tests/shell/test_shell_commandline.py@423
PS1, Line 423:
> Same as line 387
Done



--
To view, visit http://gerrit.cloudera.org:8080/17768
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9f6d22d639921051cde5675fae1845bedb61c8cc
Gerrit-Change-Number: 17768
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Comment-Date: Tue, 17 Aug 2021 00:55:42 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10846: Skip Authentication for connection with trusted auth header

2021-08-09 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17759 )

Change subject: IMPALA-10846: Skip Authentication for connection with trusted 
auth header
..


Patch Set 1:

(1 comment)

Looks good, Andrew's covered most of the stuff, just one more nit from my side

http://gerrit.cloudera.org:8080/#/c/17759/1/be/src/util/webserver.cc
File be/src/util/webserver.cc:

http://gerrit.cloudera.org:8080/#/c/17759/1/be/src/util/webserver.cc@907
PS1, Line 907: bool Webserver::HandleTrustedAuthHeader(
 : struct sq_connection* connection, struct sq_request_info* 
request_info) {
 :   string err_msg;
 :   if (!GetUsernameFromAuthHeader(connection, request_info, 
err_msg)) {
 : LOG(ERROR) << "Found trusted auth header but " << err_msg;
 : return false;
 :   }
 :   return true;
 : }
nit: you can just get rid of HandleTrustedAuthHeader if you are only calling 
GetUsernameFromAuthHeader.



--
To view, visit http://gerrit.cloudera.org:8080/17759
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7103012bc9cf9dfdc9c184af3a281526d8127a31
Gerrit-Change-Number: 17759
Gerrit-PatchSet: 1
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Tue, 10 Aug 2021 00:30:12 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8762: Track host level admission stats across all coordinators

2021-07-27 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17683 )

Change subject: IMPALA-8762: Track host level admission stats across all 
coordinators
..


Patch Set 3: Code-Review+2

Carrying over Andrew's +2


--
To view, visit http://gerrit.cloudera.org:8080/17683
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2946832e0a89b077d0f3bec755e4672be2088243
Gerrit-Change-Number: 17683
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Tue, 27 Jul 2021 23:22:48 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8762: Track host level admission stats across all coordinators

2021-07-27 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17683 )

Change subject: IMPALA-8762: Track host level admission stats across all 
coordinators
..


Patch Set 3:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/17683/2/be/src/scheduling/admission-controller.cc
File be/src/scheduling/admission-controller.cc:

http://gerrit.cloudera.org:8080/#/c/17683/2/be/src/scheduling/admission-controller.cc@257
PS2, Line 257:   << "All admission topic key prefixes should be of the same 
size";
> Nit: "All admission topic keys" is better, as this may make it clearer for
Done


http://gerrit.cloudera.org:8080/#/c/17683/2/tests/custom_cluster/test_executor_groups.py
File tests/custom_cluster/test_executor_groups.py:

http://gerrit.cloudera.org:8080/#/c/17683/2/tests/custom_cluster/test_executor_groups.py@573
PS2, Line 573: # identical but this will at least test that code path as a 
sanity check.
> Nit: "at least" "code path"
Done



--
To view, visit http://gerrit.cloudera.org:8080/17683
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2946832e0a89b077d0f3bec755e4672be2088243
Gerrit-Change-Number: 17683
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Tue, 27 Jul 2021 23:22:30 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8762: Track host level admission stats across all coordinators

2021-07-27 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Joe McDonnell, Wenzhe Zhou, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/17683

to look at the new patch set (#3).

Change subject: IMPALA-8762: Track host level admission stats across all 
coordinators
..

IMPALA-8762: Track host level admission stats across all coordinators

This patch adds the ability to share the per-host stats for locally
admitted queries across all coordinators. This helps to get a more
consolidated view of the cluster for stats like slots_in_use and
mem_admitted when making local admission decisions.

Testing:
Added e2e py test

Change-Id: I2946832e0a89b077d0f3bec755e4672be2088243
---
M be/src/scheduling/admission-controller.cc
M be/src/scheduling/admission-controller.h
M common/thrift/StatestoreService.thrift
M common/thrift/metrics.json
M tests/common/custom_cluster_test_suite.py
M tests/custom_cluster/test_catalog_wait.py
M tests/custom_cluster/test_executor_groups.py
M tests/custom_cluster/test_runtime_profile.py
M tests/custom_cluster/test_scratch_disk.py
9 files changed, 261 insertions(+), 86 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/83/17683/3
--
To view, visit http://gerrit.cloudera.org:8080/17683
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I2946832e0a89b077d0f3bec755e4672be2088243
Gerrit-Change-Number: 17683
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-8762: Track host level admission stats across all coordinators

2021-07-20 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17683 )

Change subject: IMPALA-8762: Track host level admission stats across all 
coordinators
..


Patch Set 2:

(8 comments)

http://gerrit.cloudera.org:8080/#/c/17683/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/17683/1//COMMIT_MSG@7
PS1, Line 7: coordinators
> nit: coordinators
Done


http://gerrit.cloudera.org:8080/#/c/17683/1/be/src/scheduling/admission-controller.h
File be/src/scheduling/admission-controller.h:

http://gerrit.cloudera.org:8080/#/c/17683/1/be/src/scheduling/admission-controller.h@440
PS1, Line 440:   // This maps a backends's id(host/port id) to its host level 
statistics which are used
> Nit: Its not a struct any more
Done


http://gerrit.cloudera.org:8080/#/c/17683/1/be/src/scheduling/admission-controller.h@442
PS1, Line 442: kends.
> nit: what's the map key? host_id?
Done


http://gerrit.cloudera.org:8080/#/c/17683/1/be/src/scheduling/admission-controller.cc
File be/src/scheduling/admission-controller.cc:

http://gerrit.cloudera.org:8080/#/c/17683/1/be/src/scheduling/admission-controller.cc@75
PS1, Line 75: // "". "!" is used 
because the backend id
> Nit: is it clearer to say it is of the form?
Done


http://gerrit.cloudera.org:8080/#/c/17683/1/be/src/scheduling/admission-controller.cc@244
PS1, Line 244: // Parses the topic key to separate the prefix that helps 
recognize the kind of update
> Nit: separate
Done


http://gerrit.cloudera.org:8080/#/c/17683/1/be/src/scheduling/admission-controller.cc@245
PS1, Line 245: // received.
> Nit: received
Done


http://gerrit.cloudera.org:8080/#/c/17683/1/be/src/scheduling/admission-controller.cc@1652
PS1, Line 1652:   if (topic_backend_id == host_id_) continue;
> This is skipping updates about the current host?
this skips the update that was added by the current host in its previous 
statestore update, since the current host's local version is more up to date. 
Similar to what we do in L1635 above


http://gerrit.cloudera.org:8080/#/c/17683/1/tests/custom_cluster/test_executor_groups.py
File tests/custom_cluster/test_executor_groups.py:

http://gerrit.cloudera.org:8080/#/c/17683/1/tests/custom_cluster/test_executor_groups.py@96
PS1, Line 96: self.num_impalads = 2
> why is this 2?
this is a mis-nomer, it basically refers to the num of impalad instances that 
self._start_impala_cluster will look for. See L66 and L80.
I'll update the naming of the variables to represent their true meaning.



--
To view, visit http://gerrit.cloudera.org:8080/17683
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2946832e0a89b077d0f3bec755e4672be2088243
Gerrit-Change-Number: 17683
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Tue, 20 Jul 2021 20:50:27 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8762: Track host level admission stats across all coordinators

2021-07-20 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Joe McDonnell, Wenzhe Zhou, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/17683

to look at the new patch set (#2).

Change subject: IMPALA-8762: Track host level admission stats across all 
coordinators
..

IMPALA-8762: Track host level admission stats across all coordinators

This patch adds the ability to share the per-host stats for locally
admitted queries across all coordinators. This helps to get a more
consolidated view of the cluster for stats like slots_in_use and
mem_admitted when making local admission decisions.

Testing:
Added e2e py test

Change-Id: I2946832e0a89b077d0f3bec755e4672be2088243
---
M be/src/scheduling/admission-controller.cc
M be/src/scheduling/admission-controller.h
M common/thrift/StatestoreService.thrift
M common/thrift/metrics.json
M tests/common/custom_cluster_test_suite.py
M tests/custom_cluster/test_catalog_wait.py
M tests/custom_cluster/test_executor_groups.py
M tests/custom_cluster/test_runtime_profile.py
M tests/custom_cluster/test_scratch_disk.py
9 files changed, 261 insertions(+), 86 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/83/17683/2
--
To view, visit http://gerrit.cloudera.org:8080/17683
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I2946832e0a89b077d0f3bec755e4672be2088243
Gerrit-Change-Number: 17683
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-8762: Track host level admission stats across all cooridnators

2021-07-13 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/17683


Change subject: IMPALA-8762: Track host level admission stats across all 
cooridnators
..

IMPALA-8762: Track host level admission stats across all cooridnators

This patch adds the ability to share the per-host stats for locally
admitted queries across all cordinators. This helps to get a more
consolidated view of the cluster for stats like slots_in_use and
mem_admitted when making local admission decisions.

Testing:
Added e2e py test

Change-Id: I2946832e0a89b077d0f3bec755e4672be2088243
---
M be/src/scheduling/admission-controller.cc
M be/src/scheduling/admission-controller.h
M common/thrift/StatestoreService.thrift
M common/thrift/metrics.json
M tests/custom_cluster/test_executor_groups.py
5 files changed, 242 insertions(+), 68 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/83/17683/1
--
To view, visit http://gerrit.cloudera.org:8080/17683
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I2946832e0a89b077d0f3bec755e4672be2088243
Gerrit-Change-Number: 17683
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 


[Impala-ASF-CR] IMPALA-10784: Add support for retaining cookies in impala-shell

2021-07-12 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17667 )

Change subject: IMPALA-10784: Add support for retaining cookies in impala-shell
..


Patch Set 5: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/17667
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I193422d5ec891886a522d82ecb0e9d974132ff2a
Gerrit-Change-Number: 17667
Gerrit-PatchSet: 5
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Mon, 12 Jul 2021 16:15:08 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10784: Add support for retaining cookies in impala-shell

2021-07-09 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17667 )

Change subject: IMPALA-10784: Add support for retaining cookies in impala-shell
..


Patch Set 3:

(4 comments)

Looks good, just a few nits

http://gerrit.cloudera.org:8080/#/c/17667/3/shell/ImpalaHttpClient.py
File shell/ImpalaHttpClient.py:

http://gerrit.cloudera.org:8080/#/c/17667/3/shell/ImpalaHttpClient.py@66
PS3, Line 66: http_cookie_names is comma-separated list of cookie names which 
are used to specify
: the cookie-based authentication or session management
nit: http_cookie_names is used to specify a comma-separated list of possible 
cookie names used for cookie-based authentication  or session management. If a 
cookie with one of these names is returned in an http response by the server or 
an intermediate proxy then it will be included in each subsequent request for 
the same connection.


http://gerrit.cloudera.org:8080/#/c/17667/3/shell/ImpalaHttpClient.py@170
PS3, Line 170: updateHttpCookie
nit: updateHttpCookies


http://gerrit.cloudera.org:8080/#/c/17667/3/shell/ImpalaHttpClient.py@184
PS3, Line 184: self.__http_cookie_dict[cn] = Cookie(cookie=None, 
expiry_time=None)
nit: can we just remove the entry from __http_cookie_dict?


http://gerrit.cloudera.org:8080/#/c/17667/3/shell/ImpalaHttpClient.py@194
PS3, Line 194: addHttpCookie
nit: addHttpCookiesToRequestHeaders



--
To view, visit http://gerrit.cloudera.org:8080/17667
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I193422d5ec891886a522d82ecb0e9d974132ff2a
Gerrit-Change-Number: 17667
Gerrit-PatchSet: 3
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 09 Jul 2021 19:39:14 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10557: Support Kudu's multi-row transaction

2021-06-24 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17553 )

Change subject: IMPALA-10557: Support Kudu's multi-row transaction
..


Patch Set 21: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/17553
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I876ada48991afdff5d61b5d6a0417571aba7cb34
Gerrit-Change-Number: 17553
Gerrit-PatchSet: 21
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Alexey Serbin 
Gerrit-Reviewer: Andrew Wong 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Thu, 24 Jun 2021 14:32:13 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10557: Support Kudu's multi-row transaction

2021-06-23 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17553 )

Change subject: IMPALA-10557: Support Kudu's multi-row transaction
..


Patch Set 20: Code-Review+2

(2 comments)

http://gerrit.cloudera.org:8080/#/c/17553/20/common/thrift/Query.thrift
File common/thrift/Query.thrift:

http://gerrit.cloudera.org:8080/#/c/17553/20/common/thrift/Query.thrift@684
PS20, Line 684: // Stores the transaction id if the query is transactional.
nit: update to mention that this is only used for HIVE ACID


http://gerrit.cloudera.org:8080/#/c/17553/20/tests/custom_cluster/test_kudu.py
File tests/custom_cluster/test_kudu.py:

http://gerrit.cloudera.org:8080/#/c/17553/20/tests/custom_cluster/test_kudu.py@540
PS20, Line 540: patch
nit: batch



--
To view, visit http://gerrit.cloudera.org:8080/17553
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I876ada48991afdff5d61b5d6a0417571aba7cb34
Gerrit-Change-Number: 17553
Gerrit-PatchSet: 20
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Alexey Serbin 
Gerrit-Reviewer: Andrew Wong 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Thu, 24 Jun 2021 00:38:40 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10762: ASAN tests fail with use-after-poison in HdfsParquetScanner::FindSkipRangesForPagesWithMinMaxFilters

2021-06-23 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17630 )

Change subject: IMPALA-10762: ASAN tests fail with use-after-poison in 
HdfsParquetScanner::FindSkipRangesForPagesWithMinMaxFilters
..


Patch Set 1: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/17630
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I97c43571d771805834577a0f03d476e24c82b82b
Gerrit-Change-Number: 17630
Gerrit-PatchSet: 1
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 23 Jun 2021 20:18:21 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10720: Add versioning to admission heartbeats

2021-06-21 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17524 )

Change subject: IMPALA-10720: Add versioning to admission heartbeats
..


Patch Set 2:

(4 comments)

http://gerrit.cloudera.org:8080/#/c/17524/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/17524/1//COMMIT_MSG@10
PS1, Line 10: allows the admission service to ignore stale (out of order)
> Maybe clarify that stale here means out of order?
Done


http://gerrit.cloudera.org:8080/#/c/17524/1/be/src/scheduling/admission-control-service.h
File be/src/scheduling/admission-control-service.h:

http://gerrit.cloudera.org:8080/#/c/17524/1/be/src/scheduling/admission-control-service.h@138
PS1, Line 138:   /// Maps from coordinator ID to the latest heartbeat version 
number that was processed
> Nit: "coordinator"
Done


http://gerrit.cloudera.org:8080/#/c/17524/1/be/src/scheduling/admission-control-service.h@140
PS1, Line 140:   /// TODO: Leverage IMPALA-9155 to add coord_id the first time 
a coord sends a heartbeat
> I think this map will contains stale data form coordinators that have resta
Added a TODO to add a mechanism to delete ids when coords fail


http://gerrit.cloudera.org:8080/#/c/17524/1/be/src/scheduling/admission-control-service.cc
File be/src/scheduling/admission-control-service.cc:

http://gerrit.cloudera.org:8080/#/c/17524/1/be/src/scheduling/admission-control-service.cc@276
PS1, Line 276: VLOG(1) << "Stale heartbeat received for coord_id: "<< 
req->host_id();
> Maybe should also say that this may be expected if a coordinator restarts.
if a coord restarts it'll receive a new Id when it registers with the 
statestore. So this case can only happen due to network issues.



--
To view, visit http://gerrit.cloudera.org:8080/17524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1338211dc0bca67f8fde93a1b05c0780be583d5d
Gerrit-Change-Number: 17524
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Mon, 21 Jun 2021 22:18:18 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10720: Add versioning to admission heartbeats

2021-06-21 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/17524

to look at the new patch set (#2).

Change subject: IMPALA-10720: Add versioning to admission heartbeats
..

IMPALA-10720: Add versioning to admission heartbeats

This patch adds version numbers to admission heartbeats which
allows the admission service to ignore stale (out of order)
heartbeats from coordinators.

Change-Id: I1338211dc0bca67f8fde93a1b05c0780be583d5d
---
M be/src/scheduling/admission-control-service.cc
M be/src/scheduling/admission-control-service.h
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
M common/protobuf/admission_control_service.proto
5 files changed, 39 insertions(+), 2 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/24/17524/2
--
To view, visit http://gerrit.cloudera.org:8080/17524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I1338211dc0bca67f8fde93a1b05c0780be583d5d
Gerrit-Change-Number: 17524
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] WIP IMPALA-10557: Support Kudu's multi-row transaction

2021-06-09 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17553 )

Change subject: WIP IMPALA-10557: Support Kudu's multi-row transaction
..


Patch Set 5:

(9 comments)

Looks good, just adding a few more nits

http://gerrit.cloudera.org:8080/#/c/17553/5/be/src/exec/kudu-table-sink.h
File be/src/exec/kudu-table-sink.h:

http://gerrit.cloudera.org:8080/#/c/17553/5/be/src/exec/kudu-table-sink.h@107
PS5, Line 107:   /// Returns TRUE if it's in Kudu transaction.
nit: add: valid only after Open() succeeds


http://gerrit.cloudera.org:8080/#/c/17553/5/be/src/exec/kudu-table-sink.h@123
PS5, Line 123: std::string kudu_txn_token_
> nit: is it possible to add 'const' specifier and initialize the token in th
Deserialize can return an error status which we cant propagate during 
initialization, therefore anything like this is usually deferred to the Open() 
call


http://gerrit.cloudera.org:8080/#/c/17553/5/be/src/service/client-request-state.cc
File be/src/service/client-request-state.cc:

http://gerrit.cloudera.org:8080/#/c/17553/5/be/src/service/client-request-state.cc@1323
PS5, Line 1323: if (InTransaction()) {
  :   AbortTransaction();
  : } else if (InKuduTransaction()) {
  :   AbortKuduTransaction();
  : }
  : LOG(ERROR) << "ERROR Finalizing DML: " << 
status.GetDetail();
  : return status;
  :   }
  :   if (InTransaction()) {
  : // UpdateCatalog() succeeded and already committed the 
transaction for us.
  : int64_t txn_id = GetTransactionId();
  : if (!frontend_->UnregisterTransaction(txn_id).ok()) {
  :   LOG(ERROR) << Substitute("Failed to unregister 
transaction $0", txn_id);
  : }
  : ClearTransactionState();
  : query_events_->MarkEvent("Transaction committed");
  :   } else if (InKuduTransaction()) {
  : CommitKuduTransaction();
  :   }
wondering if we can consolidate the methods for Transactions and let them take 
care of doing the right thing for hive and kudu. will make it easier to read.

eg L1331-L1341 can be taken care of by FinalizeTransaction
L1323-L1327 can be taken care of by a single AbortTransaction()


http://gerrit.cloudera.org:8080/#/c/17553/5/common/thrift/ImpalaService.thrift
File common/thrift/ImpalaService.thrift:

http://gerrit.cloudera.org:8080/#/c/17553/5/common/thrift/ImpalaService.thrift@673
PS5, Line 673: KUDU_TRANSACTION
nit: how about ENABLE_KUDU_TRANSACTION


http://gerrit.cloudera.org:8080/#/c/17553/5/fe/src/main/java/org/apache/impala/analysis/InsertStmt.java
File fe/src/main/java/org/apache/impala/analysis/InsertStmt.java:

http://gerrit.cloudera.org:8080/#/c/17553/5/fe/src/main/java/org/apache/impala/analysis/InsertStmt.java@1124
PS5, Line 1124: setTransactionToken
nit: maybe call it setKuduTransactionToken


http://gerrit.cloudera.org:8080/#/c/17553/5/fe/src/main/java/org/apache/impala/planner/TableSink.java
File fe/src/main/java/org/apache/impala/planner/TableSink.java:

http://gerrit.cloudera.org:8080/#/c/17553/5/fe/src/main/java/org/apache/impala/planner/TableSink.java@110
PS5, Line 110:* Same as above, plus it takes an ACID write id in parameter 
'writeId'.
nit: update comment to mention txnToken


http://gerrit.cloudera.org:8080/#/c/17553/5/fe/src/main/java/org/apache/impala/planner/TableSink.java@116
PS5, Line 116: txnToken
nit: maybe name it kuduTxnToken to explicitly specify that this is kudu specific


http://gerrit.cloudera.org:8080/#/c/17553/5/fe/src/main/java/org/apache/impala/service/Frontend.java
File fe/src/main/java/org/apache/impala/service/Frontend.java:

http://gerrit.cloudera.org:8080/#/c/17553/5/fe/src/main/java/org/apache/impala/service/Frontend.java@1732
PS5, Line 1732: TransactionException
update comment for TransactionException to include kudu cases too.


http://gerrit.cloudera.org:8080/#/c/17553/5/fe/src/main/java/org/apache/impala/service/Frontend.java@2242
PS5, Line 2242: txn.close();
nit: is close() something that can be put in a finally block. Like should it 
always be called even it rollback or commit fails? I understand that it is 
Auto-closeable but if we are calling it explicitly here and not using the 
try-with-resource code construct then might be good to just put it in the 
finally block.



--
To view, visit http://gerrit.cloudera.org:8080/17553
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I876ada48991afdff5d61b5d6a0417571aba7cb34
Gerrit-Change-Number: 17553
Gerrit-PatchSet: 5
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Alexey Serbin 
Gerrit-Reviewer: Andrew Wong 
Gerrit-Revie

[Impala-ASF-CR] IMPALA-10720: Add versioning to admission heartbeats

2021-05-27 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/17524


Change subject: IMPALA-10720: Add versioning to admission heartbeats
..

IMPALA-10720: Add versioning to admission heartbeats

This patch adds version numbers to admission heartbeats which
allows the admission service to ignore stale heartbeats from
coordinators.

Change-Id: I1338211dc0bca67f8fde93a1b05c0780be583d5d
---
M be/src/scheduling/admission-control-service.cc
M be/src/scheduling/admission-control-service.h
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
M common/protobuf/admission_control_service.proto
5 files changed, 37 insertions(+), 2 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/24/17524/1
--
To view, visit http://gerrit.cloudera.org:8080/17524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I1338211dc0bca67f8fde93a1b05c0780be583d5d
Gerrit-Change-Number: 17524
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 


[Impala-ASF-CR] IMPALA-9155: Add recovery mechanism to admission service

2021-04-22 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/17332


Change subject: IMPALA-9155: Add recovery mechanism to admission service
..

IMPALA-9155: Add recovery mechanism to admission service

Major changes:
- Leverages the admission heartbeat mechanism to signal the
coordinator to send its complete admission state
- No RPCs are serviced by a coordinator unless it sends its complete
admission state. This is to prevent making admission decisions till
admission service has built its view of the cluster
- The complete admission state consists of the states of all queries
that have successfully been admitted, that is, received a valid
schedule from the admission controller and have marked its admission
as complete (for remote admission it means its pending admit status
has transitioned from true to false)
- This helps prevent sending incomplete/inconsistent state to the
admission controller
- Queries that have not started admission get a chance to send their
request to the new service
- Queries that are queued restart the admission process by sending
the request again. This re-try is now also marked in the query profile
- Other RPCs like ReleaseBackend, ReleaseQuery, CancelQuery that
don't get serviced (till initial admission state is sent) can result
in inconsistent state. This state will be rectified in the admission
heartbeats
- AdmitQuery and GetQueryStatus just retry again if they notice a
network failure(assuming admissiond might be down/restarting) or
received the error message that they cannot be serviced yet
admissiond is waiting on initial state from this coordinator)

Limitations:
- Rebuilding the state can not ensure that queued queries will
maintain their spot in the queue.
- Queries can be admitted before all coordinators get a chance to
send their state. This can result in a brief period of over-admission
We cannot rely completely on the statestore membership update and
wait for all coordinators there to send admission state because
that membership is also dynamic which makes it difficult to decide
when to assume that the admission state is complete.

Testing:
- Added end to end tests

Change-Id: I8ad3ef9b9e2496c484833d6326ce914c851e02fd
---
M be/src/runtime/coordinator-backend-resource-state.cc
M be/src/runtime/coordinator-backend-state.h
M be/src/runtime/coordinator.cc
M be/src/runtime/coordinator.h
M be/src/scheduling/admission-control-client.cc
M be/src/scheduling/admission-control-client.h
M be/src/scheduling/admission-control-service.cc
M be/src/scheduling/admission-control-service.h
M be/src/scheduling/admission-controller-test.cc
M be/src/scheduling/admission-controller.cc
M be/src/scheduling/admission-controller.h
M be/src/scheduling/local-admission-control-client.cc
M be/src/scheduling/local-admission-control-client.h
M be/src/scheduling/remote-admission-control-client.cc
M be/src/scheduling/remote-admission-control-client.h
M be/src/scheduling/schedule-state.cc
M be/src/service/client-request-state.cc
M be/src/service/client-request-state.h
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
M common/protobuf/admission_control_service.proto
M common/thrift/generate_error_codes.py
M tests/custom_cluster/test_admission_controller.py
23 files changed, 674 insertions(+), 65 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/32/17332/1
--
To view, visit http://gerrit.cloudera.org:8080/17332
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I8ad3ef9b9e2496c484833d6326ce914c851e02fd
Gerrit-Change-Number: 17332
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 


[Impala-ASF-CR] IMPALA-10596: De-flake TestAdmissionControllerStress

2021-04-05 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/17272


Change subject: IMPALA-10596: De-flake TestAdmissionControllerStress
..

IMPALA-10596: De-flake TestAdmissionControllerStress

Currently some tests in TestAdmissionControllerStress fail because
they rely on queries holding onto resources on backends till they
are explicitly closed. This was done by keeping the query alive by
fetching very limited rows in intervals. After results spooling was
turned on by default, the queries started to finish early and
released their resources on other backends which let queued queries
to get admitted. This patch turns off result spooling for the queries
to avoid this situation.

Testing:
Ran test in a loop on my local dev machine.

Change-Id: I1594a82f507db8dc094d4441dd8739d037f974ff
---
M tests/custom_cluster/test_admission_controller.py
1 file changed, 4 insertions(+), 0 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/72/17272/1
--
To view, visit http://gerrit.cloudera.org:8080/17272
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I1594a82f507db8dc094d4441dd8739d037f974ff
Gerrit-Change-Number: 17272
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 


[Impala-ASF-CR] IMPALA-10596: De-flake teardown in TestAdmissionControllerStress

2021-03-31 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/17256


Change subject: IMPALA-10596: De-flake teardown in TestAdmissionControllerStress
..

IMPALA-10596: De-flake teardown in TestAdmissionControllerStress

Currently if the threads running queries in
TestAdmissionControllerStress hit an error, they close their client
which ultimately closes the query that it was running. If teardown()
runs after the client is closed, it tries to cancel the query that
the thread was running and hits an exception trying to cancel an
already closed query. This results in the pytest throws the exception
encountered in teardown() instead of the original exception that
caused the test to fail in the first place. This patch fixes this by
removing the query handle from the thread if the client is closed.

Testing:
Simulated hitting an error condition in the main thread that
initially triggered this condition.

Change-Id: I8aa8315d9f598ba80d13cd2091e3cc743c64ba77
---
M tests/custom_cluster/test_admission_controller.py
1 file changed, 7 insertions(+), 1 deletion(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/56/17256/1
--
To view, visit http://gerrit.cloudera.org:8080/17256
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I8aa8315d9f598ba80d13cd2091e3cc743c64ba77
Gerrit-Change-Number: 17256
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 


[Impala-ASF-CR] IMPALA-10397: De-flake test single workload

2021-03-29 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/17239


Change subject: IMPALA-10397: De-flake test_single_workload
..

IMPALA-10397: De-flake test_single_workload

This patch removes a flaky part of the test that relies on query
completion rate. Since we are already verifying that number of
healthy executor groups increases, this additional check is not
adding much to the test.

Change-Id: I6f75afdbe676d9dd6922b6ba8aa1919daa161947
---
M tests/custom_cluster/test_auto_scaling.py
1 file changed, 3 insertions(+), 23 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/39/17239/1
--
To view, visit http://gerrit.cloudera.org:8080/17239
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I6f75afdbe676d9dd6922b6ba8aa1919daa161947
Gerrit-Change-Number: 17239
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 


[Impala-ASF-CR] IMPALA-10397: Fix test single workload

2021-03-24 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17218 )

Change subject: IMPALA-10397: Fix test_single_workload
..


Patch Set 2:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/17218/1/tests/custom_cluster/test_auto_scaling.py
File tests/custom_cluster/test_auto_scaling.py:

http://gerrit.cloudera.org:8080/#/c/17218/1/tests/custom_cluster/test_auto_scaling.py@49
PS1, Line 49:   # two comparisons and should take around 2 second to complete.
> I think this will now be 2 seconds?
Done



--
To view, visit http://gerrit.cloudera.org:8080/17218
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ide3c7fb4509ce9a797b4cbdd141b2a319b923d4e
Gerrit-Change-Number: 17218
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Thu, 25 Mar 2021 00:15:43 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10397: Fix test single workload

2021-03-24 Thread Bikramjeet Vig (Code Review)
Hello Andrew Sherman, Abhishek Rawat, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/17218

to look at the new patch set (#2).

Change subject: IMPALA-10397: Fix test_single_workload
..

IMPALA-10397: Fix test_single_workload

The logs on failed runs indicated that the autoscaler never started
another cluster. This can only happen if it never notices a queued
query which is possible since this test was only failing in release
builds. This patch increases the runtime of the sample query to
make execution more predictable.

Testing:
Looped on my local on a release build

Change-Id: Ide3c7fb4509ce9a797b4cbdd141b2a319b923d4e
---
M tests/custom_cluster/test_auto_scaling.py
1 file changed, 6 insertions(+), 7 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/18/17218/2
--
To view, visit http://gerrit.cloudera.org:8080/17218
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ide3c7fb4509ce9a797b4cbdd141b2a319b923d4e
Gerrit-Change-Number: 17218
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-10397: Fix test single workload

2021-03-23 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/17218


Change subject: IMPALA-10397: Fix test_single_workload
..

IMPALA-10397: Fix test_single_workload

The logs on failed runs indicated that the autoscaler never started
another cluster. This can only happen if it never notices a queued
query which is possible since this test was only failing in release
builds. This patch increases the runtime of the sample query to
make execution more predictable.

Testing:
Looped on my local on a release build

Change-Id: Ide3c7fb4509ce9a797b4cbdd141b2a319b923d4e
---
M tests/custom_cluster/test_auto_scaling.py
1 file changed, 5 insertions(+), 6 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/18/17218/1
--
To view, visit http://gerrit.cloudera.org:8080/17218
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ide3c7fb4509ce9a797b4cbdd141b2a319b923d4e
Gerrit-Change-Number: 17218
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 


[Impala-ASF-CR] IMPALA-10594: Handle failed coordinators in admissiond

2021-03-22 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17209 )

Change subject: IMPALA-10594: Handle failed coordinators in admissiond
..


Patch Set 2: Code-Review+2

(1 comment)

http://gerrit.cloudera.org:8080/#/c/17209/2//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/17209/2//COMMIT_MSG@13
PS2, Line 13:
> There won't be any overadmission because the executors also monitor the sta
Got it. Thanks for the explanation.



--
To view, visit http://gerrit.cloudera.org:8080/17209
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I883f323bb765680ef24b3c3f51fb209dea15f0b0
Gerrit-Change-Number: 17209
Gerrit-PatchSet: 2
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Mon, 22 Mar 2021 20:08:54 +
Gerrit-HasComments: Yes


  1   2   3   4   5   6   7   8   9   10   >