[Impala-ASF-CR] IMPALA-5552: Add support for authorized proxy groups

2018-06-05 Thread Bharath Vissapragada (Code Review)
Bharath Vissapragada has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10510 )

Change subject: IMPALA-5552: Add support for authorized proxy groups
..


Patch Set 8:

(13 comments)

Just came across this CR. I have some comments. Will look at tests in the next 
round.

http://gerrit.cloudera.org:8080/#/c/10510/8/be/src/service/impala-server.h
File be/src/service/impala-server.h:

http://gerrit.cloudera.org:8080/#/c/10510/8/be/src/service/impala-server.h@819
PS8, Line 819:   /// Adds authorized proxy config into the given map.
Maybe a little more verbose? The context is not clear by just reading this 
comment

/// Utility to parse --foo and --bar and populate .


http://gerrit.cloudera.org:8080/#/c/10510/8/be/src/service/impala-server.h@820
PS8, Line 820: AddAuthorizedProxyConfig
nit: How about we call it PopulateAuthorizedProxyConfig()?


http://gerrit.cloudera.org:8080/#/c/10510/8/be/src/service/impala-server.cc
File be/src/service/impala-server.cc:

http://gerrit.cloudera.org:8080/#/c/10510/8/be/src/service/impala-server.cc@320
PS8, Line 320: [](const string& config) {
 :   return Substitute("Invalid proxy user configuration. 
No mapping value "
 :   "specified for the proxy user. For more 
information review usage of the "
 :   "--authorized_proxy_user_config flag: $0", config);
 : }
I think using 'Status' as the return of AddAuthorizedProxyConfig() is an Impala 
way of doing this (caller checks for !status.ok()). I don't see why we should 
unnecessarily complicate this function signature with a lambda. thoughts?


http://gerrit.cloudera.org:8080/#/c/10510/8/be/src/service/impala-server.cc@401
PS8, Line 401:  const string& authorized_proxy_config,
 : const string& authorized_proxy_config_delimiter
Input params precede output vals [1]. Also we generally use output params as 
pointers.

[1] https://google.github.io/styleguide/cppguide.html#Output_Parameters


http://gerrit.cloudera.org:8080/#/c/10510/8/be/src/service/impala-server.cc@425
PS8, Line 425:   parsed_allowed_users_or_groups.begin(), 
parsed_allowed_users_or_groups.end());
nit: should we check for duplicate values instead of overwriting? ex: 
foo=bar;foo=car


http://gerrit.cloudera.org:8080/#/c/10510/8/be/src/service/impala-server.cc@1386
PS8, Line 1386:   TGetHadoopGroupsRequest req;
Could you add a comment here that breiefly summarizes what we are doing


http://gerrit.cloudera.org:8080/#/c/10510/8/be/src/service/impala-server.cc@1389
PS8, Line 1389: GetHadoopGroups
I have seen issues around "GroupMapping" (especially with LDAP etc) taking 
longer than expected in the HDFS land. I'd assume we can run into similar 
issues, especially if we are calling this for every OpenSession(). Any thoughts 
on what we can do to help debug such issues without having to take thread dumps 
and then figuring it out.


http://gerrit.cloudera.org:8080/#/c/10510/8/be/src/service/impala-server.cc@1391
PS8, Line 1391: There was an
nit: remove


http://gerrit.cloudera.org:8080/#/c/10510/8/be/src/service/impala-server.cc@1397
PS8, Line 1397:   AuthorizedProxyMap::const_iterator proxy_group =
  : authorized_proxy_group_config_.find(short_user);
  : if (proxy_group != authorized_proxy_group_config_.end()) {
Move it outside of the loop? Also can we bypass the GetHadoopGroups() if this 
fails in the first place?


http://gerrit.cloudera.org:8080/#/c/10510/8/be/src/service/impala-server.cc@1400
PS8, Line 1400: for (const string& group : proxy_group->second) {
  : if (group == "*" || group == do_as_group) return 
Status::OK();
  :   }
Why are we looping here? Isn't this a set? Could we find() in O(1) ?


http://gerrit.cloudera.org:8080/#/c/10510/8/fe/src/main/java/org/apache/impala/service/JniFrontend.java
File fe/src/main/java/org/apache/impala/service/JniFrontend.java:

http://gerrit.cloudera.org:8080/#/c/10510/8/fe/src/main/java/org/apache/impala/service/JniFrontend.java@585
PS8, Line 585: GROUPS
Is this thread safe?


http://gerrit.cloudera.org:8080/#/c/10510/8/fe/src/main/java/org/apache/impala/service/JniFrontend.java@641
PS8, Line 641: throw new InternalException(e.getMessage());
Can you LOG the full exception here. Helps debug issues with group mapping not 
working as expected.


http://gerrit.cloudera.org:8080/#/c/10510/8/fe/src/main/java/org/apache/impala/service/JniFrontend.java@672
PS8, Line 672:  if (provider instanceof ShellBasedUnixGroupsMapping ||
 : provider instanceof ShellBasedUnixGroupsNetgroupMapping) 
{
Please add comments as to why these are problematic.



--
To view, visit http://gerrit.cloudera.org:8080/10510
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: 

[Impala-ASF-CR] IMPALA-7128 (part 1) Refactor interfaces for Db, View, Table, Partition

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10611 )

Change subject: IMPALA-7128 (part 1) Refactor interfaces for Db, View, Table, 
Partition
..


Patch Set 1: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/2604/


--
To view, visit http://gerrit.cloudera.org:8080/10611
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id55f7d2e94d81e66ce720acb6315f15a89621b31
Gerrit-Change-Number: 10611
Gerrit-PatchSet: 1
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Wed, 06 Jun 2018 04:54:39 +
Gerrit-HasComments: No


[Impala-ASF-CR] Fix Indents from IMPALA-4970

2018-06-05 Thread Sailesh Mukil (Code Review)
Sailesh Mukil has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10560 )

Change subject: Fix Indents from IMPALA-4970
..


Patch Set 3: Verified+1

> It's a whitespace-only change and it compiled.

Thanks for taking care of this, Tim.


--
To view, visit http://gerrit.cloudera.org:8080/10560
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4ea53e0780cca9f791808c2a73c7f72ba27c2159
Gerrit-Change-Number: 10560
Gerrit-PatchSet: 3
Gerrit-Owner: Rahul Shivu Mahadev 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jim Apple 
Gerrit-Reviewer: Rahul Shivu Mahadev 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Wed, 06 Jun 2018 03:48:34 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7089: reenable test kudu dml reporting

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10577 )

Change subject: IMPALA-7089: reenable test_kudu_dml_reporting
..


Patch Set 2: Code-Review+2

Let's get this test coverage back now that we've fixed the root cause.


--
To view, visit http://gerrit.cloudera.org:8080/10577
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib5716f30458eb6db08f735bcbd2f79d205334930
Gerrit-Change-Number: 10577
Gerrit-PatchSet: 2
Gerrit-Owner: Thomas Marshall 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 06 Jun 2018 03:45:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7089: reenable test kudu dml reporting

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10577 )

Change subject: IMPALA-7089: reenable test_kudu_dml_reporting
..


Patch Set 2:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/2606/


--
To view, visit http://gerrit.cloudera.org:8080/10577
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib5716f30458eb6db08f735bcbd2f79d205334930
Gerrit-Change-Number: 10577
Gerrit-PatchSet: 2
Gerrit-Owner: Thomas Marshall 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 06 Jun 2018 03:45:14 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7078: Part 1: improve memory consumption of wide Avro scans

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10550 )

Change subject: IMPALA-7078: Part 1: improve memory consumption of wide Avro 
scans
..


Patch Set 11:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/2605/


--
To view, visit http://gerrit.cloudera.org:8080/10550
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iebd2600b4784fd19696c9b92eefb7d7e9db0c80b
Gerrit-Change-Number: 10550
Gerrit-PatchSet: 11
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 06 Jun 2018 03:42:21 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7110. Fix errors from error-prone

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10583 )

Change subject: IMPALA-7110. Fix errors from error-prone
..

IMPALA-7110. Fix errors from error-prone

* Fixes several cases of Preconditions.checkNotNull() which were
  ineffective. Instead, we should use Preconditions.checkState.

* Fixed a case of synchronizing on a non-final class member, which is a
  bad practice.

* Fixed a case of trying to stringify an array instead of using Joiner
  to join it to a human-readable representation.

* Fixed a couple cases where the format string in String.format didn't
  match the number of arguments passed.

* Fixed a test where PrimitiveType objects were being used to look up
  elements in a Map. This would always return null,
  so the test was ineffective.

I elected not to enable error-prone in the default maven profile since
it increases compilation time noticeably. It can be enabled by running
'mvn -Perrorprone'.

Even with this patch there are a bunch of warnings -- this only
addresses the issues that errorprone considers "errors".

Change-Id: Ie9bfadecb5b92bba4fc7921a6f87f249ed90e771
Reviewed-on: http://gerrit.cloudera.org:8080/10583
Reviewed-by: Bharath Vissapragada 
Reviewed-by: Philip Zeyliger 
Tested-by: Impala Public Jenkins 
---
M fe/pom.xml
M fe/src/main/java/org/apache/impala/analysis/CreateUdfStmt.java
M fe/src/main/java/org/apache/impala/analysis/LiteralExpr.java
M fe/src/main/java/org/apache/impala/analysis/Path.java
M fe/src/main/java/org/apache/impala/catalog/DiskIdMapper.java
M fe/src/main/java/org/apache/impala/catalog/ScalarFunction.java
M fe/src/main/java/org/apache/impala/catalog/StructType.java
M fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeExprsTest.java
M fe/src/test/java/org/apache/impala/analysis/ParserTest.java
M fe/src/test/java/org/apache/impala/hive/executor/UdfExecutorTest.java
11 files changed, 51 insertions(+), 14 deletions(-)

Approvals:
  Bharath Vissapragada: Looks good to me, but someone else must approve
  Philip Zeyliger: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10583
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Ie9bfadecb5b92bba4fc7921a6f87f249ed90e771
Gerrit-Change-Number: 10583
Gerrit-PatchSet: 3
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Todd Lipcon 


[Impala-ASF-CR] Fix Indents from IMPALA-4970

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10560 )

Change subject: Fix Indents from IMPALA-4970
..

Fix Indents from IMPALA-4970

fixing the mistake in indentation made previously

Change-Id: I4ea53e0780cca9f791808c2a73c7f72ba27c2159
Reviewed-on: http://gerrit.cloudera.org:8080/10560
Reviewed-by: Sailesh Mukil 
Tested-by: Tim Armstrong 
---
M be/src/runtime/coordinator.cc
1 file changed, 4 insertions(+), 4 deletions(-)

Approvals:
  Sailesh Mukil: Looks good to me, approved
  Tim Armstrong: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10560
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I4ea53e0780cca9f791808c2a73c7f72ba27c2159
Gerrit-Change-Number: 10560
Gerrit-PatchSet: 3
Gerrit-Owner: Rahul Shivu Mahadev 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jim Apple 
Gerrit-Reviewer: Rahul Shivu Mahadev 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 


[Impala-ASF-CR] Fix Indents from IMPALA-4970

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has removed a vote on this change.

Change subject: Fix Indents from IMPALA-4970
..


Removed Verified-1 by Impala Public Jenkins 
--
To view, visit http://gerrit.cloudera.org:8080/10560
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: deleteVote
Gerrit-Change-Id: I4ea53e0780cca9f791808c2a73c7f72ba27c2159
Gerrit-Change-Number: 10560
Gerrit-PatchSet: 2
Gerrit-Owner: Rahul Shivu Mahadev 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jim Apple 
Gerrit-Reviewer: Rahul Shivu Mahadev 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 


[Impala-ASF-CR] IMPALA-6956: deflake and logging for query expiration test

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10602 )

Change subject: IMPALA-6956: deflake and logging for query_expiration test
..

IMPALA-6956: deflake and logging for query_expiration test

There was a recent flake where the number of in-flight queries
that were executing differed from the expected number.
The reason for the false negative is that one of the queries
expired before the check for in-flight queries: it took too long
to issue the queries.

This change modifies the timeout for the expired query to not expire
so quickly. Additional logging is added to the check for in-flight
queries so that we can distinguish the case of too few queries vs.
queries that have the wrong state.

Change-Id: I01a8762d28ad920b9ec8a0b1b82469618c66768f
Reviewed-on: http://gerrit.cloudera.org:8080/10602
Reviewed-by: Tim Armstrong 
Tested-by: Impala Public Jenkins 
---
M tests/custom_cluster/test_query_expiration.py
1 file changed, 9 insertions(+), 6 deletions(-)

Approvals:
  Tim Armstrong: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10602
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I01a8762d28ad920b9ec8a0b1b82469618c66768f
Gerrit-Change-Number: 10602
Gerrit-PatchSet: 3
Gerrit-Owner: Vuk Ercegovac 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 


[Impala-ASF-CR] IMPALA-6956: deflake and logging for query expiration test

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10602 )

Change subject: IMPALA-6956: deflake and logging for query_expiration test
..


Patch Set 2: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10602
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I01a8762d28ad920b9ec8a0b1b82469618c66768f
Gerrit-Change-Number: 10602
Gerrit-PatchSet: 2
Gerrit-Owner: Vuk Ercegovac 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Wed, 06 Jun 2018 03:19:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7105. Ensure fe tests pass when running standalone

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10570 )

Change subject: IMPALA-7105. Ensure fe tests pass when running standalone
..


Patch Set 1: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10570
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I85286ba44f03526b487212a8ddadf71a9f427555
Gerrit-Change-Number: 10570
Gerrit-PatchSet: 1
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Wed, 06 Jun 2018 01:58:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7105. Ensure fe tests pass when running standalone

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10570 )

Change subject: IMPALA-7105. Ensure fe tests pass when running standalone
..

IMPALA-7105. Ensure fe tests pass when running standalone

CatalogTest and HdfsStorageDescriptorTest would fail when run
individually because they relied on various initialization that only
occurs when FeSupport is loaded. These tests weren't explicitly loading
FeSupport, but instead relying on earlier tests in 'mvn test' to do so.

Tested that, with this change, I'm able to run these tests in Eclipse
and they pass. I also tested 'mvn test -DreuseForks=false' and verified
that all tests now pass in this mode.

Change-Id: I85286ba44f03526b487212a8ddadf71a9f427555
Reviewed-on: http://gerrit.cloudera.org:8080/10570
Reviewed-by: Vuk Ercegovac 
Tested-by: Impala Public Jenkins 
---
M fe/src/main/java/org/apache/impala/service/FeSupport.java
M fe/src/test/java/org/apache/impala/catalog/CatalogTest.java
M fe/src/test/java/org/apache/impala/catalog/HdfsStorageDescriptorTest.java
M fe/src/test/java/org/apache/impala/testutil/CatalogServiceTestCatalog.java
4 files changed, 14 insertions(+), 3 deletions(-)

Approvals:
  Vuk Ercegovac: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10570
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I85286ba44f03526b487212a8ddadf71a9f427555
Gerrit-Change-Number: 10570
Gerrit-PatchSet: 2
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Reviewer: Vuk Ercegovac 


[Impala-ASF-CR] IMPALA-7110. Fix errors from error-prone

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10583 )

Change subject: IMPALA-7110. Fix errors from error-prone
..


Patch Set 2: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10583
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie9bfadecb5b92bba4fc7921a6f87f249ed90e771
Gerrit-Change-Number: 10583
Gerrit-PatchSet: 2
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Comment-Date: Wed, 06 Jun 2018 01:46:57 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-2782: part 2 - Allow impala-shell to connect directly to impalad when configured with load balancer and kerberos.

2018-06-05 Thread Vincent Tran (Code Review)
Hello a...@phdata.io, Philip Zeyliger,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10580

to look at the new patch set (#2).

Change subject: IMPALA-2782: part 2 - Allow impala-shell to connect directly to 
impalad when configured with load balancer and kerberos.
..

IMPALA-2782: part 2 - Allow impala-shell to connect directly to
impalad when configured with load balancer and kerberos.

After some additional testing around IMPALA-2782, it was
discovered that the impala-shell starts the session displaying
the expected hostname (as passed -i flag) on the prompt. This gives
the impression that the load balancer was bypassed, however the
actual TSSLSocket was still created with the hostname passed
into the -b or --kerberos_host_fqdn flag.

This change ensures that the hostname used to create the
TSSLSocket will always be the one passed in via the -i flag
on impala-shell, regardless of whether the -b flag is set.

Change-Id: Ibee05bd0dbe8c6ae108b890f0ae0f6900149773a
---
M shell/impala_client.py
1 file changed, 7 insertions(+), 3 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/80/10580/2
--
To view, visit http://gerrit.cloudera.org:8080/10580
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ibee05bd0dbe8c6ae108b890f0ae0f6900149773a
Gerrit-Change-Number: 10580
Gerrit-PatchSet: 2
Gerrit-Owner: Vincent Tran 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: a...@phdata.io


[Impala-ASF-CR] IMPALA-7128 (part 1) Refactor interfaces for Db, View, Table, Partition

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10611 )

Change subject: IMPALA-7128 (part 1) Refactor interfaces for Db, View, Table, 
Partition
..


Patch Set 1:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/2604/


--
To view, visit http://gerrit.cloudera.org:8080/10611
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id55f7d2e94d81e66ce720acb6315f15a89621b31
Gerrit-Change-Number: 10611
Gerrit-PatchSet: 1
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Wed, 06 Jun 2018 01:26:01 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7128 (part 1) Refactor interfaces for Db, View, Table, Partition

2018-06-05 Thread Todd Lipcon (Code Review)
Todd Lipcon has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/10611


Change subject: IMPALA-7128 (part 1) Refactor interfaces for Db, View, Table, 
Partition
..

IMPALA-7128 (part 1) Refactor interfaces for Db, View, Table, Partition

This refactors out interfaces in the frontend for the interaction
between the analysis/planning code and the classes that implement
these catalog objects.

This takes care of the most commonly used objects but defers some others
(e.g. functions, cache pools, data sources, etc) to follow-on patches.

There are a few spots remaining in the frontend that still downcast to
implementation classes, particularly where the frontend actually makes
modifications to catalog objects in-place. I left TODOs in those spots
and will come back later as necessary.

Change-Id: Id55f7d2e94d81e66ce720acb6315f15a89621b31
---
M 
fe/src/main/java/org/apache/impala/analysis/AlterTableAddDropRangePartitionStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableAddPartitionStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableAddReplaceColsStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableAlterColStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableDropColStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableDropPartitionStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableOrViewRenameStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableSetCachedStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableSetFileFormatStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableSetLocationStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableSetRowFormatStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableSetStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableSetTblProperties.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableSortByStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterViewStmt.java
M fe/src/main/java/org/apache/impala/analysis/AnalysisContext.java
M fe/src/main/java/org/apache/impala/analysis/Analyzer.java
M fe/src/main/java/org/apache/impala/analysis/BaseTableRef.java
M fe/src/main/java/org/apache/impala/analysis/ColumnLineageGraph.java
M fe/src/main/java/org/apache/impala/analysis/ComputeStatsStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateDbStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateFunctionStmtBase.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableAsSelectStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableLikeStmt.java
M fe/src/main/java/org/apache/impala/analysis/DescribeTableStmt.java
M fe/src/main/java/org/apache/impala/analysis/DescriptorTable.java
M fe/src/main/java/org/apache/impala/analysis/DropDbStmt.java
M fe/src/main/java/org/apache/impala/analysis/DropFunctionStmt.java
M fe/src/main/java/org/apache/impala/analysis/DropTableOrViewStmt.java
M fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java
M fe/src/main/java/org/apache/impala/analysis/InlineViewRef.java
M fe/src/main/java/org/apache/impala/analysis/InsertStmt.java
M fe/src/main/java/org/apache/impala/analysis/IsNullPredicate.java
M fe/src/main/java/org/apache/impala/analysis/LoadDataStmt.java
M fe/src/main/java/org/apache/impala/analysis/ModifyStmt.java
M fe/src/main/java/org/apache/impala/analysis/PartitionDef.java
M fe/src/main/java/org/apache/impala/analysis/PartitionSet.java
M fe/src/main/java/org/apache/impala/analysis/PartitionSpecBase.java
M fe/src/main/java/org/apache/impala/analysis/Path.java
M fe/src/main/java/org/apache/impala/analysis/PrivilegeSpec.java
M fe/src/main/java/org/apache/impala/analysis/SelectStmt.java
M fe/src/main/java/org/apache/impala/analysis/ShowCreateFunctionStmt.java
M fe/src/main/java/org/apache/impala/analysis/ShowCreateTableStmt.java
M fe/src/main/java/org/apache/impala/analysis/ShowFilesStmt.java
M fe/src/main/java/org/apache/impala/analysis/ShowStatsStmt.java
M fe/src/main/java/org/apache/impala/analysis/SlotRef.java
M fe/src/main/java/org/apache/impala/analysis/StmtMetadataLoader.java
M fe/src/main/java/org/apache/impala/analysis/TableDef.java
M fe/src/main/java/org/apache/impala/analysis/TableRef.java
M fe/src/main/java/org/apache/impala/analysis/ToSqlUtils.java
M fe/src/main/java/org/apache/impala/analysis/TruncateStmt.java
M fe/src/main/java/org/apache/impala/analysis/TupleDescriptor.java
M fe/src/main/java/org/apache/impala/analysis/WithClause.java
M fe/src/main/java/org/apache/impala/catalog/Catalog.java
M fe/src/main/java/org/apache/impala/catalog/Db.java
A fe/src/main/java/org/apache/impala/catalog/FeCatalog.java
A fe/src/main/java/org/apache/impala/catalog/FeDb.java
A fe/src/main/java/org/apache/impala/catalog/FeHdfsPartition.java
A fe/src/main/java/org/apache/impala/catalog/FeHdfsTable.java
A 

[Impala-ASF-CR] IMPALA-7016: Implement ALTER DATABASE SET OWNER

2018-06-05 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has uploaded a new patch set (#5). ( 
http://gerrit.cloudera.org:8080/10471 )

Change subject: IMPALA-7016: Implement ALTER DATABASE SET OWNER
..

IMPALA-7016: Implement ALTER DATABASE SET OWNER

Alters the database owner to either user or role.

Syntax:
ALTER DATABASE db SET OWNER USER user
ALTER DATABASE db SET OWNER ROLE role

Testing:
- Added new front-end tests
- Added new end-to-end tests

Change-Id: Ie3b923021ebce5192d2d64784e7ddb952ba82bc3
---
M common/thrift/CatalogService.thrift
M common/thrift/JniCatalog.thrift
M fe/src/main/cup/sql-parser.cup
A fe/src/main/java/org/apache/impala/analysis/AlterDbSetOwnerStmt.java
A fe/src/main/java/org/apache/impala/analysis/AlterDbStmt.java
M fe/src/main/java/org/apache/impala/analysis/AnalysisContext.java
A fe/src/main/java/org/apache/impala/analysis/Owner.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/main/java/org/apache/impala/service/Frontend.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java
M fe/src/test/java/org/apache/impala/analysis/AuthorizationTest.java
M fe/src/test/java/org/apache/impala/analysis/ParserTest.java
M tests/metadata/test_ddl.py
M tests/metadata/test_ddl_base.py
14 files changed, 328 insertions(+), 7 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/71/10471/5
--
To view, visit http://gerrit.cloudera.org:8080/10471
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ie3b923021ebce5192d2d64784e7ddb952ba82bc3
Gerrit-Change-Number: 10471
Gerrit-PatchSet: 5
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Adam Holley 
Gerrit-Reviewer: Alex Behm 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Vuk Ercegovac 


[Impala-ASF-CR] IMPALA-7016: Implement ALTER DATABASE SET OWNER

2018-06-05 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10471 )

Change subject: IMPALA-7016: Implement ALTER DATABASE SET OWNER
..


Patch Set 5:

(4 comments)

http://gerrit.cloudera.org:8080/#/c/10471/3/common/thrift/JniCatalog.thrift
File common/thrift/JniCatalog.thrift:

http://gerrit.cloudera.org:8080/#/c/10471/3/common/thrift/JniCatalog.thrift@55
PS3, Line 55:   COMMENT_ON,
> need comma
Done


http://gerrit.cloudera.org:8080/#/c/10471/3/fe/src/main/cup/sql-parser.cup
File fe/src/main/cup/sql-parser.cup:

http://gerrit.cloudera.org:8080/#/c/10471/3/fe/src/main/cup/sql-parser.cup@1030
PS3, Line 1030: comment_on_stmt ::=
> Can you add a short comment as to why USER cannot be a keyword?
Done


http://gerrit.cloudera.org:8080/#/c/10471/3/fe/src/main/java/org/apache/impala/analysis/OwnerType.java
File fe/src/main/java/org/apache/impala/analysis/OwnerType.java:

http://gerrit.cloudera.org:8080/#/c/10471/3/fe/src/main/java/org/apache/impala/analysis/OwnerType.java@23
PS3, Line 23:
> nit: Can the name be changed since it's not just type but includes the name
I'll change it to Owner.


http://gerrit.cloudera.org:8080/#/c/10471/3/fe/src/test/java/org/apache/impala/analysis/ParserTest.java
File fe/src/test/java/org/apache/impala/analysis/ParserTest.java:

http://gerrit.cloudera.org:8080/#/c/10471/3/fe/src/test/java/org/apache/impala/analysis/ParserTest.java@3757
PS3, Line 3757: ParserError("ALTER DATABASE SET OWNER foo");
> Add test for no parms after OWNER.
Done



-- 
To view, visit http://gerrit.cloudera.org:8080/10471
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie3b923021ebce5192d2d64784e7ddb952ba82bc3
Gerrit-Change-Number: 10471
Gerrit-PatchSet: 5
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Adam Holley 
Gerrit-Reviewer: Alex Behm 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Wed, 06 Jun 2018 01:02:34 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6812: Fix flaky Kudu scan tests

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10503 )

Change subject: IMPALA-6812: Fix flaky Kudu scan tests
..


Patch Set 1: -Code-Review

We have the kudu_latest_observed_timestamp logic in Impala that stores the 
timestamp in the user session after a DML operation. I believe in all the cases 
we've seen the queries should be part of the same user session (otherwise there 
are other consistency issues that we'd see).


--
To view, visit http://gerrit.cloudera.org:8080/10503
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I70df84f2cbc663107f2ad029565d3c15bdfbd47c
Gerrit-Change-Number: 10503
Gerrit-PatchSet: 1
Gerrit-Owner: Thomas Marshall 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: David Ribeiro Alves 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Comment-Date: Wed, 06 Jun 2018 00:59:01 +
Gerrit-HasComments: No


[Impala-ASF-CR] Fix Indents from IMPALA-4970

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10560 )

Change subject: Fix Indents from IMPALA-4970
..


Patch Set 2:

Pretty sure this was an OOM due to 
https://issues.apache.org/jira/browse/IMPALA-7008 (the patchset wasn't rebased 
onto a commit with the fix)


--
To view, visit http://gerrit.cloudera.org:8080/10560
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4ea53e0780cca9f791808c2a73c7f72ba27c2159
Gerrit-Change-Number: 10560
Gerrit-PatchSet: 2
Gerrit-Owner: Rahul Shivu Mahadev 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jim Apple 
Gerrit-Reviewer: Rahul Shivu Mahadev 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Wed, 06 Jun 2018 00:43:50 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7078: Part 1: improve memory consumption of wide Avro scans

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10550 )

Change subject: IMPALA-7078: Part 1: improve memory consumption of wide Avro 
scans
..


Patch Set 11: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/10550
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iebd2600b4784fd19696c9b92eefb7d7e9db0c80b
Gerrit-Change-Number: 10550
Gerrit-PatchSet: 11
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 06 Jun 2018 00:33:22 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7078: Part 2: reduce queue size based on num scanner threads

2018-06-05 Thread Tim Armstrong (Code Review)
Hello Dan Hecht,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10573

to look at the new patch set (#4).

Change subject: IMPALA-7078: Part 2: reduce queue size based on 
num_scanner_threads
..

IMPALA-7078: Part 2: reduce queue size based on num_scanner_threads

This gives a tighter bound on memory consumption when running with
a lower num_scanner_threads value. With IMPALA-7096 we'll revisit
the approach to reliably avoid OOM.

Cap the maximum row batch queue size at 5 * max_num_scanner_threads_
so that we guarantee lower amounts of memory in the row batch queue
when num_scanner_threads is set, rather than just achieving it
statistically because of the producer running slower relative to
consumer. It does not reduce the default significantly on typical
server configurations that would have 24+ cores except under high
concurrency or low memory environments where the number of scanner
threads is limited. We should evaluate reducing the default further
or otherwise better controlling memory consumption in a follow-up,
based on experiments.

Testing:
Tested along with Part 1.

Change-Id: I8c0bd14e6bcd9fc1655e344a5307ea0eb4600e8b
---
M be/src/exec/hdfs-scan-node.cc
1 file changed, 19 insertions(+), 6 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/73/10573/4
--
To view, visit http://gerrit.cloudera.org:8080/10573
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8c0bd14e6bcd9fc1655e344a5307ea0eb4600e8b
Gerrit-Change-Number: 10573
Gerrit-PatchSet: 4
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-7078: Part 1: improve memory consumption of wide Avro scans

2018-06-05 Thread Tim Armstrong (Code Review)
Hello Dan Hecht,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10550

to look at the new patch set (#11).

Change subject: IMPALA-7078: Part 1: improve memory consumption of wide Avro 
scans
..

IMPALA-7078: Part 1: improve memory consumption of wide Avro scans

Revert to the pre-IMPALA-3905 algorithm for deciding when to return a
batch from an Avro scan. The post-IMPALA-3905 algorithm is bad for
wide tables where there are only a small number of rows per Avro block.

Optimise memory transfer for selective scans - don't attach unused
decompression buffers to the output batch. Combined with the previous
change, this dramatically reduces the amount of memory transferred out
of scanner threads for selective scans of wide tables.

Includes some observability improvements including additional
counters that will help diagnose issues like this more easily:
* Add counters to give some insight into row batch queue. Here's
  an excerpt:
   - RowBatchBytesEnqueued: 20.89 MB (21903380)
   - RowBatchQueueCapacity: 5 (5)
   - RowBatchQueueGetWaitTime: 59.187ms
   - RowBatchQueuePeakMemoryUsage: 8.85 MB (9279347)
   - RowBatchQueuePutWaitTime: 0.000ns
   - RowBatchesEnqueued: 6 (6)
* Don't create AverageScannerThreadConcurrency for MT scan node where
  it's not actually used.
* Track the row batch queue memory consumption against a sub-tracker
  HDFS_SCAN_NODE (id=2): Reservation=48.00 MB OtherMemory=588.00 KB Total=48.57 
MB Peak=48.62 MB
Queued Batches: Total=588.00 KB Peak=637.00 KB

Ran the repro in the JIRA. Memory consumption was reduced from ~500MB
to ~220MB on my system.

Testing:
* Ran stress test for an hour on uncompressed and 3 hours on snappy-compressed 
avro.
* Debug exhaustive tests passed.
* ASAN core tests passed.

Perf:
- Parquet TPC-H scale factor 60 on one impalad showed no change in perf
- Avro/Snappy scale factor 20 showed a small improvement:
+--+-+-++++
| Workload | File Format | Avg (s) | Delta(Avg) | GeoMean(s) | 
Delta(GeoMean) |
+--+-+-++++
| TPCH(20) | avro / snap / block | 9.86| -2.23% | 7.83   | -2.37%   
  |
+--+-+-++++

+--+--+--++-++---++-+---+
| Workload | Query| File Format  | Avg(s) | Base Avg(s) | 
Delta(Avg) | StdDev(%) | Base StdDev(%) | Num Clients | Iters |
+--+--+--++-++---++-+---+
| TPCH(20) | TPCH-Q6  | avro / block / block | 5.59   | 5.17|   +8.10%  
 |   0.75%   |   0.71%| 1   | 30|
| TPCH(20) | TPCH-Q14 | avro / block / block | 6.31   | 5.89|   +7.21%  
 |   0.68%   |   0.74%| 1   | 30|
| TPCH(20) | TPCH-Q15 | avro / block / block | 11.32  | 10.64   |   +6.41%  
 |   0.37%   |   0.46%| 1   | 30|
| TPCH(20) | TPCH-Q12 | avro / block / block | 8.57   | 8.14|   +5.23%  
 |   0.67%   |   0.81%| 1   | 30|
| TPCH(20) | TPCH-Q13 | avro / block / block | 6.72   | 6.54|   +2.72%  
 |   0.77%   |   0.73%| 1   | 30|
| TPCH(20) | TPCH-Q4  | avro / block / block | 11.76  | 11.61   |   +1.32%  
 |   0.60%   |   0.61%| 1   | 30|
| TPCH(20) | TPCH-Q7  | avro / block / block | 14.43  | 14.26   |   +1.21%  
 |   1.14%   |   0.35%| 1   | 30|
| TPCH(20) | TPCH-Q21 | avro / block / block | 34.12  | 34.25   |   -0.36%  
 |   0.27%   |   0.24%| 1   | 30|
| TPCH(20) | TPCH-Q20 | avro / block / block | 8.49   | 8.52|   -0.38%  
 |   0.45%   |   0.54%| 1   | 30|
| TPCH(20) | TPCH-Q1  | avro / block / block | 6.99   | 7.02|   -0.38%  
 |   0.96%   |   0.65%| 1   | 30|
| TPCH(20) | TPCH-Q22 | avro / block / block | 2.44   | 2.47|   -1.09%  
 |   1.81%   |   1.47%| 1   | 30|
| TPCH(20) | TPCH-Q11 | avro / block / block | 1.99   | 2.02|   -1.57%  
 |   1.95%   |   1.90%| 1   | 30|
| TPCH(20) | TPCH-Q17 | avro / block / block | 13.57  | 13.79   |   -1.63%  
 |   1.53%   |   1.31%| 1   | 30|
| TPCH(20) | TPCH-Q18 | avro / block / block | 21.93  | 22.31   |   -1.72%  
 |   0.31%   |   0.34%| 1   | 30|
| TPCH(20) | TPCH-Q8  | avro / block / block | 9.05   | 9.31|   -2.81%  
 |   0.85%   |   0.72%| 1   | 30|
| TPCH(20) | TPCH-Q19 | avro / block / block | 7.20   | 7.41|   -2.91%  
 |   0.72%   |   0.52%| 1   | 30   

[Impala-ASF-CR] IMPALA-7078: Part 2: reduce queue size based on num scanner threads

2018-06-05 Thread Tim Armstrong (Code Review)
Hello Dan Hecht,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10573

to look at the new patch set (#3).

Change subject: IMPALA-7078: Part 2: reduce queue size based on 
num_scanner_threads
..

IMPALA-7078: Part 2: reduce queue size based on num_scanner_threads

This gives a tighter bound on memory consumption when running with
a lower num_scanner_threads value. With IMPALA-7096 we'll revisit
the approach to reliably avoid OOM.

Cap the maximum row batch queue size at 5 * max_num_scanner_threads_
so that we guarantee lower amounts of memory in the row batch queue
when num_scanner_threads is set, rather than just achieving it
statistically because of the producer running slower relative to
consumer. It does not reduce the default significantly on typical
server configurations that would have 24+ cores except under high
concurrency or low memory environments where the number of scanner
threads is limited. We should evaluate reducing the default further
or otherwise better controlling memory consumption in a follow-up,
based on experiments.

Testing:
Tested along with Part 1.

Change-Id: I8c0bd14e6bcd9fc1655e344a5307ea0eb4600e8b
---
M be/src/exec/hdfs-scan-node.cc
1 file changed, 19 insertions(+), 7 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/73/10573/3
--
To view, visit http://gerrit.cloudera.org:8080/10573
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8c0bd14e6bcd9fc1655e344a5307ea0eb4600e8b
Gerrit-Change-Number: 10573
Gerrit-PatchSet: 3
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Dan Hecht 


[Impala-ASF-CR] IMPALA-7078: Part 1: improve memory consumption of wide Avro scans

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10550 )

Change subject: IMPALA-7078: Part 1: improve memory consumption of wide Avro 
scans
..


Patch Set 9:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/10550/9/be/src/exec/hdfs-scan-node.cc
File be/src/exec/hdfs-scan-node.cc:

http://gerrit.cloudera.org:8080/#/c/10550/9/be/src/exec/hdfs-scan-node.cc@a171
PS9, Line 171:
Oops!



--
To view, visit http://gerrit.cloudera.org:8080/10550
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iebd2600b4784fd19696c9b92eefb7d7e9db0c80b
Gerrit-Change-Number: 10550
Gerrit-PatchSet: 9
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 06 Jun 2018 00:31:48 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7078: Part 1: improve memory consumption of wide Avro scans

2018-06-05 Thread Tim Armstrong (Code Review)
Hello Dan Hecht,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10550

to look at the new patch set (#10).

Change subject: IMPALA-7078: Part 1: improve memory consumption of wide Avro 
scans
..

IMPALA-7078: Part 1: improve memory consumption of wide Avro scans

Revert to the pre-IMPALA-3905 algorithm for deciding when to return a
batch from an Avro scan. The post-IMPALA-3905 algorithm is bad for
wide tables where there are only a small number of rows per Avro block.

Optimise memory transfer for selective scans - don't attach unused
decompression buffers to the output batch. Combined with the previous
change, this dramatically reduces the amount of memory transferred out
of scanner threads for selective scans of wide tables.

Includes some observability improvements including additional
counters that will help diagnose issues like this more easily:
* Add counters to give some insight into row batch queue. Here's
  an excerpt:
   - RowBatchBytesEnqueued: 20.89 MB (21903380)
   - RowBatchQueueCapacity: 5 (5)
   - RowBatchQueueGetWaitTime: 59.187ms
   - RowBatchQueuePeakMemoryUsage: 8.85 MB (9279347)
   - RowBatchQueuePutWaitTime: 0.000ns
   - RowBatchesEnqueued: 6 (6)
* Don't create AverageScannerThreadConcurrency for MT scan node where
  it's not actually used.
* Track the row batch queue memory consumption against a sub-tracker
  HDFS_SCAN_NODE (id=2): Reservation=48.00 MB OtherMemory=588.00 KB Total=48.57 
MB Peak=48.62 MB
Queued Batches: Total=588.00 KB Peak=637.00 KB

Ran the repro in the JIRA. Memory consumption was reduced from ~500MB
to ~220MB on my system.

Testing:
* Ran stress test for an hour on uncompressed and 3 hours on snappy-compressed 
avro.
* Debug exhaustive tests passed.
* ASAN core tests passed.

Perf:
- Parquet TPC-H scale factor 60 on one impalad showed no change in perf
- Avro/Snappy scale factor 20 showed a small improvement:
+--+-+-++++
| Workload | File Format | Avg (s) | Delta(Avg) | GeoMean(s) | 
Delta(GeoMean) |
+--+-+-++++
| TPCH(20) | avro / snap / block | 9.86| -2.23% | 7.83   | -2.37%   
  |
+--+-+-++++

+--+--+--++-++---++-+---+
| Workload | Query| File Format  | Avg(s) | Base Avg(s) | 
Delta(Avg) | StdDev(%) | Base StdDev(%) | Num Clients | Iters |
+--+--+--++-++---++-+---+
| TPCH(20) | TPCH-Q6  | avro / block / block | 5.59   | 5.17|   +8.10%  
 |   0.75%   |   0.71%| 1   | 30|
| TPCH(20) | TPCH-Q14 | avro / block / block | 6.31   | 5.89|   +7.21%  
 |   0.68%   |   0.74%| 1   | 30|
| TPCH(20) | TPCH-Q15 | avro / block / block | 11.32  | 10.64   |   +6.41%  
 |   0.37%   |   0.46%| 1   | 30|
| TPCH(20) | TPCH-Q12 | avro / block / block | 8.57   | 8.14|   +5.23%  
 |   0.67%   |   0.81%| 1   | 30|
| TPCH(20) | TPCH-Q13 | avro / block / block | 6.72   | 6.54|   +2.72%  
 |   0.77%   |   0.73%| 1   | 30|
| TPCH(20) | TPCH-Q4  | avro / block / block | 11.76  | 11.61   |   +1.32%  
 |   0.60%   |   0.61%| 1   | 30|
| TPCH(20) | TPCH-Q7  | avro / block / block | 14.43  | 14.26   |   +1.21%  
 |   1.14%   |   0.35%| 1   | 30|
| TPCH(20) | TPCH-Q21 | avro / block / block | 34.12  | 34.25   |   -0.36%  
 |   0.27%   |   0.24%| 1   | 30|
| TPCH(20) | TPCH-Q20 | avro / block / block | 8.49   | 8.52|   -0.38%  
 |   0.45%   |   0.54%| 1   | 30|
| TPCH(20) | TPCH-Q1  | avro / block / block | 6.99   | 7.02|   -0.38%  
 |   0.96%   |   0.65%| 1   | 30|
| TPCH(20) | TPCH-Q22 | avro / block / block | 2.44   | 2.47|   -1.09%  
 |   1.81%   |   1.47%| 1   | 30|
| TPCH(20) | TPCH-Q11 | avro / block / block | 1.99   | 2.02|   -1.57%  
 |   1.95%   |   1.90%| 1   | 30|
| TPCH(20) | TPCH-Q17 | avro / block / block | 13.57  | 13.79   |   -1.63%  
 |   1.53%   |   1.31%| 1   | 30|
| TPCH(20) | TPCH-Q18 | avro / block / block | 21.93  | 22.31   |   -1.72%  
 |   0.31%   |   0.34%| 1   | 30|
| TPCH(20) | TPCH-Q8  | avro / block / block | 9.05   | 9.31|   -2.81%  
 |   0.85%   |   0.72%| 1   | 30|
| TPCH(20) | TPCH-Q19 | avro / block / block | 7.20   | 7.41|   -2.91%  
 |   0.72%   |   0.52%| 1   | 30   

[Impala-ASF-CR] IMPALA-7078: Part 2: reduce queue size based on num scanner threads

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10573 )

Change subject: IMPALA-7078: Part 2: reduce queue size based on 
num_scanner_threads
..


Patch Set 2:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/10573/2/be/src/exec/hdfs-scan-node.cc
File be/src/exec/hdfs-scan-node.cc:

http://gerrit.cloudera.org:8080/#/c/10573/2/be/src/exec/hdfs-scan-node.cc@204
PS2, Line 204: f (file_descs_.empty() || progress_.done()) return Status::OK();
> is this somehow related to this change? why do this?
This was an error in separating out this from the previous patchset. The 
previous patchset erroneously deleted this line.



--
To view, visit http://gerrit.cloudera.org:8080/10573
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c0bd14e6bcd9fc1655e344a5307ea0eb4600e8b
Gerrit-Change-Number: 10573
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 06 Jun 2018 00:31:25 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7078: Part 1: improve memory consumption of wide Avro scans

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10550 )

Change subject: IMPALA-7078: Part 1: improve memory consumption of wide Avro 
scans
..


Patch Set 9: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/10550
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iebd2600b4784fd19696c9b92eefb7d7e9db0c80b
Gerrit-Change-Number: 10550
Gerrit-PatchSet: 9
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 06 Jun 2018 00:26:56 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7078: Part 1: improve memory consumption of wide Avro scans

2018-06-05 Thread Tim Armstrong (Code Review)
Hello Dan Hecht,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10550

to look at the new patch set (#8).

Change subject: IMPALA-7078: Part 1: improve memory consumption of wide Avro 
scans
..

IMPALA-7078: Part 1: improve memory consumption of wide Avro scans

Revert to the pre-IMPALA-3905 algorithm for deciding when to return a
batch from an Avro scan. The post-IMPALA-3905 algorithm is bad for
wide tables where there are only a small number of rows per Avro block.

Optimise memory transfer for selective scans - don't attach unused
decompression buffers to the output batch. Combined with the previous
change, this dramatically reduces the amount of memory transferred out
of scanner threads for selective scans of wide tables.

Includes some observability improvements including additional
counters that will help diagnose issues like this more easily:
* Add counters to give some insight into row batch queue. Here's
  an excerpt:
   - RowBatchBytesEnqueued: 20.89 MB (21903380)
   - RowBatchQueueCapacity: 5 (5)
   - RowBatchQueueGetWaitTime: 59.187ms
   - RowBatchQueuePeakMemoryUsage: 8.85 MB (9279347)
   - RowBatchQueuePutWaitTime: 0.000ns
   - RowBatchesEnqueued: 6 (6)
* Don't create AverageScannerThreadConcurrency for MT scan node where
  it's not actually used.
* Track the row batch queue memory consumption against a sub-tracker
  HDFS_SCAN_NODE (id=2): Reservation=48.00 MB OtherMemory=588.00 KB Total=48.57 
MB Peak=48.62 MB
Queued Batches: Total=588.00 KB Peak=637.00 KB

Ran the repro in the JIRA. Memory consumption was reduced from ~500MB
to ~220MB on my system.

Testing:
* Ran stress test for an hour on uncompressed and 3 hours on snappy-compressed 
avro.
* Debug exhaustive tests passed.
* ASAN core tests passed.

Perf:
- Parquet TPC-H scale factor 60 on one impalad showed no change in perf
- Avro/Snappy scale factor 20 showed a small improvement:
+--+-+-++++
| Workload | File Format | Avg (s) | Delta(Avg) | GeoMean(s) | 
Delta(GeoMean) |
+--+-+-++++
| TPCH(20) | avro / snap / block | 9.86| -2.23% | 7.83   | -2.37%   
  |
+--+-+-++++

+--+--+--++-++---++-+---+
| Workload | Query| File Format  | Avg(s) | Base Avg(s) | 
Delta(Avg) | StdDev(%) | Base StdDev(%) | Num Clients | Iters |
+--+--+--++-++---++-+---+
| TPCH(20) | TPCH-Q6  | avro / block / block | 5.59   | 5.17|   +8.10%  
 |   0.75%   |   0.71%| 1   | 30|
| TPCH(20) | TPCH-Q14 | avro / block / block | 6.31   | 5.89|   +7.21%  
 |   0.68%   |   0.74%| 1   | 30|
| TPCH(20) | TPCH-Q15 | avro / block / block | 11.32  | 10.64   |   +6.41%  
 |   0.37%   |   0.46%| 1   | 30|
| TPCH(20) | TPCH-Q12 | avro / block / block | 8.57   | 8.14|   +5.23%  
 |   0.67%   |   0.81%| 1   | 30|
| TPCH(20) | TPCH-Q13 | avro / block / block | 6.72   | 6.54|   +2.72%  
 |   0.77%   |   0.73%| 1   | 30|
| TPCH(20) | TPCH-Q4  | avro / block / block | 11.76  | 11.61   |   +1.32%  
 |   0.60%   |   0.61%| 1   | 30|
| TPCH(20) | TPCH-Q7  | avro / block / block | 14.43  | 14.26   |   +1.21%  
 |   1.14%   |   0.35%| 1   | 30|
| TPCH(20) | TPCH-Q21 | avro / block / block | 34.12  | 34.25   |   -0.36%  
 |   0.27%   |   0.24%| 1   | 30|
| TPCH(20) | TPCH-Q20 | avro / block / block | 8.49   | 8.52|   -0.38%  
 |   0.45%   |   0.54%| 1   | 30|
| TPCH(20) | TPCH-Q1  | avro / block / block | 6.99   | 7.02|   -0.38%  
 |   0.96%   |   0.65%| 1   | 30|
| TPCH(20) | TPCH-Q22 | avro / block / block | 2.44   | 2.47|   -1.09%  
 |   1.81%   |   1.47%| 1   | 30|
| TPCH(20) | TPCH-Q11 | avro / block / block | 1.99   | 2.02|   -1.57%  
 |   1.95%   |   1.90%| 1   | 30|
| TPCH(20) | TPCH-Q17 | avro / block / block | 13.57  | 13.79   |   -1.63%  
 |   1.53%   |   1.31%| 1   | 30|
| TPCH(20) | TPCH-Q18 | avro / block / block | 21.93  | 22.31   |   -1.72%  
 |   0.31%   |   0.34%| 1   | 30|
| TPCH(20) | TPCH-Q8  | avro / block / block | 9.05   | 9.31|   -2.81%  
 |   0.85%   |   0.72%| 1   | 30|
| TPCH(20) | TPCH-Q19 | avro / block / block | 7.20   | 7.41|   -2.91%  
 |   0.72%   |   0.52%| 1   | 30

[Impala-ASF-CR] IMPALA-7078: Part 1: improve memory consumption of wide Avro scans

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10550 )

Change subject: IMPALA-7078: Part 1: improve memory consumption of wide Avro 
scans
..


Patch Set 7:

(4 comments)

http://gerrit.cloudera.org:8080/#/c/10550/7/be/src/exec/hdfs-avro-scanner.cc
File be/src/exec/hdfs-avro-scanner.cc:

http://gerrit.cloudera.org:8080/#/c/10550/7/be/src/exec/hdfs-avro-scanner.cc@498
PS7, Line 498: starts off with AtCapacity() == true.
> under which case do we get here with row_batch already AtCapacity()? Not as
We can get there if the tuple buffer is > 8mb, e.g. for extremely wide rows.


http://gerrit.cloudera.org:8080/#/c/10550/7/be/src/exec/hdfs-avro-scanner.cc@498
PS7, Line 498: mke
> make
Done


http://gerrit.cloudera.org:8080/#/c/10550/7/be/src/runtime/mem-tracker.h
File be/src/runtime/mem-tracker.h:

http://gerrit.cloudera.org:8080/#/c/10550/7/be/src/runtime/mem-tracker.h@238
PS7, Line 238: Must not be used if
> do we have cases that transfer in this case? if so, what's the recommended
I think the data stream receiver might have this case. For now we do a 
Release() and Consume(). Maybe, ideally, we should do a TryTransfer() or 
something like that?


http://gerrit.cloudera.org:8080/#/c/10550/7/be/src/runtime/mem-tracker.cc
File be/src/runtime/mem-tracker.cc:

http://gerrit.cloudera.org:8080/#/c/10550/7/be/src/runtime/mem-tracker.cc@236
PS7, Line 236:   && all_trackers_[ancestor_idx - 1] == 
dst->all_trackers_[dst_ancestor_idx - 1]) {
> should we dcheck that the limits are not set?
These MemTrackers can have limits, it's the ones below the common ancestors 
that are not allowed to have limits. ConsumeLocal() enforces this.



--
To view, visit http://gerrit.cloudera.org:8080/10550
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iebd2600b4784fd19696c9b92eefb7d7e9db0c80b
Gerrit-Change-Number: 10550
Gerrit-PatchSet: 7
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 06 Jun 2018 00:26:22 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6956: deflake and logging for query expiration test

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10602 )

Change subject: IMPALA-6956: deflake and logging for query_expiration test
..


Patch Set 2:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/2603/


--
To view, visit http://gerrit.cloudera.org:8080/10602
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I01a8762d28ad920b9ec8a0b1b82469618c66768f
Gerrit-Change-Number: 10602
Gerrit-PatchSet: 2
Gerrit-Owner: Vuk Ercegovac 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 23:52:00 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7078: Part 2: reduce queue size based on num scanner threads

2018-06-05 Thread Dan Hecht (Code Review)
Dan Hecht has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10573 )

Change subject: IMPALA-7078: Part 2: reduce queue size based on 
num_scanner_threads
..


Patch Set 2:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/10573/2/be/src/exec/hdfs-scan-node.cc
File be/src/exec/hdfs-scan-node.cc:

http://gerrit.cloudera.org:8080/#/c/10573/2/be/src/exec/hdfs-scan-node.cc@204
PS2, Line 204: f (file_descs_.empty() || progress_.done()) return Status::OK();
is this somehow related to this change? why do this?



--
To view, visit http://gerrit.cloudera.org:8080/10573
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c0bd14e6bcd9fc1655e344a5307ea0eb4600e8b
Gerrit-Change-Number: 10573
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Comment-Date: Tue, 05 Jun 2018 23:48:37 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6956: deflake and logging for query expiration test

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10602 )

Change subject: IMPALA-6956: deflake and logging for query_expiration test
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/10602
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I01a8762d28ad920b9ec8a0b1b82469618c66768f
Gerrit-Change-Number: 10602
Gerrit-PatchSet: 2
Gerrit-Owner: Vuk Ercegovac 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 23:48:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7101: Fix race between Fetch and Close RPCs that can lead to hang

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10601 )

Change subject: IMPALA-7101: Fix race between Fetch and Close RPCs that can 
lead to hang
..


Patch Set 6: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10601
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7c147550f86d81b818ecbdd34cf2919ced7ff8c5
Gerrit-Change-Number: 10601
Gerrit-PatchSet: 6
Gerrit-Owner: Dan Hecht 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 23:47:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7101: Fix race between Fetch and Close RPCs that can lead to hang

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10601 )

Change subject: IMPALA-7101: Fix race between Fetch and Close RPCs that can 
lead to hang
..

IMPALA-7101: Fix race between Fetch and Close RPCs that can lead to hang

If we hit EOS, we'll wait for all the backends to report status (to try
to get a complete profile). But if the query is closed after this point,
then we can get stuck waiting since once the query is closed,
ImpalaServer won't know about this coordinator and so it will stop
forwarding on the ReportStatus RPCs.

The real fix for this is IMPALA-6984, but in the mean time, add another
special case for this JIRA (see the other TODO IMPALA-6984 in
coordinator.cc).

The cancellation test only finds this race once in a while (several
hours) indirectly in a COMPUTE STATS query because the
ChildQueryExecutor will do a CloseOperation() while the execution thread
is inside Fetch(). To make this more reproducible, modify the
cancellation test to allow the close and fetch rpcs to execute
concurrently (don't join the test's fetch thread until after
close). This makes the race reproducible in a few iterations and a few
minutes.

Testing:
- Loop test_cancellation.py

Change-Id: I7c147550f86d81b818ecbdd34cf2919ced7ff8c5
Reviewed-on: http://gerrit.cloudera.org:8080/10601
Reviewed-by: Tim Armstrong 
Tested-by: Impala Public Jenkins 
---
M be/src/runtime/coordinator.cc
M tests/query_test/test_cancellation.py
2 files changed, 47 insertions(+), 9 deletions(-)

Approvals:
  Tim Armstrong: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10601
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I7c147550f86d81b818ecbdd34cf2919ced7ff8c5
Gerrit-Change-Number: 10601
Gerrit-PatchSet: 7
Gerrit-Owner: Dan Hecht 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-6956: deflake and logging for query expiration test

2018-06-05 Thread Vuk Ercegovac (Code Review)
Hello Tim Armstrong,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10602

to look at the new patch set (#2).

Change subject: IMPALA-6956: deflake and logging for query_expiration test
..

IMPALA-6956: deflake and logging for query_expiration test

There was a recent flake where the number of in-flight queries
that were executing differed from the expected number.
The reason for the false negative is that one of the queries
expired before the check for in-flight queries: it took too long
to issue the queries.

This change modifies the timeout for the expired query to not expire
so quickly. Additional logging is added to the check for in-flight
queries so that we can distinguish the case of too few queries vs.
queries that have the wrong state.

Change-Id: I01a8762d28ad920b9ec8a0b1b82469618c66768f
---
M tests/custom_cluster/test_query_expiration.py
1 file changed, 9 insertions(+), 6 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/02/10602/2
--
To view, visit http://gerrit.cloudera.org:8080/10602
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I01a8762d28ad920b9ec8a0b1b82469618c66768f
Gerrit-Change-Number: 10602
Gerrit-PatchSet: 2
Gerrit-Owner: Vuk Ercegovac 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 


[Impala-ASF-CR] IMPALA-6956: deflake and logging for query expiration test

2018-06-05 Thread Vuk Ercegovac (Code Review)
Vuk Ercegovac has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10602 )

Change subject: IMPALA-6956: deflake and logging for query_expiration test
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/10602/1/tests/custom_cluster/test_query_expiration.py
File tests/custom_cluster/test_query_expiration.py:

http://gerrit.cloudera.org:8080/#/c/10602/1/tests/custom_cluster/test_query_expiration.py@58
PS1, Line 58: client.execute("SET EXEC_TIME_LIMIT_S=3")
> The problem I saw was that it just took longer than assumed to get these fo
ah, you're right about the second query. there were two tests, one failed with 
one query off and other with both these queries off.
updated the other query as well.



--
To view, visit http://gerrit.cloudera.org:8080/10602
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I01a8762d28ad920b9ec8a0b1b82469618c66768f
Gerrit-Change-Number: 10602
Gerrit-PatchSet: 1
Gerrit-Owner: Vuk Ercegovac 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 23:42:44 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7101: Fix race between Fetch and Close RPCs that can lead to hang

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10601 )

Change subject: IMPALA-7101: Fix race between Fetch and Close RPCs that can 
lead to hang
..


Patch Set 6: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/10601
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7c147550f86d81b818ecbdd34cf2919ced7ff8c5
Gerrit-Change-Number: 10601
Gerrit-PatchSet: 6
Gerrit-Owner: Dan Hecht 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 23:41:24 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7106: Print rewritten SQL correctly when log trace is enabled

2018-06-05 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has uploaded a new patch set (#7). ( 
http://gerrit.cloudera.org:8080/10571 )

Change subject: IMPALA-7106: Print rewritten SQL correctly when log trace is 
enabled
..

IMPALA-7106: Print rewritten SQL correctly when log trace is enabled

toSql() method is used to print SQL string that is close to the original
SQL string, which is something that users want to see. When debugging
issues related to SQL rewrites, it can be very useful to be able to print/get
the SQL string that is being rewritten. This patch adds a new method
toRewrittenSql() to get the rewritten SQL string. The LOG.trace statement that
prints the rewritten SQL is also updated to use toRewrittenSql().

Testing:
- Added FE test for the rewritten SQL string
- Ran all FE tests

Change-Id: Iab58b0cc865135d261dd4a7f72be130f2e7bde53
---
M fe/src/main/java/org/apache/impala/analysis/AnalysisContext.java
M fe/src/main/java/org/apache/impala/analysis/FromClause.java
M fe/src/main/java/org/apache/impala/analysis/InlineViewRef.java
M fe/src/main/java/org/apache/impala/analysis/SelectStmt.java
M fe/src/main/java/org/apache/impala/analysis/StatementBase.java
M fe/src/main/java/org/apache/impala/analysis/StmtRewriter.java
M fe/src/main/java/org/apache/impala/analysis/TableRef.java
M fe/src/main/java/org/apache/impala/analysis/UnionStmt.java
M fe/src/test/java/org/apache/impala/analysis/ExprRewriterTest.java
9 files changed, 145 insertions(+), 10 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/71/10571/7
--
To view, visit http://gerrit.cloudera.org:8080/10571
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Iab58b0cc865135d261dd4a7f72be130f2e7bde53
Gerrit-Change-Number: 10571
Gerrit-PatchSet: 7
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Reviewer: Vuk Ercegovac 


[Impala-ASF-CR] IMPALA-6035: Add query options to limit thread reservation

2018-06-05 Thread Dan Hecht (Code Review)
Dan Hecht has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10365 )

Change subject: IMPALA-6035: Add query options to limit thread reservation
..


Patch Set 9: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/10365
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5b5bbbdad5cd6b24442eb6c99a4d38c2ad710007
Gerrit-Change-Number: 10365
Gerrit-PatchSet: 9
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Mostafa Mokhtar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 23:40:44 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7106: Print rewritten SQL correctly when log trace is enabled

2018-06-05 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10571 )

Change subject: IMPALA-7106: Print rewritten SQL correctly when log trace is 
enabled
..


Patch Set 7:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/10571/3/fe/src/main/java/org/apache/impala/analysis/SelectStmt.java
File fe/src/main/java/org/apache/impala/analysis/SelectStmt.java:

http://gerrit.cloudera.org:8080/#/c/10571/3/fe/src/main/java/org/apache/impala/analysis/SelectStmt.java@972
PS3, Line 972:
> I tried "SELECT * FROM (SELECT (CASE WHEN TRUE THEN 1 ELSE id END) FROM fun
I forgot to update the InlineRefView. Done.


http://gerrit.cloudera.org:8080/#/c/10571/5/fe/src/main/java/org/apache/impala/analysis/StmtRewriter.java
File fe/src/main/java/org/apache/impala/analysis/StmtRewriter.java:

http://gerrit.cloudera.org:8080/#/c/10571/5/fe/src/main/java/org/apache/impala/analysis/StmtRewriter.java@117
PS5, Line 117: stmt.sqlString_ = null;
> I don't quite understand why you chose to restore the sql string this way.
I did try removing this line and it didn't work. It also broke other tests.



--
To view, visit http://gerrit.cloudera.org:8080/10571
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iab58b0cc865135d261dd4a7f72be130f2e7bde53
Gerrit-Change-Number: 10571
Gerrit-PatchSet: 7
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 23:40:10 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7078: Part 1: improve memory consumption of wide Avro scans

2018-06-05 Thread Dan Hecht (Code Review)
Dan Hecht has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10550 )

Change subject: IMPALA-7078: Part 1: improve memory consumption of wide Avro 
scans
..


Patch Set 7: Code-Review+2

(4 comments)

http://gerrit.cloudera.org:8080/#/c/10550/7/be/src/exec/hdfs-avro-scanner.cc
File be/src/exec/hdfs-avro-scanner.cc:

http://gerrit.cloudera.org:8080/#/c/10550/7/be/src/exec/hdfs-avro-scanner.cc@498
PS7, Line 498: starts off with AtCapacity() == true.
under which case do we get here with row_batch already AtCapacity()? Not asking 
to restructure now, but why is it structured that way?


http://gerrit.cloudera.org:8080/#/c/10550/7/be/src/exec/hdfs-avro-scanner.cc@498
PS7, Line 498: mke
make


http://gerrit.cloudera.org:8080/#/c/10550/7/be/src/runtime/mem-tracker.h
File be/src/runtime/mem-tracker.h:

http://gerrit.cloudera.org:8080/#/c/10550/7/be/src/runtime/mem-tracker.h@238
PS7, Line 238: Must not be used if
do we have cases that transfer in this case? if so, what's the recommended 
pattern? (Release() and Consume()?)


http://gerrit.cloudera.org:8080/#/c/10550/7/be/src/runtime/mem-tracker.cc
File be/src/runtime/mem-tracker.cc:

http://gerrit.cloudera.org:8080/#/c/10550/7/be/src/runtime/mem-tracker.cc@236
PS7, Line 236:   && all_trackers_[ancestor_idx - 1] == 
dst->all_trackers_[dst_ancestor_idx - 1]) {
should we dcheck that the limits are not set?



--
To view, visit http://gerrit.cloudera.org:8080/10550
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iebd2600b4784fd19696c9b92eefb7d7e9db0c80b
Gerrit-Change-Number: 10550
Gerrit-PatchSet: 7
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 23:17:08 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7050: [DOCS] Document the max serialized incremental stat size setting

2018-06-05 Thread Alex Rodoni (Code Review)
Hello Balazs Jeszenszky, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10457

to look at the new patch set (#4).

Change subject: IMPALA-7050: [DOCS] Document the max serialized incremental 
stat size setting
..

IMPALA-7050: [DOCS] Document the max serialized incremental stat size setting

Change-Id: Ifa80325f0008d42a9cc8178e7c144fc2b49d7d4e
---
M docs/topics/impala_perf_stats.xml
1 file changed, 35 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/57/10457/4
--
To view, visit http://gerrit.cloudera.org:8080/10457
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ifa80325f0008d42a9cc8178e7c144fc2b49d7d4e
Gerrit-Change-Number: 10457
Gerrit-PatchSet: 4
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Balazs Jeszenszky 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-7050: [DOCS] Document the max serialized incremental stat size setting

2018-06-05 Thread Alex Rodoni (Code Review)
Alex Rodoni has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10457 )

Change subject: IMPALA-7050: [DOCS] Document the max serialized incremental 
stat size setting
..


Patch Set 3:

(6 comments)

http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_perf_stats.xml
File docs/topics/impala_perf_stats.xml:

http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_perf_stats.xml@709
PS3, Line 709: When you need to run COMPUTE INCREMENTAL STATS 
on
 : very large tables, you can use the configuration 
setting
 :   inc_stats_size_limit_bytes to 
reduce the load on
 : the catalog server.
> turn into passive?
Attempted.


http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_perf_stats.xml@714
PS3, Line 714: This limit is set as a safety check, to prevent the JVM from 
hitting
 : a maximum array limit of 1 GB or runs out of memory.
> the limit is 2GB with suggested JVM versions. Readers would need to underst
Done


http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_perf_stats.xml@724
PS3, Line 724: To change the inc_stats_size_limit_bytes value
> This is a startup option for the impala daemons and the catalogd, not a cli
Do I have to restart both impalad and catalogd after changing the option?


http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_perf_stats.xml@733
PS3, Line 733: spike in heap usage
> as well as crashes.
Done


http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_perf_stats.xml@736
PS3, Line 736: 
 : Setting inc_stats_size_limit_bytes 
to a big value,
 : such as 1 GB or more, can result in a spike in heap 
usage.
 :   
> duplicate of above?
Done


http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_shell_options.xml
File docs/topics/impala_shell_options.xml:

http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_shell_options.xml@570
PS3, Line 570: --inc_stats_size_limit_bytes
> this is an impalad-side option
Removed from this doc.



--
To view, visit http://gerrit.cloudera.org:8080/10457
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ifa80325f0008d42a9cc8178e7c144fc2b49d7d4e
Gerrit-Change-Number: 10457
Gerrit-PatchSet: 3
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Balazs Jeszenszky 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 05 Jun 2018 22:56:10 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7105. Ensure fe tests pass when running standalone

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10570 )

Change subject: IMPALA-7105. Ensure fe tests pass when running standalone
..


Patch Set 1:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/2602/


--
To view, visit http://gerrit.cloudera.org:8080/10570
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I85286ba44f03526b487212a8ddadf71a9f427555
Gerrit-Change-Number: 10570
Gerrit-PatchSet: 1
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 22:36:21 +
Gerrit-HasComments: No


[Impala-ASF-CR] Add missing namespace qualifiers

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10608 )

Change subject: Add missing namespace qualifiers
..


Patch Set 1: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10608
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3ef8f5c8f05100418dec90c2cd2e2b3dcd827c27
Gerrit-Change-Number: 10608
Gerrit-PatchSet: 1
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 22:34:08 +
Gerrit-HasComments: No


[Impala-ASF-CR] Add missing namespace qualifiers

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10608 )

Change subject: Add missing namespace qualifiers
..

Add missing namespace qualifiers

As part of an upcoming rebase of the KRPC code, this change adds missing
namespace qualifiers in some places.

Change-Id: I3ef8f5c8f05100418dec90c2cd2e2b3dcd827c27
Reviewed-on: http://gerrit.cloudera.org:8080/10608
Reviewed-by: Tim Armstrong 
Tested-by: Impala Public Jenkins 
---
M be/src/exec/kudu-util.h
M be/src/runtime/descriptors.h
2 files changed, 2 insertions(+), 2 deletions(-)

Approvals:
  Tim Armstrong: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10608
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I3ef8f5c8f05100418dec90c2cd2e2b3dcd827c27
Gerrit-Change-Number: 10608
Gerrit-PatchSet: 2
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-5552: Add support for authorized proxy groups

2018-06-05 Thread Philip Zeyliger (Code Review)
Philip Zeyliger has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10510 )

Change subject: IMPALA-5552: Add support for authorized proxy groups
..


Patch Set 8:

(5 comments)

http://gerrit.cloudera.org:8080/#/c/10510/8/be/src/service/impala-server.cc
File be/src/service/impala-server.cc:

http://gerrit.cloudera.org:8080/#/c/10510/8/be/src/service/impala-server.cc@1389
PS8, Line 1389:   Status status = exec_env_->frontend()->GetHadoopGroups(req, 
);
Does this code run even if the flag isn't set? If so, someone upgrading but 
using a "bad" UserGroupMapping provider will start getting the wrong message.


http://gerrit.cloudera.org:8080/#/c/10510/8/fe/src/main/java/org/apache/impala/service/JniFrontend.java
File fe/src/main/java/org/apache/impala/service/JniFrontend.java:

http://gerrit.cloudera.org:8080/#/c/10510/8/fe/src/main/java/org/apache/impala/service/JniFrontend.java@657
PS8, Line 657:   protected static void checkGroupsMappingProvider(Configuration 
conf)
We should be doing this at start-up rather than once per request. The issue 
isn't so much that it's expensive, but that you want to fail fast on this sort 
of configuration error.

If we do it at boot time, we should also log the value of this configuration.


http://gerrit.cloudera.org:8080/#/c/10510/8/fe/src/main/java/org/apache/impala/service/JniFrontend.java@659
PS8, Line 659: GroupMappingServiceProvider provider =
Isn't this just GROUPS?


http://gerrit.cloudera.org:8080/#/c/10510/8/fe/src/main/java/org/apache/impala/service/JniFrontend.java@665
PS8, Line 665: Map recommendations = new HashMap<>();
 : recommendations.put(
 : ShellBasedUnixGroupsMapping.class.getName(),
 : JniBasedUnixGroupsMappingWithFallback.class.getName());
 : recommendations.put(
 : ShellBasedUnixGroupsNetgroupMapping.class.getName(),
 : 
JniBasedUnixGroupsNetgroupMappingWithFallback.class.getName());
This could go into the if below.

But, really, just use a longer string. I think the clever map for 
recommendations takes away from the point. You can also just use an if/else for 
the two cases with the same number of lines.


http://gerrit.cloudera.org:8080/#/c/10510/8/fe/src/main/java/org/apache/impala/service/JniFrontend.java@685
PS8, Line 685:   public String checkConfiguration() {
This seems like a good place to check the user group mapping once and for all. 
Please look into what it does, of course.



--
To view, visit http://gerrit.cloudera.org:8080/10510
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6953f89c293b06b72f523b11802232133d9d6cbb
Gerrit-Change-Number: 10510
Gerrit-PatchSet: 8
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Greg Rahn 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 22:32:31 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-5737: Tighten minicluster memory limit

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10277 )

Change subject: IMPALA-5737: Tighten minicluster memory limit
..


Patch Set 7: Code-Review-2

Please don't merge this until I give it the OK, this seems risky.


--
To view, visit http://gerrit.cloudera.org:8080/10277
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If8240551e726c6da546a926a1ce3444f41ef87fe
Gerrit-Change-Number: 10277
Gerrit-PatchSet: 7
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 22:22:24 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7110. Fix errors from error-prone

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10583 )

Change subject: IMPALA-7110. Fix errors from error-prone
..


Patch Set 2:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/2601/


--
To view, visit http://gerrit.cloudera.org:8080/10583
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie9bfadecb5b92bba4fc7921a6f87f249ed90e771
Gerrit-Change-Number: 10583
Gerrit-PatchSet: 2
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Comment-Date: Tue, 05 Jun 2018 22:18:15 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7110. Fix errors from error-prone

2018-06-05 Thread Philip Zeyliger (Code Review)
Philip Zeyliger has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10583 )

Change subject: IMPALA-7110. Fix errors from error-prone
..


Patch Set 2: Code-Review+2

maven profile looks fine.


--
To view, visit http://gerrit.cloudera.org:8080/10583
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie9bfadecb5b92bba4fc7921a6f87f249ed90e771
Gerrit-Change-Number: 10583
Gerrit-PatchSet: 2
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Comment-Date: Tue, 05 Jun 2018 22:16:38 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7110. Fix errors from error-prone

2018-06-05 Thread Bharath Vissapragada (Code Review)
Bharath Vissapragada has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10583 )

Change subject: IMPALA-7110. Fix errors from error-prone
..


Patch Set 2: Code-Review+1

(1 comment)

Phil, can you +2 this, I'm not too familiar with mvn profiles, rest of it lgtm.

http://gerrit.cloudera.org:8080/#/c/10583/1/fe/src/main/java/org/apache/impala/catalog/ScalarFunction.java
File fe/src/main/java/org/apache/impala/catalog/ScalarFunction.java:

http://gerrit.cloudera.org:8080/#/c/10583/1/fe/src/main/java/org/apache/impala/catalog/ScalarFunction.java@85
PS1, Line 85: argTypes
> argTypes is a List, so the built-in toString actually works. My fix down be
ah, nice catch.



--
To view, visit http://gerrit.cloudera.org:8080/10583
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie9bfadecb5b92bba4fc7921a6f87f249ed90e771
Gerrit-Change-Number: 10583
Gerrit-PatchSet: 2
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Comment-Date: Tue, 05 Jun 2018 22:11:57 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7012: Fix NPE when parsing unexpected tokens

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10512 )

Change subject: IMPALA-7012: Fix NPE when parsing unexpected tokens
..

IMPALA-7012: Fix NPE when parsing unexpected tokens

Currently some token are not added to tokenIdMap in the sql scanner and
an NPE will be thrown if such are parsed as unexpected tokens. This
patch fixes this bug and adds a case to ParserTest.

Change-Id: I9c846fdfb22ba37bfc3b1985b9a044014ab58968
Reviewed-on: http://gerrit.cloudera.org:8080/10512
Reviewed-by: Tianyi Wang 
Tested-by: Impala Public Jenkins 
---
M fe/src/main/jflex/sql-scanner.flex
M fe/src/test/java/org/apache/impala/analysis/ParserTest.java
M fe/src/test/java/org/apache/impala/common/FrontendTestBase.java
3 files changed, 36 insertions(+), 21 deletions(-)

Approvals:
  Tianyi Wang: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10512
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I9c846fdfb22ba37bfc3b1985b9a044014ab58968
Gerrit-Change-Number: 10512
Gerrit-PatchSet: 4
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 


[Impala-ASF-CR] IMPALA-7012: Fix NPE when parsing unexpected tokens

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10512 )

Change subject: IMPALA-7012: Fix NPE when parsing unexpected tokens
..


Patch Set 3: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10512
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9c846fdfb22ba37bfc3b1985b9a044014ab58968
Gerrit-Change-Number: 10512
Gerrit-PatchSet: 3
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Comment-Date: Tue, 05 Jun 2018 22:09:30 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7077. Add a configuration for the max number of partitions to load

2018-06-05 Thread Vuk Ercegovac (Code Review)
Vuk Ercegovac has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10584 )

Change subject: IMPALA-7077. Add a configuration for the max number of 
partitions to load
..


Patch Set 1:

Another idea that also came up was to go with a flag like the one in this 
change, but base the limit on memory consumed. That would cover more scenarios 
for banning a table: num partitions, num files, num columns, incremental stats, 
or some combination. We can obtain the size estimate at the per partition 
granularity, so once exceeded, back out as in this change.
There's still the downside of a static flag setting (less flexibility) but 
there is some ease-of-use that can be argued as well.


--
To view, visit http://gerrit.cloudera.org:8080/10584
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ife99a97a891ed14675303ea472abb2932a72cb51
Gerrit-Change-Number: 10584
Gerrit-PatchSet: 1
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 22:07:54 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5737: Tighten minicluster memory limit

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10277 )

Change subject: IMPALA-5737: Tighten minicluster memory limit
..


Patch Set 7: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/2598/


--
To view, visit http://gerrit.cloudera.org:8080/10277
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If8240551e726c6da546a926a1ce3444f41ef87fe
Gerrit-Change-Number: 10277
Gerrit-PatchSet: 7
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Comment-Date: Tue, 05 Jun 2018 22:06:58 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7077. Add a configuration for the max number of partitions to load

2018-06-05 Thread Philip Zeyliger (Code Review)
Philip Zeyliger has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10584 )

Change subject: IMPALA-7077. Add a configuration for the max number of 
partitions to load
..


Patch Set 1:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/10584/1/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/10584/1/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@2067
PS1, Line 2067: if (curPartitionCount + allHmsPartitionsToAdd.size() > 
HdfsTable.MAX_PARTITIONS_PER_TABLE) {
> Yea, we use 90 chars for Java too. Gerrit defaults to 100.
You can re-configure Gerrit for 90 in some config option if that's your thing.


http://gerrit.cloudera.org:8080/#/c/10584/1/fe/src/test/java/org/apache/impala/catalog/CatalogTest.java
File fe/src/test/java/org/apache/impala/catalog/CatalogTest.java:

http://gerrit.cloudera.org:8080/#/c/10584/1/fe/src/test/java/org/apache/impala/catalog/CatalogTest.java@523
PS1, Line 523:   public void testPartitionLimit() throws CatalogException {
You added the exception handling in "updatePartitionsFromHms()" and in some 
alter table path (i.e., two places), but there seems to be only one code path 
tested here. Should we test the other one too here, or is it enough that we 
test it in the QueryTest?



--
To view, visit http://gerrit.cloudera.org:8080/10584
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ife99a97a891ed14675303ea472abb2932a72cb51
Gerrit-Change-Number: 10584
Gerrit-PatchSet: 1
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 22:06:36 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7077. Add a configuration for the max number of partitions to load

2018-06-05 Thread Bharath Vissapragada (Code Review)
Bharath Vissapragada has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10584 )

Change subject: IMPALA-7077. Add a configuration for the max number of 
partitions to load
..


Patch Set 1:

(2 comments)

Todd, what are your thoughts on having a "per table" property that can 
blacklist that particular table from loading in Impala? For example, something 
like

alter table foo set tblproperties ("load-in-impala" = "false")

The issue with having a global max partitions is that

- It is a start up flag (requires a restart)
- Lets say if someone sets it to 100k, we can still have multiple tables with 
90k partitions or something like that
- In many cases users would like to blacklist some specific problematic tables 
(say with large incremental stats) and having just a partition limit may not 
help


Vuk and I were discussing this offline today and I thought I'd bring this up in 
this CR. It is unclear if both these flags can co-exist or we should just have 
one of them. Thoughts?

http://gerrit.cloudera.org:8080/#/c/10584/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/10584/1//COMMIT_MSG@21
PS1, Line 21: worth the complexity to try to
: somehow limit the addition of partitions during a DML
> The problem is that the catalog update happens at the end of the DML, after
I lean towards (2) but bubble up a meaningful error message for the DML query 
and not the generic message that we are not able to load the table. I prefer 
(2) because (1) leaves the table on HDFS in an inconsistent state.


http://gerrit.cloudera.org:8080/#/c/10584/1/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/10584/1/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@2067
PS1, Line 2067: if (curPartitionCount + allHmsPartitionsToAdd.size() > 
HdfsTable.MAX_PARTITIONS_PER_TABLE) {
> its not explicitly stated for java in the style guide, but given the limits
Yea, we use 90 chars for Java too. Gerrit defaults to 100.



--
To view, visit http://gerrit.cloudera.org:8080/10584
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ife99a97a891ed14675303ea472abb2932a72cb51
Gerrit-Change-Number: 10584
Gerrit-PatchSet: 1
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 22:00:04 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] Fix Indents from IMPALA-4970

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10560 )

Change subject: Fix Indents from IMPALA-4970
..


Patch Set 2: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/2595/


--
To view, visit http://gerrit.cloudera.org:8080/10560
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4ea53e0780cca9f791808c2a73c7f72ba27c2159
Gerrit-Change-Number: 10560
Gerrit-PatchSet: 2
Gerrit-Owner: Rahul Shivu Mahadev 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jim Apple 
Gerrit-Reviewer: Rahul Shivu Mahadev 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 21:59:57 +
Gerrit-HasComments: No


[Impala-ASF-CR] [experimental] Clang Tidy Diff trial balloon

2018-06-05 Thread Jim Apple (Code Review)
Jim Apple has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/9751 )

Change subject: [experimental] Clang Tidy Diff trial balloon
..


Patch Set 2:

Any updates on this?


--
To view, visit http://gerrit.cloudera.org:8080/9751
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie2fb6a13400367fd3d12a4738bbb2dfc944466a7
Gerrit-Change-Number: 9751
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Jim Apple 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 21:54:19 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7106: Print rewritten SQL correctly when log trace is enabled

2018-06-05 Thread Tianyi Wang (Code Review)
Tianyi Wang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10571 )

Change subject: IMPALA-7106: Print rewritten SQL correctly when log trace is 
enabled
..


Patch Set 5:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/10571/3/fe/src/main/java/org/apache/impala/analysis/SelectStmt.java
File fe/src/main/java/org/apache/impala/analysis/SelectStmt.java:

http://gerrit.cloudera.org:8080/#/c/10571/3/fe/src/main/java/org/apache/impala/analysis/SelectStmt.java@972
PS3, Line 972: .toSql()
> - FromClause does not extend from StatementBase.
I tried "SELECT * FROM (SELECT (CASE WHEN TRUE THEN 1 ELSE id END) FROM 
functional.alltypes UNION SELECT 1 + 1) v" on toRewrittenSql and it returns the 
unrewritten query. Isn't it wired to print the rewritten outmost select block 
but left the inline view original?


http://gerrit.cloudera.org:8080/#/c/10571/5/fe/src/main/java/org/apache/impala/analysis/StmtRewriter.java
File fe/src/main/java/org/apache/impala/analysis/StmtRewriter.java:

http://gerrit.cloudera.org:8080/#/c/10571/5/fe/src/main/java/org/apache/impala/analysis/StmtRewriter.java@117
PS5, Line 117: stmt.sqlString_ = null;
I don't quite understand why you chose to restore the sql string this way. Have 
you tried removing this line? I don't know what it's for but I expect we don't 
fundamentally need it.



--
To view, visit http://gerrit.cloudera.org:8080/10571
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iab58b0cc865135d261dd4a7f72be130f2e7bde53
Gerrit-Change-Number: 10571
Gerrit-PatchSet: 5
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 21:47:59 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] Guard S3 access check with variable

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10607 )

Change subject: Guard S3 access check with variable
..


Patch Set 2: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10607
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1df16b2d037afafb7748f52b64732f680883f16c
Gerrit-Change-Number: 10607
Gerrit-PatchSet: 2
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 21:47:01 +
Gerrit-HasComments: No


[Impala-ASF-CR] Guard S3 access check with variable

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10607 )

Change subject: Guard S3 access check with variable
..

Guard S3 access check with variable

We've recently seen automated build failures where multiple inclusions
of impala-config.sh led to the S3 access check being performed multiple
times. This would trigger rate limiting on S3's side, which then caused
the jobs to fail.

This change adds a guard to only perform the check once, even when
including impala-config.sh multiple times.

To test this I included impala-config.sh multiple times on my local
shell and observed that the check was only performed once.

Change-Id: I1df16b2d037afafb7748f52b64732f680883f16c
Reviewed-on: http://gerrit.cloudera.org:8080/10607
Reviewed-by: Lars Volker 
Tested-by: Impala Public Jenkins 
---
M bin/impala-config.sh
1 file changed, 11 insertions(+), 4 deletions(-)

Approvals:
  Lars Volker: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10607
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I1df16b2d037afafb7748f52b64732f680883f16c
Gerrit-Change-Number: 10607
Gerrit-PatchSet: 3
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-7106: Print rewritten SQL correctly when log trace is enabled

2018-06-05 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10571 )

Change subject: IMPALA-7106: Print rewritten SQL correctly when log trace is 
enabled
..


Patch Set 5:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/10571/5/fe/src/main/java/org/apache/impala/analysis/SelectStmt.java
File fe/src/main/java/org/apache/impala/analysis/SelectStmt.java:

http://gerrit.cloudera.org:8080/#/c/10571/5/fe/src/main/java/org/apache/impala/analysis/SelectStmt.java@949
PS5, Line 949: toSqlString();
> I think this method name is confusing. I think we should rename this. Any s
The more I think about it, I don't think it's easy to figure out if the expr 
has been rewritten if not from the call-site. Calling reset() is one way of 
determining if an expr is rewritten, but it is also possible to clone() the 
expr so the reset() never gets called and it'll be hard to figure out if the 
expr is rewritten.

I'm also a bit hesitant of adding another state since we already have so many 
states and we also need to figure out whether or not to clear the state in the 
reset().



--
To view, visit http://gerrit.cloudera.org:8080/10571
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iab58b0cc865135d261dd4a7f72be130f2e7bde53
Gerrit-Change-Number: 10571
Gerrit-PatchSet: 5
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 21:43:09 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7016: Implement ALTER DATABASE SET OWNER

2018-06-05 Thread Adam Holley (Code Review)
Adam Holley has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10471 )

Change subject: IMPALA-7016: Implement ALTER DATABASE SET OWNER
..


Patch Set 3:

(4 comments)

http://gerrit.cloudera.org:8080/#/c/10471/3/common/thrift/JniCatalog.thrift
File common/thrift/JniCatalog.thrift:

http://gerrit.cloudera.org:8080/#/c/10471/3/common/thrift/JniCatalog.thrift@55
PS3, Line 55:   COMMENT_ON
need comma


http://gerrit.cloudera.org:8080/#/c/10471/3/fe/src/main/cup/sql-parser.cup
File fe/src/main/cup/sql-parser.cup:

http://gerrit.cloudera.org:8080/#/c/10471/3/fe/src/main/cup/sql-parser.cup@1030
PS3, Line 1030:   {:
Can you add a short comment as to why USER cannot be a keyword?


http://gerrit.cloudera.org:8080/#/c/10471/3/fe/src/main/java/org/apache/impala/analysis/OwnerType.java
File fe/src/main/java/org/apache/impala/analysis/OwnerType.java:

http://gerrit.cloudera.org:8080/#/c/10471/3/fe/src/main/java/org/apache/impala/analysis/OwnerType.java@23
PS3, Line 23: public class OwnerType {
nit: Can the name be changed since it's not just type but includes the name?  
e.g. just Owner or OwnerContext


http://gerrit.cloudera.org:8080/#/c/10471/3/fe/src/test/java/org/apache/impala/analysis/ParserTest.java
File fe/src/test/java/org/apache/impala/analysis/ParserTest.java:

http://gerrit.cloudera.org:8080/#/c/10471/3/fe/src/test/java/org/apache/impala/analysis/ParserTest.java@3757
PS3, Line 3757: ParserError("ALTER DATABASE SET OWNER ROLE foo");
Add test for no parms after OWNER.
e.g. ALTER DATABASE db SET OWNER



--
To view, visit http://gerrit.cloudera.org:8080/10471
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie3b923021ebce5192d2d64784e7ddb952ba82bc3
Gerrit-Change-Number: 10471
Gerrit-PatchSet: 3
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Adam Holley 
Gerrit-Reviewer: Alex Behm 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 21:30:46 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-5552: Add support for authorized proxy groups

2018-06-05 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10510 )

Change subject: IMPALA-5552: Add support for authorized proxy groups
..


Patch Set 8:

Philip, can you review this and give a +2 if you don't have any concerns?


--
To view, visit http://gerrit.cloudera.org:8080/10510
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6953f89c293b06b72f523b11802232133d9d6cbb
Gerrit-Change-Number: 10510
Gerrit-PatchSet: 8
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Greg Rahn 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 21:11:32 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7120: remove sonatype repository

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10606 )

Change subject: IMPALA-7120: remove sonatype repository
..

IMPALA-7120: remove sonatype repository

>From what I can tell, we no longer need this because we get java-cup
from a different repository.

I started a build and we built the frontend ok, so unless I'm missing
something, that seems to confirm that it works.

Change-Id: I8b7acee041019eecc8f664f2093dd3df72341a7e
Reviewed-on: http://gerrit.cloudera.org:8080/10606
Reviewed-by: Philip Zeyliger 
Tested-by: Tim Armstrong 
---
M impala-parent/pom.xml
1 file changed, 0 insertions(+), 26 deletions(-)

Approvals:
  Philip Zeyliger: Looks good to me, approved
  Tim Armstrong: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10606
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I8b7acee041019eecc8f664f2093dd3df72341a7e
Gerrit-Change-Number: 10606
Gerrit-PatchSet: 3
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-7120: remove sonatype repository

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10606 )

Change subject: IMPALA-7120: remove sonatype repository
..


Patch Set 2: Verified+1

Yep, it worked after I blew away ~/.m2/repository


--
To view, visit http://gerrit.cloudera.org:8080/10606
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8b7acee041019eecc8f664f2093dd3df72341a7e
Gerrit-Change-Number: 10606
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 21:10:42 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7106: Print rewritten SQL correctly when log trace is enabled

2018-06-05 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10571 )

Change subject: IMPALA-7106: Print rewritten SQL correctly when log trace is 
enabled
..


Patch Set 5:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/10571/5/fe/src/main/java/org/apache/impala/analysis/SelectStmt.java
File fe/src/main/java/org/apache/impala/analysis/SelectStmt.java:

http://gerrit.cloudera.org:8080/#/c/10571/5/fe/src/main/java/org/apache/impala/analysis/SelectStmt.java@949
PS5, Line 949: toSqlString();
> how do we know that the expression has been rewritten? you're controlling t
I think this method name is confusing. I think we should rename this. Any 
suggestion on the name?

It's basically a toSql() that doesn't use any SQL string that was previously 
stored.



--
To view, visit http://gerrit.cloudera.org:8080/10571
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iab58b0cc865135d261dd4a7f72be130f2e7bde53
Gerrit-Change-Number: 10571
Gerrit-PatchSet: 5
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 21:07:14 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6917: Implement COMMENT ON TABLE/VIEW

2018-06-05 Thread Vuk Ercegovac (Code Review)
Vuk Ercegovac has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10478 )

Change subject: IMPALA-6917: Implement COMMENT ON TABLE/VIEW
..


Patch Set 6: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/10478
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I497c17342f79ff7c99931fd8a0ddec0c79303dbf
Gerrit-Change-Number: 10478
Gerrit-PatchSet: 6
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Alex Behm 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 21:03:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7106: Print rewritten SQL correctly when log trace is enabled

2018-06-05 Thread Vuk Ercegovac (Code Review)
Vuk Ercegovac has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10571 )

Change subject: IMPALA-7106: Print rewritten SQL correctly when log trace is 
enabled
..


Patch Set 5:

(1 comment)

thanks for the change. this will be a useful step to make it easier to 
debug/make sense of errors after the rewrite phase. main comment is about 
guarantees for the new method.

http://gerrit.cloudera.org:8080/#/c/10571/5/fe/src/main/java/org/apache/impala/analysis/SelectStmt.java
File fe/src/main/java/org/apache/impala/analysis/SelectStmt.java:

http://gerrit.cloudera.org:8080/#/c/10571/5/fe/src/main/java/org/apache/impala/analysis/SelectStmt.java@949
PS5, Line 949: toSqlString();
how do we know that the expression has been rewritten? you're controlling this 
from one specific call-site, but I'd prefer that this expr know whether its 
been rewritten and whether this method makes sense to call. same goes for the 
other impls.
we know that the expr is analyzed-- that's a start. is additional state needed 
to record if rewritten?



--
To view, visit http://gerrit.cloudera.org:8080/10571
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iab58b0cc865135d261dd4a7f72be130f2e7bde53
Gerrit-Change-Number: 10571
Gerrit-PatchSet: 5
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 21:02:44 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7120: remove sonatype repository

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10606 )

Change subject: IMPALA-7120: remove sonatype repository
..


Patch Set 2:

I'll try that also to confirm.


--
To view, visit http://gerrit.cloudera.org:8080/10606
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8b7acee041019eecc8f664f2093dd3df72341a7e
Gerrit-Change-Number: 10606
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:55:22 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7120: remove sonatype repository

2018-06-05 Thread Philip Zeyliger (Code Review)
Philip Zeyliger has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10606 )

Change subject: IMPALA-7120: remove sonatype repository
..


Patch Set 2: Code-Review+2

If this passes the build, it should be good. I hope you tested it after blowing 
away your ~/.m2/repository so that the test would be clean, but I'm happy to 
let Jenkins robots try it.


--
To view, visit http://gerrit.cloudera.org:8080/10606
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8b7acee041019eecc8f664f2093dd3df72341a7e
Gerrit-Change-Number: 10606
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:49:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7120: remove sonatype repository

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10606 )

Change subject: IMPALA-7120: remove sonatype repository
..


Patch Set 2:

I looked at the mvn logs and it appears that this change prevented it from 
contacting oss.sonatype.org


--
To view, visit http://gerrit.cloudera.org:8080/10606
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8b7acee041019eecc8f664f2093dd3df72341a7e
Gerrit-Change-Number: 10606
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:47:04 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6917: Implement COMMENT ON TABLE/VIEW

2018-06-05 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10478 )

Change subject: IMPALA-6917: Implement COMMENT ON TABLE/VIEW
..


Patch Set 6:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/10478/5/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/10478/5/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@3554
PS5, Line 3554:   msTbl.getParameters().put("comment", co
> loadTableMetadata sets the catalog version, so this line can be removed.
Good catch! Done.



--
To view, visit http://gerrit.cloudera.org:8080/10478
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I497c17342f79ff7c99931fd8a0ddec0c79303dbf
Gerrit-Change-Number: 10478
Gerrit-PatchSet: 6
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Alex Behm 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:46:28 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7100: [DOCS] Consistent memory alloc across executor nodes

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10561 )

Change subject: IMPALA-7100: [DOCS] Consistent memory alloc across executor 
nodes
..

IMPALA-7100: [DOCS] Consistent memory alloc across executor nodes

Change-Id: I22926eb6050f9501624d2041f594fe4ef15be73b
Reviewed-on: http://gerrit.cloudera.org:8080/10561
Reviewed-by: Balazs Jeszenszky 
Reviewed-by: Alex Rodoni 
Tested-by: Impala Public Jenkins 
---
M docs/topics/impala_prereqs.xml
1 file changed, 57 insertions(+), 39 deletions(-)

Approvals:
  Balazs Jeszenszky: Looks good to me, but someone else must approve
  Alex Rodoni: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10561
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I22926eb6050f9501624d2041f594fe4ef15be73b
Gerrit-Change-Number: 10561
Gerrit-PatchSet: 3
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Balazs Jeszenszky 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-6917: Implement COMMENT ON TABLE/VIEW

2018-06-05 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has uploaded a new patch set (#6). ( 
http://gerrit.cloudera.org:8080/10478 )

Change subject: IMPALA-6917: Implement COMMENT ON TABLE/VIEW
..

IMPALA-6917: Implement COMMENT ON TABLE/VIEW

This patch implements updating comment on a table or view.

Syntax:
COMMENT ON TABLE t IS 'comment'
COMMENT ON VIEW v IS 'comment'

Testing:
- Added new front-end tests
- Ran all front-end tests
- Added new end-to-end tests
- Ran end-to-end DDL tests

Change-Id: I497c17342f79ff7c99931fd8a0ddec0c79303dbf
---
M common/thrift/JniCatalog.thrift
M fe/src/main/cup/sql-parser.cup
M fe/src/main/java/org/apache/impala/analysis/CommentOnDbStmt.java
M fe/src/main/java/org/apache/impala/analysis/CommentOnStmt.java
A fe/src/main/java/org/apache/impala/analysis/CommentOnTableOrViewStmt.java
A fe/src/main/java/org/apache/impala/analysis/CommentOnTableStmt.java
A fe/src/main/java/org/apache/impala/analysis/CommentOnViewStmt.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java
M fe/src/test/java/org/apache/impala/analysis/AuthorizationTest.java
M fe/src/test/java/org/apache/impala/analysis/ParserTest.java
M tests/metadata/test_ddl.py
M tests/metadata/test_ddl_base.py
13 files changed, 339 insertions(+), 5 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/78/10478/6
--
To view, visit http://gerrit.cloudera.org:8080/10478
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I497c17342f79ff7c99931fd8a0ddec0c79303dbf
Gerrit-Change-Number: 10478
Gerrit-PatchSet: 6
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Alex Behm 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Vuk Ercegovac 


[Impala-ASF-CR] IMPALA-6987: [DOCS] Refactor the INVALIDATE METADATA and REFRESH docs

2018-06-05 Thread Alex Rodoni (Code Review)
Alex Rodoni has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10339 )

Change subject: IMPALA-6987: [DOCS] Refactor the INVALIDATE METADATA and 
REFRESH docs
..


Patch Set 5:

Hi Vuk,
Could you review the latest patch that addressed your comments?
Thank you!


--
To view, visit http://gerrit.cloudera.org:8080/10339
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2124e14900d0f82569c061cc46006447bb054b36
Gerrit-Change-Number: 10339
Gerrit-PatchSet: 5
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Behm 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Balazs Jeszenszky 
Gerrit-Reviewer: Dimitris Tsirogiannis 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:39:05 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7110. Fix errors from error-prone

2018-06-05 Thread Todd Lipcon (Code Review)
Hello Bharath Vissapragada, Philip Zeyliger,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10583

to look at the new patch set (#2).

Change subject: IMPALA-7110. Fix errors from error-prone
..

IMPALA-7110. Fix errors from error-prone

* Fixes several cases of Preconditions.checkNotNull() which were
  ineffective. Instead, we should use Preconditions.checkState.

* Fixed a case of synchronizing on a non-final class member, which is a
  bad practice.

* Fixed a case of trying to stringify an array instead of using Joiner
  to join it to a human-readable representation.

* Fixed a couple cases where the format string in String.format didn't
  match the number of arguments passed.

* Fixed a test where PrimitiveType objects were being used to look up
  elements in a Map. This would always return null,
  so the test was ineffective.

I elected not to enable error-prone in the default maven profile since
it increases compilation time noticeably. It can be enabled by running
'mvn -Perrorprone'.

Even with this patch there are a bunch of warnings -- this only
addresses the issues that errorprone considers "errors".

Change-Id: Ie9bfadecb5b92bba4fc7921a6f87f249ed90e771
---
M fe/pom.xml
M fe/src/main/java/org/apache/impala/analysis/CreateUdfStmt.java
M fe/src/main/java/org/apache/impala/analysis/LiteralExpr.java
M fe/src/main/java/org/apache/impala/analysis/Path.java
M fe/src/main/java/org/apache/impala/catalog/DiskIdMapper.java
M fe/src/main/java/org/apache/impala/catalog/ScalarFunction.java
M fe/src/main/java/org/apache/impala/catalog/StructType.java
M fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeExprsTest.java
M fe/src/test/java/org/apache/impala/analysis/ParserTest.java
M fe/src/test/java/org/apache/impala/hive/executor/UdfExecutorTest.java
11 files changed, 51 insertions(+), 14 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/83/10583/2
--
To view, visit http://gerrit.cloudera.org:8080/10583
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ie9bfadecb5b92bba4fc7921a6f87f249ed90e771
Gerrit-Change-Number: 10583
Gerrit-PatchSet: 2
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Todd Lipcon 


[Impala-ASF-CR] IMPALA-7100: [DOCS] Consistent memory alloc across executor nodes

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10561 )

Change subject: IMPALA-7100: [DOCS] Consistent memory alloc across executor 
nodes
..


Patch Set 2:

Build started: https://jenkins.impala.io/job/gerrit-docs-submit/308/


--
To view, visit http://gerrit.cloudera.org:8080/10561
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I22926eb6050f9501624d2041f594fe4ef15be73b
Gerrit-Change-Number: 10561
Gerrit-PatchSet: 2
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Balazs Jeszenszky 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:36:38 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7100: [DOCS] Consistent memory alloc across executor nodes

2018-06-05 Thread Alex Rodoni (Code Review)
Alex Rodoni has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10561 )

Change subject: IMPALA-7100: [DOCS] Consistent memory alloc across executor 
nodes
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/10561
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I22926eb6050f9501624d2041f594fe4ef15be73b
Gerrit-Change-Number: 10561
Gerrit-PatchSet: 2
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Balazs Jeszenszky 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:36:21 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7110. Fix some warnings from error-prone

2018-06-05 Thread Todd Lipcon (Code Review)
Todd Lipcon has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10583 )

Change subject: IMPALA-7110. Fix some warnings from error-prone
..


Patch Set 1:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/10583/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/10583/1//COMMIT_MSG@21
PS1, Line 21: I elected not to enable error-prone in the maven settings since it
> I'll give this a try, though usually when I attempt to do this kind of thin
turned out to be not that bad to add a profile so I just added to this patch


http://gerrit.cloudera.org:8080/#/c/10583/1/fe/src/main/java/org/apache/impala/catalog/ScalarFunction.java
File fe/src/main/java/org/apache/impala/catalog/ScalarFunction.java:

http://gerrit.cloudera.org:8080/#/c/10583/1/fe/src/main/java/org/apache/impala/catalog/ScalarFunction.java@85
PS1, Line 85: argTypes
> interesting, wonder why it didn't catch this one. Will do.
argTypes is a List, so the built-in toString actually works. My fix down below 
was on the wrong spot in the line - it was actually complaining about the 
exception stack trace and I mis-fixed the error.



--
To view, visit http://gerrit.cloudera.org:8080/10583
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie9bfadecb5b92bba4fc7921a6f87f249ed90e771
Gerrit-Change-Number: 10583
Gerrit-PatchSet: 1
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:36:03 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7100: [DOCS] Consistent memory alloc across executor nodes

2018-06-05 Thread Balazs Jeszenszky (Code Review)
Balazs Jeszenszky has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10561 )

Change subject: IMPALA-7100: [DOCS] Consistent memory alloc across executor 
nodes
..


Patch Set 2: Code-Review+1

Feel free to carry the +1 if you've addressed the comments.


--
To view, visit http://gerrit.cloudera.org:8080/10561
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I22926eb6050f9501624d2041f594fe4ef15be73b
Gerrit-Change-Number: 10561
Gerrit-PatchSet: 2
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Balazs Jeszenszky 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:33:53 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5380: [DOCS] Added additional filesystems supported in URI

2018-06-05 Thread Alex Rodoni (Code Review)
Alex Rodoni has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10551 )

Change subject: IMPALA-5380: [DOCS] Added additional filesystems supported in 
URI
..


Patch Set 2:

Hi Lars,
Could you review the patch and approve the changes?
Thanks!


--
To view, visit http://gerrit.cloudera.org:8080/10551
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I11d57efd564fb5779539a88d7b209cee93d313c6
Gerrit-Change-Number: 10551
Gerrit-PatchSet: 2
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:33:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7100: [DOCS] Consistent memory alloc across executor nodes

2018-06-05 Thread Alex Rodoni (Code Review)
Alex Rodoni has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10561 )

Change subject: IMPALA-7100: [DOCS] Consistent memory alloc across executor 
nodes
..


Patch Set 2:

HI Balazs,
Could you review my latest patch and approve?
Thank you!


--
To view, visit http://gerrit.cloudera.org:8080/10561
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I22926eb6050f9501624d2041f594fe4ef15be73b
Gerrit-Change-Number: 10561
Gerrit-PatchSet: 2
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Balazs Jeszenszky 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:32:20 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7050: [DOCS] Document the max serialized incremental stat size setting

2018-06-05 Thread Balazs Jeszenszky (Code Review)
Balazs Jeszenszky has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10457 )

Change subject: IMPALA-7050: [DOCS] Document the max serialized incremental 
stat size setting
..


Patch Set 3:

(7 comments)

initial pass

http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_perf_stats.xml
File docs/topics/impala_perf_stats.xml:

http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_perf_stats.xml@709
PS3, Line 709: When you need to run COMPUTE INCREMENTAL STATS 
on
 : very large tables, you can use the configuration 
setting
 :   inc_stats_size_limit_bytes to 
reduce the load on
 : the catalog server.
turn into passive?


http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_perf_stats.xml@714
PS3, Line 714: This limit is set as a safety check, to prevent the JVM from 
hitting
 : a maximum array limit of 1 GB or runs out of memory.
the limit is 2GB with suggested JVM versions. Readers would need to understand 
that this is only a part of the entire table's metadata (all of which together 
must be below 2GB when serialized as Thrift). So while we're trying to avoid a 
2GB limit, increasing the 200MB limit can easily lead to crashes (other parts 
of the table's metadata can add up quickly).


http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_perf_stats.xml@724
PS3, Line 724: To change the inc_stats_size_limit_bytes value
This is a startup option for the impala daemons and the catalogd, not a client 
(impala-shell) side config.


http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_perf_stats.xml@733
PS3, Line 733: 1 GB
I'd put 500MB


http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_perf_stats.xml@733
PS3, Line 733: spike in heap usage
as well as crashes.


http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_perf_stats.xml@736
PS3, Line 736: 
 : Setting inc_stats_size_limit_bytes 
to a big value,
 : such as 1 GB or more, can result in a spike in heap 
usage.
 :   
duplicate of above?


http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_shell_options.xml
File docs/topics/impala_shell_options.xml:

http://gerrit.cloudera.org:8080/#/c/10457/3/docs/topics/impala_shell_options.xml@570
PS3, Line 570: --inc_stats_size_limit_bytes
this is an impalad-side option



--
To view, visit http://gerrit.cloudera.org:8080/10457
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ifa80325f0008d42a9cc8178e7c144fc2b49d7d4e
Gerrit-Change-Number: 10457
Gerrit-PatchSet: 3
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Balazs Jeszenszky 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:31:20 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7101: Fix race between Fetch and Close RPCs that can lead to hang

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10601 )

Change subject: IMPALA-7101: Fix race between Fetch and Close RPCs that can 
lead to hang
..


Patch Set 6:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/2600/


--
To view, visit http://gerrit.cloudera.org:8080/10601
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7c147550f86d81b818ecbdd34cf2919ced7ff8c5
Gerrit-Change-Number: 10601
Gerrit-PatchSet: 6
Gerrit-Owner: Dan Hecht 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:26:24 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7101: Fix race between Fetch and Close RPCs that can lead to hang

2018-06-05 Thread Dan Hecht (Code Review)
Dan Hecht has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10601 )

Change subject: IMPALA-7101: Fix race between Fetch and Close RPCs that can 
lead to hang
..


Patch Set 5:

Tim, please take a look at the changes to the test in patchset 5 vs patchset 3.


--
To view, visit http://gerrit.cloudera.org:8080/10601
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7c147550f86d81b818ecbdd34cf2919ced7ff8c5
Gerrit-Change-Number: 10601
Gerrit-PatchSet: 5
Gerrit-Owner: Dan Hecht 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:26:01 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7101: Fix race between Fetch and Close RPCs that can lead to hang

2018-06-05 Thread Dan Hecht (Code Review)
Hello Tim Armstrong,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10601

to look at the new patch set (#5).

Change subject: IMPALA-7101: Fix race between Fetch and Close RPCs that can 
lead to hang
..

IMPALA-7101: Fix race between Fetch and Close RPCs that can lead to hang

If we hit EOS, we'll wait for all the backends to report status (to try
to get a complete profile). But if the query is closed after this point,
then we can get stuck waiting since once the query is closed,
ImpalaServer won't know about this coordinator and so it will stop
forwarding on the ReportStatus RPCs.

The real fix for this is IMPALA-6984, but in the mean time, add another
special case for this JIRA (see the other TODO IMPALA-6984 in
coordinator.cc).

The cancellation test only finds this race once in a while (several
hours) indirectly in a COMPUTE STATS query because the
ChildQueryExecutor will do a CloseOperation() while the execution thread
is inside Fetch(). To make this more reproducible, modify the
cancellation test to allow the close and fetch rpcs to execute
concurrently (don't join the test's fetch thread until after
close). This makes the race reproducible in a few iterations and a few
minutes.

Testing:
- Loop test_cancellation.py

Change-Id: I7c147550f86d81b818ecbdd34cf2919ced7ff8c5
---
M be/src/runtime/coordinator.cc
M tests/query_test/test_cancellation.py
2 files changed, 47 insertions(+), 9 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/01/10601/5
--
To view, visit http://gerrit.cloudera.org:8080/10601
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I7c147550f86d81b818ecbdd34cf2919ced7ff8c5
Gerrit-Change-Number: 10601
Gerrit-PatchSet: 5
Gerrit-Owner: Dan Hecht 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-7101: Fix race between Fetch and Close RPCs that can lead to hang

2018-06-05 Thread Dan Hecht (Code Review)
Dan Hecht has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10601 )

Change subject: IMPALA-7101: Fix race between Fetch and Close RPCs that can 
lead to hang
..


Patch Set 4:

Oops, hold on one second.


--
To view, visit http://gerrit.cloudera.org:8080/10601
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7c147550f86d81b818ecbdd34cf2919ced7ff8c5
Gerrit-Change-Number: 10601
Gerrit-PatchSet: 4
Gerrit-Owner: Dan Hecht 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:24:16 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7101: Fix race between Fetch and Close RPCs that can lead to hang

2018-06-05 Thread Dan Hecht (Code Review)
Dan Hecht has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10601 )

Change subject: IMPALA-7101: Fix race between Fetch and Close RPCs that can 
lead to hang
..


Patch Set 3:

(3 comments)

Tim, could you take a look at the new version of the test?

http://gerrit.cloudera.org:8080/#/c/10601/3/tests/query_test/test_cancellation.py
File tests/query_test/test_cancellation.py:

http://gerrit.cloudera.org:8080/#/c/10601/3/tests/query_test/test_cancellation.py@a148
PS3, Line 148:
> I realized last night that it's best if we sometimes keep this join here, s
Done


http://gerrit.cloudera.org:8080/#/c/10601/3/tests/query_test/test_cancellation.py@152
PS3, Line 152: # If the query is cancelled while it's in the fetch rpc, 
it gets unregistered and
> This comment seems out-of-date
Done


http://gerrit.cloudera.org:8080/#/c/10601/3/tests/query_test/test_cancellation.py@230
PS3, Line 230: TestCancellatio
> that should be TestCancellationFullSort (otherwise the super method is not
Done



--
To view, visit http://gerrit.cloudera.org:8080/10601
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7c147550f86d81b818ecbdd34cf2919ced7ff8c5
Gerrit-Change-Number: 10601
Gerrit-PatchSet: 3
Gerrit-Owner: Dan Hecht 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 20:23:44 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7101: Fix race between Fetch and Close RPCs that can lead to hang

2018-06-05 Thread Dan Hecht (Code Review)
Hello Tim Armstrong,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10601

to look at the new patch set (#4).

Change subject: IMPALA-7101: Fix race between Fetch and Close RPCs that can 
lead to hang
..

IMPALA-7101: Fix race between Fetch and Close RPCs that can lead to hang

If we hit EOS, we'll wait for all the backends to report status (to try
to get a complete profile). But if the query is closed after this point,
then we can get stuck waiting since once the query is closed,
ImpalaServer won't know about this coordinator and so it will stop
forwarding on the ReportStatus RPCs.

The real fix for this is IMPALA-6984, but in the mean time, add another
special case for this JIRA (see the other TODO IMPALA-6984 in
coordinator.cc).

The cancellation test only finds this race once in a while (several
hours) indirectly in a COMPUTE STATS query because the
ChildQueryExecutor will do a CloseOperation() while the execution thread
is inside Fetch(). To make this more reproducible, modify the
cancellation test to allow the close and fetch rpcs to execute
concurrently (don't join the test's fetch thread until after
close). This makes the race reproducible in a few iterations and a few
minutes.

Testing:
- Loop test_cancellation.py

Change-Id: I7c147550f86d81b818ecbdd34cf2919ced7ff8c5
---
M be/src/runtime/coordinator.cc
M tests/query_test/test_cancellation.py
2 files changed, 47 insertions(+), 9 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/01/10601/4
--
To view, visit http://gerrit.cloudera.org:8080/10601
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I7c147550f86d81b818ecbdd34cf2919ced7ff8c5
Gerrit-Change-Number: 10601
Gerrit-PatchSet: 4
Gerrit-Owner: Dan Hecht 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-6812: Fix flaky Kudu scan tests

2018-06-05 Thread David Ribeiro Alves (Code Review)
David Ribeiro Alves has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10503 )

Change subject: IMPALA-6812: Fix flaky Kudu scan tests
..


Patch Set 1:

READ_AT_SNAPSHOT was reverted because it caused flakyness in stress tests iirc.

Note also that both READ_AT_SNAPSHOT and READ_YOUR_WRITES need timestamp 
propagation between clients/queries to work properly.

READ_AT_SNAPSHOT, while not consistent without timestamp propagation, 
effectively provided RYW by forcing the read to happen at now(), forcibly after 
the last write and without coordination. But READ_AT_SNAPSHOT requires an 
effective leader to work, which were sometimes missing in stress tests.

I realize that impala added a kudu client cache to the backend. If that makes 
it so that the same client is used across different queries, then RYW should 
both solve consistency issues and be more lenient regarding leaderless tablets. 
If the same client is not always used, then RYW might help, but likely won't 
completely solve the problem.


--
To view, visit http://gerrit.cloudera.org:8080/10503
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I70df84f2cbc663107f2ad029565d3c15bdfbd47c
Gerrit-Change-Number: 10503
Gerrit-PatchSet: 1
Gerrit-Owner: Thomas Marshall 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: David Ribeiro Alves 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Comment-Date: Tue, 05 Jun 2018 18:54:47 +
Gerrit-HasComments: No


[Impala-ASF-CR] Add missing namespace qualifiers

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10608 )

Change subject: Add missing namespace qualifiers
..


Patch Set 1:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/2599/


--
To view, visit http://gerrit.cloudera.org:8080/10608
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3ef8f5c8f05100418dec90c2cd2e2b3dcd827c27
Gerrit-Change-Number: 10608
Gerrit-PatchSet: 1
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 18:48:42 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7120: remove sonatype repository

2018-06-05 Thread Tim Armstrong (Code Review)
Hello Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10606

to look at the new patch set (#2).

Change subject: IMPALA-7120: remove sonatype repository
..

IMPALA-7120: remove sonatype repository

>From what I can tell, we no longer need this because we get java-cup
from a different repository.

I started a build and we built the frontend ok, so unless I'm missing
something, that seems to confirm that it works.

Change-Id: I8b7acee041019eecc8f664f2093dd3df72341a7e
---
M impala-parent/pom.xml
1 file changed, 0 insertions(+), 26 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/06/10606/2
--
To view, visit http://gerrit.cloudera.org:8080/10606
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8b7acee041019eecc8f664f2093dd3df72341a7e
Gerrit-Change-Number: 10606
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] Add missing namespace qualifiers

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10608 )

Change subject: Add missing namespace qualifiers
..


Patch Set 1: Code-Review+2

Thanks! Seems ok to go in now.


--
To view, visit http://gerrit.cloudera.org:8080/10608
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3ef8f5c8f05100418dec90c2cd2e2b3dcd827c27
Gerrit-Change-Number: 10608
Gerrit-PatchSet: 1
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 05 Jun 2018 18:43:50 +
Gerrit-HasComments: No


[Impala-ASF-CR] Add missing namespace qualifiers

2018-06-05 Thread Lars Volker (Code Review)
Lars Volker has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/10608


Change subject: Add missing namespace qualifiers
..

Add missing namespace qualifiers

As part of an upcoming rebase of the KRPC code, this change adds missing
namespace qualifiers in some places.

Change-Id: I3ef8f5c8f05100418dec90c2cd2e2b3dcd827c27
---
M be/src/exec/kudu-util.h
M be/src/runtime/descriptors.h
2 files changed, 2 insertions(+), 2 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/08/10608/1
--
To view, visit http://gerrit.cloudera.org:8080/10608
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I3ef8f5c8f05100418dec90c2cd2e2b3dcd827c27
Gerrit-Change-Number: 10608
Gerrit-PatchSet: 1
Gerrit-Owner: Lars Volker 


[Impala-ASF-CR] Fix Indents from IMPALA-4970

2018-06-05 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10560 )

Change subject: Fix Indents from IMPALA-4970
..


Patch Set 2:

Thanks! Sorry to be pedantic


--
To view, visit http://gerrit.cloudera.org:8080/10560
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4ea53e0780cca9f791808c2a73c7f72ba27c2159
Gerrit-Change-Number: 10560
Gerrit-PatchSet: 2
Gerrit-Owner: Rahul Shivu Mahadev 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jim Apple 
Gerrit-Reviewer: Rahul Shivu Mahadev 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 05 Jun 2018 18:38:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5737: Tighten minicluster memory limit

2018-06-05 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10277 )

Change subject: IMPALA-5737: Tighten minicluster memory limit
..


Patch Set 7:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/2598/


--
To view, visit http://gerrit.cloudera.org:8080/10277
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If8240551e726c6da546a926a1ce3444f41ef87fe
Gerrit-Change-Number: 10277
Gerrit-PatchSet: 7
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Comment-Date: Tue, 05 Jun 2018 18:37:14 +
Gerrit-HasComments: No


  1   2   >