[Impala-ASF-CR] IMPALA-9071: Handle translated external HDFS table in CATS

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14527 )

Change subject: IMPALA-9071: Handle translated external HDFS table in CATS
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/4866/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/14527
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I460a57dc877ef68ad7dd0864a33b1599b1e9a8d9
Gerrit-Change-Number: 14527
Gerrit-PatchSet: 6
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Thu, 24 Oct 2019 05:35:48 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9071: Handle translated external HDFS table in CATS

2019-10-23 Thread Quanlong Huang (Code Review)
Hello Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/14527

to look at the new patch set (#6).

Change subject: IMPALA-9071: Handle translated external HDFS table in CATS
..

IMPALA-9071: Handle translated external HDFS table in CATS

After upgrading Hive-3 to a version containing HIVE-22158, managed
tables are no longer allowed to be non-transactional. Creating a
non-ACID table results in an external table with the table property
'external.table.purge' set to true.

In Hive-3, external HDFS tables are located by default in
'metastore.warehouse.external.dir' if it's set. This property was added
by HIVE-19837 in Hive 2.7, but hasn't been added to Hive in cdh6 yet.

In a CTAS statement, we create a temporary HMS Table for the analysis of
the Insert part. The table path is created assuming it's a managed
table, and the Insert part uses this path for insertion. However, in
Hive-3, the created table is translated to an external table, so it's
not the same as the table we passed to the HMS API. The created table is
located in 'metastore.warehouse.external.dir', while the table path we
assumed is in 'metastore.warehouse.dir'. This introduces bugs when these
two properties differ: the CTAS statement creates the table in one place
and inserts data in another.

This patch adds a new method in MetastoreShim to wrap the difference
between Hive-2 and Hive-3 in getting the default table path for
non-transactional tables.
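
The path selection described above can be sketched roughly as follows. This
is only an illustration written in C++; the real shim is Java code in
MetastoreShim, and the struct, its fields, the function name, and the
"<db>.db/<table>" layout are all assumptions made for the example, not taken
from the patch.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for the Hive settings discussed in the commit message.
struct WarehouseConf {
  std::string managed_dir;      // value of 'metastore.warehouse.dir'
  std::string external_dir;     // value of 'metastore.warehouse.external.dir', empty if unset
  bool translates_to_external;  // assumed true for Hive-3 with HIVE-22158
};

// Returns the directory where a new non-transactional table is expected to
// be created. Simplified: ignores any special handling of the default database.
inline std::string DefaultNonTransactionalTablePath(
    const WarehouseConf& conf, const std::string& db, const std::string& table) {
  // Only use the external directory when the table will be translated to an
  // external table and the external directory is actually configured.
  const bool use_external =
      conf.translates_to_external && !conf.external_dir.empty();
  const std::string& root = use_external ? conf.external_dir : conf.managed_dir;
  return root + "/" + db + ".db/" + table;
}
```

With the same answer used both for creating the table and for planning the
insert, the two halves of the CTAS statement can no longer disagree about
the location.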

Changes in the infra:
 - To support customizing the Hive configuration, add an env var,
   CUSTOM_CLASSPATH, in bin/set-classpath.sh that is put in front of
   the existing CLASSPATH. The customized hive-site.xml should be put
   inside CUSTOM_CLASSPATH.
 - Change hive-site.xml.py to generate a hive-site.xml with a
   non-default 'metastore.warehouse.external.dir'.
 - Add an option, --env_vars, in bin/start-impala-cluster.py to pass
   down CUSTOM_CLASSPATH.

Tests:
 - Add a custom cluster test to start Hive with
   metastore.warehouse.external.dir set to a non-default value. Run
   it locally using CDP components with HIVE-22158.
 - Run CORE tests using CDH components.

Change-Id: I460a57dc877ef68ad7dd0864a33b1599b1e9a8d9
---
M bin/create-test-configuration.sh
M bin/set-classpath.sh
M bin/start-impala-cluster.py
M fe/src/compat-hive-2/java/org/apache/impala/compat/MetastoreShim.java
M fe/src/compat-hive-3/java/org/apache/impala/compat/MetastoreShim.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableAsSelectStmt.java
M fe/src/test/resources/hive-site.xml.py
M tests/common/custom_cluster_test_suite.py
A tests/custom_cluster/test_custom_hive_configs.py
9 files changed, 156 insertions(+), 12 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/27/14527/6
--
To view, visit http://gerrit.cloudera.org:8080/14527

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I460a57dc877ef68ad7dd0864a33b1599b1e9a8d9
Gerrit-Change-Number: 14527
Gerrit-PatchSet: 6
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 


[Impala-ASF-CR] IMPALA-8999: make union scheduling work with mt dop

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14384 )

Change subject: IMPALA-8999: make union scheduling work with mt_dop
..


Patch Set 14: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/14384

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I0d2e9c86b530da3053e49d42b837dca0b1348ff2
Gerrit-Change-Number: 14384
Gerrit-PatchSet: 14
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Thu, 24 Oct 2019 04:41:57 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8999: make union scheduling work with mt dop

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/14384 )

Change subject: IMPALA-8999: make union scheduling work with mt_dop
..

IMPALA-8999: make union scheduling work with mt_dop

This change unifies mt_dop scheduling between the
union and scan cases.

Testing:
Manually checked that fragments with unions get parallelised
to the correct degree, both as a result of scans within the
fragment and input fragments.

Extend TestMtDopAdmissionSlots (renamed to TestMtDopScheduling)
to confirm that queries that were not parallelised before are
now parallelised. These tests verify the number of instances
of each operator using the ExecSummary embedded in the profile.

Change-Id: I0d2e9c86b530da3053e49d42b837dca0b1348ff2
Reviewed-on: http://gerrit.cloudera.org:8080/14384
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M be/src/scheduling/query-schedule.cc
M be/src/scheduling/scheduler-test.cc
M be/src/scheduling/scheduler.cc
M be/src/scheduling/scheduler.h
M fe/src/main/java/org/apache/impala/planner/UnionNode.java
D testdata/workloads/functional-query/queries/QueryTest/mt-dop-parquet-admission-slots.test
A testdata/workloads/functional-query/queries/QueryTest/mt-dop-parquet-scheduling.test
M tests/query_test/test_mt_dop.py
8 files changed, 382 insertions(+), 219 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/14384

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I0d2e9c86b530da3053e49d42b837dca0b1348ff2
Gerrit-Change-Number: 14384
Gerrit-PatchSet: 15
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9071: Handle translated external HDFS table in CATS

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14527 )

Change subject: IMPALA-9071: Handle translated external HDFS table in CATS
..


Patch Set 5:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/4865/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/14527

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I460a57dc877ef68ad7dd0864a33b1599b1e9a8d9
Gerrit-Change-Number: 14527
Gerrit-PatchSet: 5
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Thu, 24 Oct 2019 01:40:51 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8995: Fix synchronization in dequeue thread

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14539 )

Change subject: IMPALA-8995: Fix synchronization in dequeue thread
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/4864/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/14539

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I91080ce54e59cc7e6361f7c50d6b2156a8a180c8
Gerrit-Change-Number: 14539
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Thu, 24 Oct 2019 01:39:48 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7984: Port runtime filter from Thrift RPC to KRPC

2019-10-23 Thread Michael Ho (Code Review)
Michael Ho has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/13882 )

Change subject: IMPALA-7984: Port runtime filter from Thrift RPC to KRPC
..


Patch Set 25:

(26 comments)

http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/coordinator-backend-state.cc
File be/src/runtime/coordinator-backend-state.cc:

http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/coordinator-backend-state.cc@556
PS25, Line 556: LOG(ERROR) << "PublishFilter() rpc failed: " << rpc_status.ToString();
Return here, as there is no point in continuing to print the result status if
the RPC failed.


http://gerrit.cloudera.org:8080/#/c/13882/24/be/src/runtime/coordinator.cc
File be/src/runtime/coordinator.cc:

http://gerrit.cloudera.org:8080/#/c/13882/24/be/src/runtime/coordinator.cc@1134
PS24, Line 1134: r is neither an always true filter nor an
   :   // always false filter, then it must be the case that a 
non-empty sidecar slice
> I actually found out that if we are using "std::move(sidecar_slice.ToString
Sorry for the wrong advice. I believe the RVO should take care of the 
copy-elision so no need for the std::move. In other words, it should be 
sufficient to just call sidecar_slice.ToString();
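
To illustrate the point about not needing std::move, here is a minimal
sketch. ByteSlice, FilterState and StoreDirectory are hypothetical names
invented for this example and are not the Slice API used in the patch. The
assumption is only that ToString() returns a std::string by value.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Hypothetical stand-in for a view over received bytes.
struct ByteSlice {
  const char* data;
  std::size_t size;
  // Returns a temporary std::string; building it is the one copy of the bytes.
  std::string ToString() const { return std::string(data, size); }
};

// Hypothetical holder for the copied bytes.
struct FilterState {
  std::string directory;
};

inline void StoreDirectory(const ByteSlice& slice, FilterState* state) {
  // The call expression is already a temporary, so the move assignment
  // operator is selected without wrapping the call in std::move().
  state->directory = slice.ToString();
}
```

Initializing a new variable, as in `std::string s = slice.ToString();`, is
the case where copy elision applies directly; wrapping the call in
std::move() there adds nothing and can get in the way of the elision.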


http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/coordinator.cc
File be/src/runtime/coordinator.cc:

http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/coordinator.cc@1010
PS25, Line 1010: // >>> IMPALA-7984: Port runtime filter from Thrift RPC to 
KRPC
To be removed ?!


http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/coordinator.cc@1022
PS25, Line 1022: std::
nit: no need for std::

Also, may make sense to move this declaration to somewhere closer to its use 
(e.g. line 1068 below).

Same for rpc_params and target_fragment_idxs above.


http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/coordinator.cc@1072
PS25, Line 1072:   BloomFilterPB& aggregated_filter = state->bloom_filter();
   :   aggregated_filter.Swap(rpc_params.mutable_bloom_filter());
This swapping trick seems unnecessary now that the bloom_filter_directory is a
separate structure.


http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/coordinator.cc@1074
PS25, Line 1074: bloom_filter_directory
Is there a reason to assign state->bloom_filter_directory() to this local 
variable ? Why cannot we use state->bloom_filter_directory() directly below ?


http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/coordinator.cc@1098
PS25, Line 1098:  reinterpret_cast(&(bloom_filter_directory[0])
state->bloom_filter_directory().data()


http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/coordinator.cc@1153
PS25, Line 1153: bloom_filter_ = BloomFilterPB(params.bloom_filter());
bloom_filter_ = params.bloom_filter();

The above should be sufficient.


http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/coordinator.cc@1155
PS25, Line 1155: std::string(
   :   reinterpret_cast(sidecar_slice.data()), sidecar_slice.size())
Can keep the original suggestion of sidecar_slice.ToString(). C++ RVO should
take care of avoiding the copy, so the TODO can be removed. There is no easy way
to avoid copying the sidecar from the network buffer, so we have to copy at
least once.


http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/runtime-filter-bank.cc
File be/src/runtime/runtime-filter-bank.cc:

http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/runtime-filter-bank.cc@190
PS25, Line 190: std::
nit: no need for std::


http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/runtime-filter-bank.cc@204
PS25, Line 204: std::
nit: no need for std::


http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/runtime-filter-bank.cc@205
PS25, Line 205:   ++num_inflight_rpcs_;
DCHECK_GE(num_inflight_rpcs_, 0); before increment.


http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/runtime-filter-bank.cc@323
PS25, Line 323: std::
nit: no need for std::


http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/timestamp-value.h
File be/src/runtime/timestamp-value.h:

http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/timestamp-value.h@170
PS25, Line 170:   // Store the binary representation of this TimestampValue in 
'tvalue'.
  :   void ToTColumnValue(TColumnValue* tvalue) const {
  : const uint8_t* data = reinterpret_cast(this);
  : tvalue->timestamp_val.assign(data, data + Size());
  : tvalue->__isset.timestamp_val = true;
  :   }
Is this not used anymore ?


http://gerrit.cloudera.org:8080/#/c/13882/25/be/src/runtime/timestamp-value.h@183
PS25, Line 183:   // Returns a new TimestampValue created from the value in 
'tvalue'.
  :   static TimestampValue FromTColumnValue(const TColumnValue& 
tvalue) {
  : TimestampValue value;
 

[Impala-ASF-CR] IMPALA-8995: Fix synchronization in dequeue thread

2019-10-23 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14539 )

Change subject: IMPALA-8995: Fix synchronization in dequeue thread
..


Patch Set 1:

(1 comment)

Fix makes sense, just want to be sure that we're not leaving things in a state
where more subtle bugs will go unnoticed.

http://gerrit.cloudera.org:8080/#/c/14539/1/be/src/scheduling/admission-controller.cc
File be/src/scheduling/admission-controller.cc:

http://gerrit.cloudera.org:8080/#/c/14539/1/be/src/scheduling/admission-controller.cc@1375
PS1, Line 1375: dequeue_cv_.Wait(lock);
I feel like this usage pattern of condition variables is really asking for
trouble. The pattern of

  while (!condition) {
    cv_.wait(lock);
  }

is a lot more robust. Can we switch to using this pattern so that it's more
explicit, e.g. have a pending_dequeue_ variable that is explicitly set when
signalling the variable to make sure that the wakeup doesn't get lost?
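
A minimal sketch of the suggested pattern, using the standard library rather
than Impala's own lock and condition variable wrappers. The class and method
names are hypothetical and not taken from admission-controller.cc; only the
pending_dequeue_ flag name comes from the comment above.

```cpp
#include <cassert>
#include <condition_variable>
#include <mutex>

// Hypothetical helper showing a condition variable guarded by an explicit flag.
class DequeueSignal {
 public:
  // Called by a thread that has released resources. The flag is set under
  // the lock, so the notification is recorded even if nobody is waiting yet.
  void Notify() {
    {
      std::lock_guard<std::mutex> l(lock_);
      pending_dequeue_ = true;
    }
    cv_.notify_one();
  }

  // Called by the dequeue thread. Returns once a notification is pending and
  // consumes it. The loop also guards against spurious wakeups.
  void WaitForWork() {
    std::unique_lock<std::mutex> l(lock_);
    while (!pending_dequeue_) {
      cv_.wait(l);
    }
    pending_dequeue_ = false;
  }

  // Reports whether a notification has been recorded but not yet consumed.
  bool HasPending() {
    std::lock_guard<std::mutex> l(lock_);
    return pending_dequeue_;
  }

 private:
  std::mutex lock_;
  std::condition_variable cv_;
  bool pending_dequeue_ = false;
};
```

Because the waiter checks the flag before blocking, a Notify() that arrives
before WaitForWork() is not lost: the waiter sees the flag already set and
returns without waiting.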



--
To view, visit http://gerrit.cloudera.org:8080/14539

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I91080ce54e59cc7e6361f7c50d6b2156a8a180c8
Gerrit-Change-Number: 14539
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Thu, 24 Oct 2019 01:10:19 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9071: Handle translated external HDFS table in CATS

2019-10-23 Thread Quanlong Huang (Code Review)
Hello Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/14527

to look at the new patch set (#5).

Change subject: IMPALA-9071: Handle translated external HDFS table in CATS
..

IMPALA-9071: Handle translated external HDFS table in CATS

After upgrading Hive-3 to a version containing HIVE-22158, managed
tables are no longer allowed to be non-transactional. Creating a
non-ACID table results in an external table with the table property
'external.table.purge' set to true.

In Hive-3, external HDFS tables are located by default in
'metastore.warehouse.external.dir' if it's set. This property was added
by HIVE-19837 in Hive 2.7, but hasn't been added to Hive in cdh6 yet.

In a CTAS statement, we create a temporary HMS Table for the analysis of
the Insert part. The table path is created assuming it's a managed
table, and the Insert part uses this path for insertion. However, in
Hive-3, the created table is translated to an external table, so it's
not the same as the table we passed to the HMS API. The created table is
located in 'metastore.warehouse.external.dir', while the table path we
assumed is in 'metastore.warehouse.dir'. This introduces bugs when these
two properties differ: the CTAS statement creates the table in one place
and inserts data in another.

This patch adds a new method in MetastoreShim to wrap the difference
between Hive-2 and Hive-3 in getting the default table path for
non-transactional tables.

This patch also bumps the CDP version to one containing HIVE-22158. Note
that the Ranger version is also bumped from 1.2 to 2.0 by this CDP
upgrade, which causes some test failures. Those tests are disabled
pending follow-up fixes in IMPALA-9047.

Changes in the infra:
 - To support customizing the Hive configuration, add an env var,
   CUSTOM_CLASSPATH, in bin/set-classpath.sh that is put in front of
   the existing CLASSPATH. The customized hive-site.xml should be put
   inside CUSTOM_CLASSPATH.
 - Change hive-site.xml.py to generate a hive-site.xml with a
   non-default 'metastore.warehouse.external.dir'.
 - Add an option, --env_vars, in bin/start-impala-cluster.py to pass
   down CUSTOM_CLASSPATH.

Tests:
 - Add a custom cluster test to start Hive with
   metastore.warehouse.external.dir set to a non-default value. Run
   it locally using CDP components with HIVE-22158.
 - Run CORE tests using CDH components.

Change-Id: I460a57dc877ef68ad7dd0864a33b1599b1e9a8d9
---
M bin/create-test-configuration.sh
M bin/impala-config.sh
M bin/set-classpath.sh
M bin/start-impala-cluster.py
M fe/src/compat-hive-2/java/org/apache/impala/compat/MetastoreShim.java
M fe/src/compat-hive-3/java/org/apache/impala/compat/MetastoreShim.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableAsSelectStmt.java
M fe/src/test/java/org/apache/impala/authorization/AuthorizationStmtTest.java
M fe/src/test/java/org/apache/impala/authorization/ranger/RangerAuditLogTest.java
M fe/src/test/resources/hive-site.xml.py
M testdata/pom.xml
M tests/authorization/test_authorized_proxy.py
M tests/authorization/test_ranger.py
M tests/common/custom_cluster_test_suite.py
A tests/custom_cluster/test_custom_hive_configs.py
15 files changed, 255 insertions(+), 14 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/27/14527/5
--
To view, visit http://gerrit.cloudera.org:8080/14527

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I460a57dc877ef68ad7dd0864a33b1599b1e9a8d9
Gerrit-Change-Number: 14527
Gerrit-PatchSet: 5
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 


[Impala-ASF-CR] IMPALA-8995: Fix synchronization in dequeue thread

2019-10-23 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/14539 )


Change subject: IMPALA-8995: Fix synchronization in dequeue thread
..

IMPALA-8995: Fix synchronization in dequeue thread

The admission controller's dequeue thread currently wakes up either
when queries release their admission resources or when a statestore
update is received. The dequeue loop releases the admission lock at
the end of the loop, then reacquires it and calls wait on it.
In this small window, a query can complete, update the admission
stats by acquiring the admission lock, and then send a notify
to wake the dequeue thread. But since the dequeue thread has not
called wait yet, it can miss this notify. Moreover, if the statestore
is down, there is no other way of waking it up. This will cause the
queued queries to eventually time out. This patch attempts to fix this
by removing that window.

Testing:
Was able to trigger this manually by adding a sleep right before
the dequeue loop acquires the admission lock.

Change-Id: I91080ce54e59cc7e6361f7c50d6b2156a8a180c8
---
M be/src/scheduling/admission-controller.cc
1 file changed, 1 insertion(+), 1 deletion(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/39/14539/1
--
To view, visit http://gerrit.cloudera.org:8080/14539

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I91080ce54e59cc7e6361f7c50d6b2156a8a180c8
Gerrit-Change-Number: 14539
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 


[Impala-ASF-CR] IMPALA-8999: make union scheduling work with mt dop

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14384 )

Change subject: IMPALA-8999: make union scheduling work with mt_dop
..


Patch Set 14:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/5129/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/14384

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I0d2e9c86b530da3053e49d42b837dca0b1348ff2
Gerrit-Change-Number: 14384
Gerrit-PatchSet: 14
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Thu, 24 Oct 2019 00:23:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8999: make union scheduling work with mt dop

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14384 )

Change subject: IMPALA-8999: make union scheduling work with mt_dop
..


Patch Set 14: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/14384

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I0d2e9c86b530da3053e49d42b837dca0b1348ff2
Gerrit-Change-Number: 14384
Gerrit-PatchSet: 14
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Thu, 24 Oct 2019 00:23:22 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8999: make union scheduling work with mt dop

2019-10-23 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14384 )

Change subject: IMPALA-8999: make union scheduling work with mt_dop
..


Patch Set 13: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/14384

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I0d2e9c86b530da3053e49d42b837dca0b1348ff2
Gerrit-Change-Number: 14384
Gerrit-PatchSet: 13
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 23 Oct 2019 23:37:14 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8999: make union scheduling work with mt dop

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14384 )

Change subject: IMPALA-8999: make union scheduling work with mt_dop
..


Patch Set 13:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/4863/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/14384

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I0d2e9c86b530da3053e49d42b837dca0b1348ff2
Gerrit-Change-Number: 14384
Gerrit-PatchSet: 13
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 23 Oct 2019 22:46:16 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8999: make union scheduling work with mt dop

2019-10-23 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14384 )

Change subject: IMPALA-8999: make union scheduling work with mt_dop
..


Patch Set 12:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/14384/12/be/src/scheduling/scheduler.cc
File be/src/scheduling/scheduler.cc:

http://gerrit.cloudera.org:8080/#/c/14384/12/be/src/scheduling/scheduler.cc@338
PS12, Line 338:  for the scan range
> nit: remove?
Done


http://gerrit.cloudera.org:8080/#/c/14384/12/be/src/scheduling/scheduler.cc@344
PS12, Line 344: Note that this takes into account all of the input fragments,
  : // not just the leftmost because we expect unions to be 
symmetrical for purposes of
  : // planning, unlike joins.
> nit: how about: "Note that step 1 is modified to run on fragments with unio
Done


http://gerrit.cloudera.org:8080/#/c/14384/12/be/src/scheduling/scheduler.cc@392
PS12, Line 392:   vector scan_hosts;
> nit: maybe add a Dcheck(scan_node_ids.size() == 1 || has_union) so that we
Done



--
To view, visit http://gerrit.cloudera.org:8080/14384

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I0d2e9c86b530da3053e49d42b837dca0b1348ff2
Gerrit-Change-Number: 14384
Gerrit-PatchSet: 12
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 23 Oct 2019 22:01:44 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8999: make union scheduling work with mt dop

2019-10-23 Thread Tim Armstrong (Code Review)
Hello Andrew Sherman, Bikramjeet Vig, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/14384

to look at the new patch set (#13).

Change subject: IMPALA-8999: make union scheduling work with mt_dop
..

IMPALA-8999: make union scheduling work with mt_dop

This change unifies mt_dop scheduling between the
union and scan cases.

Testing:
Manually checked that fragments with unions get parallelised
to the correct degree, both as a result of scans within the
fragment and input fragments.

Extend TestMtDopAdmissionSlots (renamed to TestMtDopScheduling)
to confirm that queries that were not parallelised before are
now parallelised. These tests verify the number of instances
of each operator using the ExecSummary embedded in the profile.

Change-Id: I0d2e9c86b530da3053e49d42b837dca0b1348ff2
---
M be/src/scheduling/query-schedule.cc
M be/src/scheduling/scheduler-test.cc
M be/src/scheduling/scheduler.cc
M be/src/scheduling/scheduler.h
M fe/src/main/java/org/apache/impala/planner/UnionNode.java
D testdata/workloads/functional-query/queries/QueryTest/mt-dop-parquet-admission-slots.test
A testdata/workloads/functional-query/queries/QueryTest/mt-dop-parquet-scheduling.test
M tests/query_test/test_mt_dop.py
8 files changed, 382 insertions(+), 219 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/84/14384/13
--
To view, visit http://gerrit.cloudera.org:8080/14384

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I0d2e9c86b530da3053e49d42b837dca0b1348ff2
Gerrit-Change-Number: 14384
Gerrit-PatchSet: 13
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9047: Bump CDP BUILD NUMBER to 1507246

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14529 )

Change subject: IMPALA-9047: Bump CDP_BUILD_NUMBER to 1507246
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/4862/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/14529

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ifbc42a73fe98c437edc20a8f57aba51e3096a09d
Gerrit-Change-Number: 14529
Gerrit-PatchSet: 2
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Wed, 23 Oct 2019 18:02:35 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9047: Bump CDP BUILD NUMBER to 1507246

2019-10-23 Thread Fang-Yu Rao (Code Review)
Fang-Yu Rao has uploaded a new patch set (#2). ( 
http://gerrit.cloudera.org:8080/14529 )

Change subject: IMPALA-9047: Bump CDP_BUILD_NUMBER to 1507246
..

IMPALA-9047: Bump CDP_BUILD_NUMBER to 1507246

This patch bumps CDP_BUILD_NUMBER to 1507246. Some test cases would fail
due to this change. For example, some Ranger-related E2E tests and FE
tests would fail when we are using a newer version of Ranger (e.g.,
ranger-2.0.0.7.0.2.0-108 instead of ranger-1.2.0.7.1.0.0-33) due to the
changes to the default Ranger policies described at
https://issues.apache.org/jira/browse/RANGER-2536.

To address this issue, this patch temporarily disables those affected
Ranger tests. Specifically, the affected tests in the following test
files are disabled for now.

1. test_authorized_proxy.py
2. test_ranger.py
3. AuthorizationStmtTest.java
4. RangerAuditLogTest.java

Testing:
- This patch passes the affected Ranger tests listed above on a local
  machine.

Change-Id: Ifbc42a73fe98c437edc20a8f57aba51e3096a09d
---
M bin/impala-config.sh
M fe/src/test/java/org/apache/impala/authorization/AuthorizationStmtTest.java
M fe/src/test/java/org/apache/impala/authorization/ranger/RangerAuditLogTest.java
M tests/authorization/test_authorized_proxy.py
M tests/authorization/test_ranger.py
5 files changed, 101 insertions(+), 7 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/29/14529/2
--
To view, visit http://gerrit.cloudera.org:8080/14529

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ifbc42a73fe98c437edc20a8f57aba51e3096a09d
Gerrit-Change-Number: 14529
Gerrit-PatchSet: 2
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Quanlong Huang 


[Impala-ASF-CR] IMPALA-7504/KUDU-2979 ParseKerberosPrincipal() should use krb5 parse name() instead

2019-10-23 Thread Andrew Sherman (Code Review)
Andrew Sherman has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14433 )

Change subject: IMPALA-7504/KUDU-2979 ParseKerberosPrincipal() should use 
krb5_parse_name() instead
..


Patch Set 6:

(3 comments)

Thanks for this update

http://gerrit.cloudera.org:8080/#/c/14433/5//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/14433/5//COMMIT_MSG@21
PS5, Line 21: format principal, new error code is 2 instead of original 112
> What kind of end-to-end test? The one included in impala-private-parameteri
yes, maybe there's a better name but I think this name is generally understood


http://gerrit.cloudera.org:8080/#/c/14433/6//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/14433/6//COMMIT_MSG@19
PS6, Line 19: Add two authentication-test
Add two tests to authentication-test


http://gerrit.cloudera.org:8080/#/c/14433/6/be/src/rpc/authentication-test.cc
File be/src/rpc/authentication-test.cc:

http://gerrit.cloudera.org:8080/#/c/14433/6/be/src/rpc/authentication-test.cc@200
PS6, Line 200:   EXPECT_ERROR(sa.InitKerberos(" ", "/etc/hosts"), 2);
This says we will get an error, but do we know it is the right error?
We should check the return code or message or something more specific
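
One way to read this suggestion, sketched in Python rather than the C++ of authentication-test.cc: assert on the specific error code, not merely on the fact that something failed. The names `KerberosError`, `init_kerberos`, and `expect_error_code` are hypothetical stand-ins and not Impala's API; the code value 2 is taken from the test line quoted above.

```python
class KerberosError(Exception):
    """Hypothetical error type that carries a numeric error code."""

    def __init__(self, code, message):
        super().__init__(message)
        self.code = code


def init_kerberos(principal, keytab_path):
    """Hypothetical stand-in for InitKerberos(): rejects a blank principal."""
    if not principal.strip():
        # Code 2 mirrors the value expected by the quoted test line.
        raise KerberosError(2, "badly formatted principal: %r" % principal)
    return True


def expect_error_code(func, expected_code, *args):
    """Return True only if func raises KerberosError with expected_code."""
    try:
        func(*args)
    except KerberosError as e:
        return e.code == expected_code
    # No error at all is also a failed expectation.
    return False
```

With a helper of this shape, a call that fails with a different code, or that does not fail, is reported as a test failure instead of passing silently.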



--
To view, visit http://gerrit.cloudera.org:8080/14433
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I0e64ebdc10f102dbdc5b87f6fe3f2a0310b1be24
Gerrit-Change-Number: 14433
Gerrit-PatchSet: 6
Gerrit-Owner: Xiaomeng Zhang 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Xiaomeng Zhang 
Gerrit-Comment-Date: Wed, 23 Oct 2019 17:03:19 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9071: Handle translated external HDFS table in CATS

2019-10-23 Thread Quanlong Huang (Code Review)
Quanlong Huang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14527 )

Change subject: IMPALA-9071: Handle translated external HDFS table in CATS
..


Patch Set 4:

This is still WIP. Still have errors in data-loading when USE_CDP_HIVE=true.


--
To view, visit http://gerrit.cloudera.org:8080/14527
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I460a57dc877ef68ad7dd0864a33b1599b1e9a8d9
Gerrit-Change-Number: 14527
Gerrit-PatchSet: 4
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Wed, 23 Oct 2019 15:01:07 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9071: Handle translated external HDFS table in CATS

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14527 )

Change subject: IMPALA-9071: Handle translated external HDFS table in CATS
..


Patch Set 4:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/4861/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/14527
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I460a57dc877ef68ad7dd0864a33b1599b1e9a8d9
Gerrit-Change-Number: 14527
Gerrit-PatchSet: 4
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 23 Oct 2019 13:05:56 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8974: Fixed a bug when create kudu managed table without HMS config

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14398 )

Change subject: IMPALA-8974: Fixed a bug when create kudu managed table 
without HMS config
..


Patch Set 7:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/4860/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/14398
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iacc53801a660c033869cb4747910c98a80e08297
Gerrit-Change-Number: 14398
Gerrit-PatchSet: 7
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Hao Hao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Wed, 23 Oct 2019 12:13:51 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8974: Fixed a bug when create kudu managed table without HMS config

2019-10-23 Thread wangsheng (Code Review)
wangsheng has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14398 )

Change subject: IMPALA-8974: Fixed a bug when create kudu managed table without 
HMS config
..


Patch Set 8:

(4 comments)

> (5 comments)
 >
 > Thank you for adding the test. I have some concerns about how it
 > changes config files around, so I proposed an alternative solution.

Thanks for your review, Tim. I've addressed the problems you mentioned 
above. If there is a chance, I will try to write the custom cluster test in 
Python instead of Java.

http://gerrit.cloudera.org:8080/#/c/14398/5//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/14398/5//COMMIT_MSG@7
PS5, Line 7: managed
> spelling: managed (also a few cases below).
Done


http://gerrit.cloudera.org:8080/#/c/14398/5/fe/src/test/java/org/apache/impala/customcluster/CreateKuduTableWithoutHMSTest.java
File fe/src/test/java/org/apache/impala/customcluster/CreateKuduTableWithoutHMSTest.java:

http://gerrit.cloudera.org:8080/#/c/14398/5/fe/src/test/java/org/apache/impala/customcluster/CreateKuduTableWithoutHMSTest.java@35
PS5, Line 35: managed
> spelling: managed
Done


http://gerrit.cloudera.org:8080/#/c/14398/5/fe/src/test/java/org/apache/impala/customcluster/CreateKuduTableWithoutHMSTest.java@66
PS5, Line 66: sultSet
> spelling: managed
Done


http://gerrit.cloudera.org:8080/#/c/14398/5/fe/src/test/java/org/apache/impala/customcluster/CreateKuduTableWithoutHMSTest.java@84
PS5, Line 84: }
> I don't really like this test modifying global test configs, it really risk
Done



--
To view, visit http://gerrit.cloudera.org:8080/14398
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iacc53801a660c033869cb4747910c98a80e08297
Gerrit-Change-Number: 14398
Gerrit-PatchSet: 8
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Hao Hao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Wed, 23 Oct 2019 12:21:04 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9071: Handle translated external HDFS table in CATS

2019-10-23 Thread Quanlong Huang (Code Review)
Hello Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/14527

to look at the new patch set (#4).

Change subject: IMPALA-9071: Handle translated external HDFS table in CATS
..

IMPALA-9071: Handle translated external HDFS table in CATS

After upgrading Hive-3 to a version containing HIVE-22158, managed
tables are no longer allowed to be non-transactional. Creating a
non-ACID table results in an external table with the table property
'external.table.purge' set to true.

In Hive-3, external HDFS tables are located by default in
'metastore.warehouse.external.dir' if it's set. This property was added
by HIVE-19837 in Hive 2.7, but hasn't been added to Hive in cdh6 yet.

In a CTAS statement, we create a temporary HMS Table for the analysis of
the Insert part. The table path is created assuming it's a managed
table, and the Insert part uses this path for insertion. However, in
Hive-3, the created table is translated to an external table, so it is
not the same as the table we passed to the HMS API. The created table is
located in 'metastore.warehouse.external.dir', while the table path we
assumed is in 'metastore.warehouse.dir'. This introduces bugs when these
two properties differ: the CTAS statement creates the table in one place
and inserts data in another.

This patch adds a new method in MetastoreShim to wrap the difference for
getting the default table path for non transactional tables between
Hive-2 and Hive-3.
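
The path selection described above can be sketched as follows. This is a Python illustration only: the actual change lives in the Java MetastoreShim classes, and the function name, the `hive_major_version` parameter, and the `<db>.db/<table>` layout used here are assumptions made for the example.

```python
import posixpath


def default_non_transactional_table_path(hive_major_version, conf, db, table):
    """Return the assumed default location of a non-transactional table.

    conf is a dict of Hive properties. On Hive 3 the table is translated to
    an external table, so the external warehouse dir is preferred when set.
    """
    warehouse_dir = conf["metastore.warehouse.dir"]
    if hive_major_version >= 3:
        # Fall back to the managed warehouse dir if the external one is unset.
        warehouse_dir = (conf.get("metastore.warehouse.external.dir")
                         or warehouse_dir)
    return posixpath.join(warehouse_dir, db + ".db", table)
```

Deriving both the created table's location and the insert target from one such helper is what keeps the two from diverging when the two warehouse properties differ.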

This patch also bumps the CDP version to one that contains HIVE-22158.
Note that the Ranger version is also bumped from 1.2 to 2.0 by this CDP
upgrade, which causes some test failures. The affected tests are
disabled pending follow-up fixes in IMPALA-9047.

Changes in the infra:
 - To support customizing the Hive configuration, add an env var,
   CUSTOM_CLASSPATH, in bin/set-classpath.sh that is put in front of the
   existing CLASSPATH. The customized hive-site.xml should be put inside
   CUSTOM_CLASSPATH.
 - Change hive-site.xml.py to generate a hive-site.xml with a non-default
   'metastore.warehouse.external.dir'.
 - Add an option, --env_vars, in bin/start-impala-cluster.py to pass
   down CUSTOM_CLASSPATH.
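
A minimal sketch of the classpath handling described in the list above, written in Python for illustration. The real logic is in bin/set-classpath.sh and bin/start-impala-cluster.py; both function names are hypothetical, and the space-separated KEY=VALUE format assumed for --env_vars is a guess, not taken from the patch.

```python
def prepend_custom_classpath(env):
    """Return a copy of env with CUSTOM_CLASSPATH placed before CLASSPATH."""
    result = dict(env)
    custom = env.get("CUSTOM_CLASSPATH")
    if custom:
        existing = env.get("CLASSPATH", "")
        # Entries earlier on the classpath win, so a custom hive-site.xml
        # directory listed first shadows the default one.
        result["CLASSPATH"] = (custom + ":" + existing) if existing else custom
    return result


def parse_env_vars(option_value):
    """Parse an assumed 'KEY=VALUE KEY2=VALUE2' option string into a dict."""
    parsed = {}
    for item in option_value.split():
        key, _, value = item.partition("=")
        parsed[key] = value
    return parsed
```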

Tests:
 - Add a custom cluster test that starts Hive with
   metastore.warehouse.external.dir set to a non-default value. Ran it
   locally using CDP components with HIVE-22158.
 - Ran CORE tests using CDH components.

Change-Id: I460a57dc877ef68ad7dd0864a33b1599b1e9a8d9
---
M bin/create-test-configuration.sh
M bin/impala-config.sh
M bin/set-classpath.sh
M bin/start-impala-cluster.py
M fe/src/compat-hive-2/java/org/apache/impala/compat/MetastoreShim.java
M fe/src/compat-hive-3/java/org/apache/impala/compat/MetastoreShim.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableAsSelectStmt.java
M fe/src/test/java/org/apache/impala/authorization/AuthorizationStmtTest.java
M fe/src/test/java/org/apache/impala/authorization/ranger/RangerAuditLogTest.java
M fe/src/test/resources/hive-site.xml.py
M testdata/pom.xml
M tests/authorization/test_authorized_proxy.py
M tests/authorization/test_ranger.py
M tests/common/custom_cluster_test_suite.py
A tests/custom_cluster/test_custom_hive_configs.py
15 files changed, 253 insertions(+), 14 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/27/14527/4
--
To view, visit http://gerrit.cloudera.org:8080/14527
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I460a57dc877ef68ad7dd0864a33b1599b1e9a8d9
Gerrit-Change-Number: 14527
Gerrit-PatchSet: 4
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-8974: Fixed a bug when create kudu managed table without HMS config

2019-10-23 Thread wangsheng (Code Review)
Hello Quanlong Huang, Hao Hao, Tim Armstrong, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/14398

to look at the new patch set (#8).

Change subject: IMPALA-8974: Fixed a bug when create kudu managed table without 
HMS config
..

IMPALA-8974: Fixed a bug when create kudu managed table without HMS
config

When catalogd connects to MySQL/PostgreSQL directly instead of
using HMS, the DDL that creates a managed Kudu table fails because
a Preconditions check fails.

This patch fixes the bug by using an if condition instead of the
Preconditions check, which throws an exception when the HMS config is
missing.
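
The difference between the two styles can be sketched as below. This is a Python illustration of the general pattern, not the Java code in KuduTable.java; `check_state` and both accessor functions are hypothetical, and using the 'hive.metastore.uris' property as the checked setting is an assumption.

```python
def check_state(condition, message):
    """Hypothetical analogue of a Preconditions-style check: raise on failure."""
    if not condition:
        raise RuntimeError(message)


def hms_uris_strict(config):
    """Precondition style: a missing HMS setting aborts the whole operation."""
    check_state("hive.metastore.uris" in config, "missing HMS config")
    return config["hive.metastore.uris"]


def hms_uris_lenient(config):
    """Conditional style: a missing HMS setting is tolerated; returns None."""
    if "hive.metastore.uris" in config:
        return config["hive.metastore.uris"]
    return None
```

The conditional form lets the caller continue when the setting is absent, which matches the deployment without HMS config described in this commit message.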

Tests:
  * Add a test for creating/dropping a managed Kudu table without HMS config
  * Ran all front-end tests

Change-Id: Iacc53801a660c033869cb4747910c98a80e08297
---
M bin/start-daemon.sh
M bin/start-impala-cluster.py
M fe/src/main/java/org/apache/impala/catalog/KuduTable.java
A fe/src/test/java/org/apache/impala/customcluster/CreateKuduTableWithoutHMSTest.java
A fe/src/test/resources/hive-site-without-hms.xml.py
5 files changed, 177 insertions(+), 2 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/98/14398/8
--
To view, visit http://gerrit.cloudera.org:8080/14398
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Iacc53801a660c033869cb4747910c98a80e08297
Gerrit-Change-Number: 14398
Gerrit-PatchSet: 8
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Hao Hao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: wangsheng 


[Impala-ASF-CR] IMPALA-8974: Fixed a bug when create kudu managed table without HMS config

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14398 )

Change subject: IMPALA-8974: Fixed a bug when create kudu managed table 
without HMS config
..


Patch Set 6:

Build Failed

https://jenkins.impala.io/job/gerrit-code-review-checks/4859/ : Initial code 
review checks failed. See linked job for details on the failure.


--
To view, visit http://gerrit.cloudera.org:8080/14398
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iacc53801a660c033869cb4747910c98a80e08297
Gerrit-Change-Number: 14398
Gerrit-PatchSet: 6
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Hao Hao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Wed, 23 Oct 2019 12:13:50 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8974: Fixed a bug when create kudu managed table without HMS config

2019-10-23 Thread wangsheng (Code Review)
Hello Quanlong Huang, Hao Hao, Tim Armstrong, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/14398

to look at the new patch set (#7).

Change subject: IMPALA-8974: Fixed a bug when create kudu managed table 
without HMS config
..

IMPALA-8974: Fixed a bug when create kudu managed table without HMS
config

When catalogd connects to MySQL/PostgreSQL directly instead of
using HMS, the DDL that creates a managed Kudu table fails because
a Preconditions check fails.

This patch fixes the bug by using an if condition instead of the
Preconditions check, which throws an exception when the HMS config is
missing.

Tests:
  * Add a test for creating/dropping a managed Kudu table without HMS config
  * Ran all front-end tests

Change-Id: Iacc53801a660c033869cb4747910c98a80e08297
---
M bin/start-daemon.sh
M bin/start-impala-cluster.py
M fe/src/main/java/org/apache/impala/catalog/KuduTable.java
A fe/src/test/java/org/apache/impala/customcluster/CreateKuduTableWithoutHMSTest.java
A fe/src/test/resources/hive-site-without-hms.xml.py
5 files changed, 177 insertions(+), 2 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/98/14398/7
--
To view, visit http://gerrit.cloudera.org:8080/14398
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Iacc53801a660c033869cb4747910c98a80e08297
Gerrit-Change-Number: 14398
Gerrit-PatchSet: 7
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Hao Hao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: wangsheng 


[Impala-ASF-CR] IMPALA-8974: Fixed a bug when create kudu managed table without HMS config

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14398 )

Change subject: IMPALA-8974: Fixed a bug when create kudu managed table 
without HMS config
..


Patch Set 6:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/14398/6/bin/start-impala-cluster.py
File bin/start-impala-cluster.py:

http://gerrit.cloudera.org:8080/#/c/14398/6/bin/start-impala-cluster.py@136
PS6, Line 136: ,
flake8: E231 missing whitespace after ','


http://gerrit.cloudera.org:8080/#/c/14398/6/bin/start-impala-cluster.py@136
PS6, Line 136: ,
flake8: E999 SyntaxError: invalid syntax
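
Both reports point at the same comma on line 136. The flagged source line is not shown in this thread, so the strings below are hypothetical shapes only. The sketch uses Python's built-in `compile()` to show how a stray comma produces the kind of syntax error that flake8 reports as E999; `has_syntax_error` is a helper invented for this example.

```python
def has_syntax_error(source):
    """Return True if source fails to compile, as an E999 report implies."""
    try:
        compile(source, "<example>", "exec")
    except SyntaxError:
        return True
    return False


# Hypothetical line shapes, not the actual content of line 136.
stray_comma = "opts = dict(a=1,, b=2)\n"  # a doubled comma is invalid syntax
fixed = "opts = dict(a=1, b=2)\n"         # valid, with whitespace after ','
```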



--
To view, visit http://gerrit.cloudera.org:8080/14398
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iacc53801a660c033869cb4747910c98a80e08297
Gerrit-Change-Number: 14398
Gerrit-PatchSet: 6
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Hao Hao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Wed, 23 Oct 2019 11:32:13 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8974: Fixed a bug when create kudu managed table without HMS config

2019-10-23 Thread wangsheng (Code Review)
Hello Quanlong Huang, Hao Hao, Tim Armstrong, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/14398

to look at the new patch set (#6).

Change subject: IMPALA-8974: Fixed a bug when create kudu managed table 
without HMS config
..

IMPALA-8974: Fixed a bug when create kudu managed table without HMS
config

When catalogd connects to MySQL/PostgreSQL directly instead of
using HMS, the DDL that creates a managed Kudu table fails because
a Preconditions check fails.

This patch fixes the bug by using an if condition instead of the
Preconditions check, which throws an exception when the HMS config is
missing.

Tests:
  * Add a test for creating/dropping a managed Kudu table without HMS config
  * Ran all front-end tests

Change-Id: Iacc53801a660c033869cb4747910c98a80e08297
---
M bin/start-daemon.sh
M bin/start-impala-cluster.py
M fe/src/main/java/org/apache/impala/catalog/KuduTable.java
A fe/src/test/java/org/apache/impala/customcluster/CreateKuduTableWithoutHMSTest.java
A fe/src/test/resources/hive-site-without-hms.xml.py
5 files changed, 177 insertions(+), 2 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/98/14398/6
--
To view, visit http://gerrit.cloudera.org:8080/14398
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Iacc53801a660c033869cb4747910c98a80e08297
Gerrit-Change-Number: 14398
Gerrit-PatchSet: 6
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Hao Hao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: wangsheng 


[Impala-ASF-CR] IMPALA-9071: Handle translated external HDFS table in CATS

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14527 )

Change subject: IMPALA-9071: Handle translated external HDFS table in CATS
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/4858/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/14527
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I460a57dc877ef68ad7dd0864a33b1599b1e9a8d9
Gerrit-Change-Number: 14527
Gerrit-PatchSet: 3
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 23 Oct 2019 11:01:26 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9071: Handle translated external HDFS table in CATS

2019-10-23 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/14527 )

Change subject: IMPALA-9071: Handle translated external HDFS table in CATS
..


Patch Set 3:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/14527/3/tests/custom_cluster/test_custom_hive_configs.py
File tests/custom_cluster/test_custom_hive_configs.py:

http://gerrit.cloudera.org:8080/#/c/14527/3/tests/custom_cluster/test_custom_hive_configs.py@25
PS3, Line 25: class TestCustomHiveConfigs(CustomClusterTestSuite):
flake8: E302 expected 2 blank lines, found 1



--
To view, visit http://gerrit.cloudera.org:8080/14527
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I460a57dc877ef68ad7dd0864a33b1599b1e9a8d9
Gerrit-Change-Number: 14527
Gerrit-PatchSet: 3
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 23 Oct 2019 10:15:13 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9071: Handle translated external HDFS table in CATS

2019-10-23 Thread Quanlong Huang (Code Review)
Quanlong Huang has uploaded a new patch set (#3). ( 
http://gerrit.cloudera.org:8080/14527 )

Change subject: IMPALA-9071: Handle translated external HDFS table in CATS
..

IMPALA-9071: Handle translated external HDFS table in CATS

After upgrading Hive-3 to a version containing HIVE-22158, managed
tables are no longer allowed to be non-transactional. Creating a
non-ACID table results in an external table with the table property
'external.table.purge' set to true.

In Hive-3, external HDFS tables are located by default in
'metastore.warehouse.external.dir' if it's set. This property was added
by HIVE-19837 in Hive 2.7, but hasn't been added to Hive in cdh6 yet.

In a CTAS statement, we create a temporary HMS Table for the analysis of
the Insert part. The table path is created assuming it's a managed
table, and the Insert part uses this path for insertion. However, in
Hive-3, the created table is translated to an external table, so it is
not the same as the table we passed to the HMS API. The created table is
located in 'metastore.warehouse.external.dir', while the table path we
assumed is in 'metastore.warehouse.dir'. This introduces bugs when these
two properties differ: the CTAS statement creates the table in one place
and inserts data in another.

This patch adds a new method in MetastoreShim to wrap the difference for
getting the default table path for non transactional tables between
Hive-2 and Hive-3.

This patch also bumps the CDP version to one that contains HIVE-22158.
Note that the Ranger version is also bumped from 1.2 to 2.0 by this CDP
upgrade, which causes some test failures. The affected tests are
disabled pending follow-up fixes in IMPALA-9047.

Changes in the infra:
 - To support customizing the Hive configuration, add an env var,
   CUSTOM_CLASSPATH, in bin/set-classpath.sh that is put in front of the
   existing CLASSPATH. The customized hive-site.xml should be put inside
   CUSTOM_CLASSPATH.
 - Change hive-site.xml.py to generate a hive-site.xml with a non-default
   'metastore.warehouse.external.dir'.
 - Add an option, --env_vars, in bin/start-impala-cluster.py to pass
   down CUSTOM_CLASSPATH.

Tests:
 - Add a custom cluster test that starts Hive with
   metastore.warehouse.external.dir set to a non-default value. Ran it
   locally using CDP components with HIVE-22158.
 - Ran CORE tests using CDH components.

Change-Id: I460a57dc877ef68ad7dd0864a33b1599b1e9a8d9
---
M bin/create-test-configuration.sh
M bin/impala-config.sh
M bin/set-classpath.sh
M bin/start-impala-cluster.py
M fe/src/compat-hive-2/java/org/apache/impala/compat/MetastoreShim.java
M fe/src/compat-hive-3/java/org/apache/impala/compat/MetastoreShim.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableAsSelectStmt.java
M fe/src/test/java/org/apache/impala/authorization/AuthorizationStmtTest.java
M fe/src/test/java/org/apache/impala/authorization/ranger/RangerAuditLogTest.java
M fe/src/test/resources/hive-site.xml.py
M testdata/pom.xml
M tests/authorization/test_authorized_proxy.py
M tests/authorization/test_ranger.py
M tests/common/custom_cluster_test_suite.py
A tests/custom_cluster/test_custom_hive_configs.py
15 files changed, 253 insertions(+), 14 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/27/14527/3
--
To view, visit http://gerrit.cloudera.org:8080/14527
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I460a57dc877ef68ad7dd0864a33b1599b1e9a8d9
Gerrit-Change-Number: 14527
Gerrit-PatchSet: 3
Gerrit-Owner: Quanlong Huang