[Impala-ASF-CR] IMPALA-7714: try to avoid be test crash in statestore

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16341 )

Change subject: IMPALA-7714: try to avoid be test crash in statestore
..


Patch Set 3: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/6297/


--
To view, visit http://gerrit.cloudera.org:8080/16341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id4363a93addb8a808d292906cac44ebd25c16889
Gerrit-Change-Number: 16341
Gerrit-PatchSet: 3
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Sat, 15 Aug 2020 04:49:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated HMS partitions

2020-08-14 Thread Quanlong Huang (Code Review)
Quanlong Huang has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated HMS partitions
..

IMPALA-4364: Query option to refresh updated HMS partitions

This patch introduces a new boolean query option
REFRESH_UPDATED_HMS_PARTITIONS. When this query option is set
the refresh table command reloads the partitions which have been
modified in HMS in addition to adding [removing] the new [removed]
partitions.

In order to do this the refresh table command needs to fetch all
the partitions instead of the just the partition names which can
cause the performance of refresh table to degrade when the query
option is set. However for certain use-cases currently there is
no way to detect changed partitions using refresh table command.
For instance, if certain partition locations have been changed,
a refresh table will not update those partitions.

Testing:
1. Added a new test which sets the query option and makes sure
that the updated partitions from hive are reloaded after refresh
table command.
2. Ran exhaustive tests with the patch.

Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Reviewed-on: http://gerrit.cloudera.org:8080/16308
Reviewed-by: Quanlong Huang 
Tested-by: Impala Public Jenkins 
---
M be/src/service/query-options.cc
M be/src/service/query-options.h
M common/thrift/CatalogService.thrift
M common/thrift/ImpalaInternalService.thrift
M common/thrift/ImpalaService.thrift
M fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java
M fe/src/main/java/org/apache/impala/catalog/HdfsPartition.java
M fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/main/java/org/apache/impala/service/Frontend.java
M tests/metadata/test_reset_metadata.py
11 files changed, 464 insertions(+), 106 deletions(-)

Approvals:
  Quanlong Huang: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 9
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 


[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated HMS partitions

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated HMS partitions
..


Patch Set 8: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 8
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Comment-Date: Sat, 15 Aug 2020 01:57:10 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9382: part 1: transposed profile prototype

2020-08-14 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15798 )

Change subject: IMPALA-9382: part 1: transposed profile prototype
..


Patch Set 16:

(3 comments)

I made it through most of the runtime-profile.h. The transformation of the 
averaged profile to the aggregated profile and resulting class hierarchy seems 
reasonable to me.

http://gerrit.cloudera.org:8080/#/c/15798/16//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/15798/16//COMMIT_MSG@30
PS16, Line 30: with RuntimeProfile and AggregatedRuntimeProfile
 : base classes
"base classes" -> "subclasses"?


http://gerrit.cloudera.org:8080/#/c/15798/16//COMMIT_MSG@44
PS16, Line 44: These stats can now be
 : automatically as aggregations of counters.
Some words seem to be missing.


http://gerrit.cloudera.org:8080/#/c/15798/16/common/thrift/RuntimeProfile.thrift
File common/thrift/RuntimeProfile.thrift:

http://gerrit.cloudera.org:8080/#/c/15798/16/common/thrift/RuntimeProfile.thrift@249
PS16, Line 249: an averaged profile
  : // for the fragment is also included with averaged counter 
values.
Does the thrift for V1's average profile now also contain the 
TAggregatedRuntimeProfileNode struct? If so, are there any size concerns?



--
To view, visit http://gerrit.cloudera.org:8080/15798
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I0838c6a0872f57c696267ff4e92d29c08748eb7a
Gerrit-Change-Number: 15798
Gerrit-PatchSet: 16
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Sat, 15 Aug 2020 00:44:59 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON instructions

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15531 )

Change subject: IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON 
instructions
..


Patch Set 45: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/6295/


--
To view, visit http://gerrit.cloudera.org:8080/15531
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id7dfe17125b2910ece54e7dd18b4e4b25d7de8b9
Gerrit-Change-Number: 15531
Gerrit-PatchSet: 45
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Sat, 15 Aug 2020 00:44:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated HMS partitions

2020-08-14 Thread Quanlong Huang (Code Review)
Quanlong Huang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated HMS partitions
..


Patch Set 8: Code-Review+2

Thanks for working on this!


--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 8
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Comment-Date: Sat, 15 Aug 2020 00:40:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7714: try to avoid be test crash in statestore

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16341 )

Change subject: IMPALA-7714: try to avoid be test crash in statestore
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6942/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id4363a93addb8a808d292906cac44ebd25c16889
Gerrit-Change-Number: 16341
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 23:41:12 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9989 Improve admission control pool stats logging

2020-08-14 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16220 )

Change subject: IMPALA-9989 Improve admission control pool stats logging
..


Patch Set 38: Code-Review+1


--
To view, visit http://gerrit.cloudera.org:8080/16220
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id995a9d044082c3b8f044e1ec25bb4c64347f781
Gerrit-Change-Number: 16220
Gerrit-PatchSet: 38
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 23:39:16 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7714: try to avoid be test crash in statestore

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16341 )

Change subject: IMPALA-7714: try to avoid be test crash in statestore
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6297/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id4363a93addb8a808d292906cac44ebd25c16889
Gerrit-Change-Number: 16341
Gerrit-PatchSet: 3
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 23:35:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7714: try to avoid be test crash in statestore

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16341 )

Change subject: IMPALA-7714: try to avoid be test crash in statestore
..


Patch Set 3: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id4363a93addb8a808d292906cac44ebd25c16889
Gerrit-Change-Number: 16341
Gerrit-PatchSet: 3
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 23:35:05 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7714: try to avoid be test crash in statestore

2020-08-14 Thread Andrew Sherman (Code Review)
Andrew Sherman has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16341 )

Change subject: IMPALA-7714: try to avoid be test crash in statestore
..


Patch Set 2: Code-Review+2

LGTM


--
To view, visit http://gerrit.cloudera.org:8080/16341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id4363a93addb8a808d292906cac44ebd25c16889
Gerrit-Change-Number: 16341
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 23:31:01 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7714: try to avoid be test crash in statestore

2020-08-14 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16341 )

Change subject: IMPALA-7714: try to avoid be test crash in statestore
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/16341/1/be/src/statestore/statestore.cc
File be/src/statestore/statestore.cc:

http://gerrit.cloudera.org:8080/#/c/16341/1/be/src/statestore/statestore.cc@396
PS1, Line 396:   // IMPALA-7714: log warning to aid debugging.
> Should the logging go before the DCHECK?
This was mainly targeted at release builds, but yeah, makes sense. I also added 
the topic id to the DCHECK cause that'd be useful.



--
To view, visit http://gerrit.cloudera.org:8080/16341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id4363a93addb8a808d292906cac44ebd25c16889
Gerrit-Change-Number: 16341
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 23:20:31 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7714: try to avoid be test crash in statestore

2020-08-14 Thread Tim Armstrong (Code Review)
Hello Andrew Sherman, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/16341

to look at the new patch set (#2).

Change subject: IMPALA-7714: try to avoid be test crash in statestore
..

IMPALA-7714: try to avoid be test crash in statestore

We didn't get to a clear root cause for this, so I'm going
to try two things.

First, under the theory that the problem is somehow the
destruction of the strings, convert them to char char*
which does not require destruction on process teardown.

Second, add some logging if the map lookup fails so
we can better understand what may have happened.

Change-Id: Id4363a93addb8a808d292906cac44ebd25c16889
---
M be/src/statestore/statestore.cc
M be/src/statestore/statestore.h
2 files changed, 9 insertions(+), 5 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/41/16341/2
--
To view, visit http://gerrit.cloudera.org:8080/16341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Id4363a93addb8a808d292906cac44ebd25c16889
Gerrit-Change-Number: 16341
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-9989 Improve admission control pool stats logging

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16220 )

Change subject: IMPALA-9989 Improve admission control pool stats logging
..


Patch Set 38:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6941/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16220
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id995a9d044082c3b8f044e1ec25bb4c64347f781
Gerrit-Change-Number: 16220
Gerrit-PatchSet: 38
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 22:59:26 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10039 (part 2): Fixed Expr-test crash due to race condition

2020-08-14 Thread Thomas Tauber-Marshall (Code Review)
Thomas Tauber-Marshall has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/16313 )

Change subject: IMPALA-10039 (part 2): Fixed Expr-test crash due to race 
condition
..

IMPALA-10039 (part 2): Fixed Expr-test crash due to race condition

The root cause for crash is that QueryState::Cancel() was called
before thread unsafe function QueryState::Init() was completed.
This patch fixs the race condition between QueryState::Cancel()
and QueryState::Init(). QueryState::Init() is safe to be called
at any time.

Testing:
 - The issue could be reproduced by running expr-test for 10-20
   iterations. Verified the fixing by running expr-test over 1000
   iterations without crash.
 - Passed TestProcessFailures::test_kill_coordinator.
 - Passed core tests.

Change-Id: Ib0d3b9c59924a25b70fa20afeb6e8ca93016eca9
Reviewed-on: http://gerrit.cloudera.org:8080/16313
Tested-by: Impala Public Jenkins 
Reviewed-by: Thomas Tauber-Marshall 
---
M be/src/runtime/query-state.cc
M be/src/runtime/query-state.h
2 files changed, 32 insertions(+), 7 deletions(-)

Approvals:
  Impala Public Jenkins: Verified
  Thomas Tauber-Marshall: Looks good to me, approved

--
To view, visit http://gerrit.cloudera.org:8080/16313
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Ib0d3b9c59924a25b70fa20afeb6e8ca93016eca9
Gerrit-Change-Number: 16313
Gerrit-PatchSet: 4
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-10039 (part 2): Fixed Expr-test crash due to race condition

2020-08-14 Thread Thomas Tauber-Marshall (Code Review)
Thomas Tauber-Marshall has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16313 )

Change subject: IMPALA-10039 (part 2): Fixed Expr-test crash due to race 
condition
..


Patch Set 3: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16313
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib0d3b9c59924a25b70fa20afeb6e8ca93016eca9
Gerrit-Change-Number: 16313
Gerrit-PatchSet: 3
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 14 Aug 2020 22:32:52 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10039 (part 2): Fixed Expr-test crash due to race condition

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16313 )

Change subject: IMPALA-10039 (part 2): Fixed Expr-test crash due to race 
condition
..


Patch Set 3: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/16313
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib0d3b9c59924a25b70fa20afeb6e8ca93016eca9
Gerrit-Change-Number: 16313
Gerrit-PatchSet: 3
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 14 Aug 2020 22:29:59 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7779 Parquet Scanner can write binary data into profile

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16331 )

Change subject: IMPALA-7779 Parquet Scanner can write binary data into profile
..


Patch Set 4: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/6293/


--
To view, visit http://gerrit.cloudera.org:8080/16331
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I281d6fa7cb2f88f04588110943e3e768678b9cf1
Gerrit-Change-Number: 16331
Gerrit-PatchSet: 4
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Comment-Date: Fri, 14 Aug 2020 22:27:26 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7714: try to avoid be test crash in statestore

2020-08-14 Thread Andrew Sherman (Code Review)
Andrew Sherman has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16341 )

Change subject: IMPALA-7714: try to avoid be test crash in statestore
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/16341/1/be/src/statestore/statestore.cc
File be/src/statestore/statestore.cc:

http://gerrit.cloudera.org:8080/#/c/16341/1/be/src/statestore/statestore.cc@396
PS1, Line 396:   // IMPALA-7714: log warning to aid debugging.
Should the logging go before the DCHECK?



--
To view, visit http://gerrit.cloudera.org:8080/16341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id4363a93addb8a808d292906cac44ebd25c16889
Gerrit-Change-Number: 16341
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 14 Aug 2020 22:10:31 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9989 Improve admission control pool stats logging

2020-08-14 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16220 )

Change subject: IMPALA-9989 Improve admission control pool stats logging
..


Patch Set 37:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/16220/37/be/src/scheduling/admission-controller.cc
File be/src/scheduling/admission-controller.cc:

http://gerrit.cloudera.org:8080/#/c/16220/37/be/src/scheduling/admission-controller.cc@911
PS37, Line 911:   if ( not_admitted_details )
  : *not_admitted_details = "";
nit: dont think this is necessary since a string object has a default 
initialization of an empty string. But in any case if you want to explicitly 
init it, it would be better to do this during QueueNode's initializer list


http://gerrit.cloudera.org:8080/#/c/16220/36/common/thrift/generate_error_codes.py
File common/thrift/generate_error_codes.py:

http://gerrit.cloudera.org:8080/#/c/16220/36/common/thrift/generate_error_codes.py@337
PS36, Line 337:  Details:
> Addressed in AdmissionController::CanAdmitRequest() by init the details str
What I meant was that if I see an error message like:
"Admission for query exceeded timeout 6ms in pool root.poolA. Queued 
reason: Queue non-empty Details: "

Here the details is empty, so i was proposing to get rid of "Details" if it 
does not exist and only print the error msg like:
"Admission for query exceeded timeout 6ms in pool root.poolA. Queued 
reason: Queue non-empty"



--
To view, visit http://gerrit.cloudera.org:8080/16220
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id995a9d044082c3b8f044e1ec25bb4c64347f781
Gerrit-Change-Number: 16220
Gerrit-PatchSet: 37
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 21:45:34 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9989 Improve admission control pool stats logging

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16220 )

Change subject: IMPALA-9989 Improve admission control pool stats logging
..


Patch Set 37:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6940/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16220
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id995a9d044082c3b8f044e1ec25bb4c64347f781
Gerrit-Change-Number: 16220
Gerrit-PatchSet: 37
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 21:30:04 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated HMS partitions

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated HMS partitions
..


Patch Set 8:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6939/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 8
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Comment-Date: Fri, 14 Aug 2020 21:16:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9989 Improve admission control pool stats logging

2020-08-14 Thread Qifan Chen (Code Review)
Qifan Chen has uploaded a new patch set (#37). ( 
http://gerrit.cloudera.org:8080/16220 )

Change subject: IMPALA-9989 Improve admission control pool stats logging
..

IMPALA-9989 Improve admission control pool stats logging

This work addresses the current limitation in admission controller by
appending the last known memory consumption statistics about a pool
to the existing memory exhaustion message. The statistics is
logged in impalad.INFO when a query is queued or timed out due to
memory pressure in the pool or on the host. The statistics can also be
part of the query profile.

The BNF of the new memory consumption statistics is as follows.

  topN_query_stats ::=
queries: a list of query Ids and memory consumed for up to 5 queries
 with top memory consumptions
total_consumed: total memory consumed by these topN queries
fraction_of_pool_total_mem: total memory consumed divided
  by pool memory usage (if
  feasible to report)

  all_query_stats ::=
num_running: the total number of queries running
min: the minimal memory consumption of all running queries
max: the maximal memory consumption of all running queries
pool_total_mem: the total memory consumption of all running queries
average: the average memory consumption of all running queries
 (if feasible to report)

  pool_stats ::=
  ":"
 
 

  stats_on_host ::=
 "Stats for host " 
 List of 

  aggregated_pool_stats ::=
"Aggregated stats for pool " 


  memory_consumption_statistics ::=
  | 

The stats_on_host describes memory consumption for every pool on
a host and is useful in analyzing memory exhaustion on that host.
The aggregated_pool_stats describes the aggregated memory consumption
on all hosts for a pool for a set of queries and is useful in analyzing
memory exhaustion in that pool.

Example of stats_on_host for pool root.queueB and root.queueC on
host host1:25000.

Stats for host host1:25000
   pool_name=root.queueB:
  topN_query_stats:
 queries=[
id=0001:0004, consumed=20.00 MB,
id=0001:0003, consumed=19.00 MB,
id=0001:0002, consumed=8.00 MB
 ],
 total_consumed=47.00 MB
 fraction_of_pool_total_mem=0.47
  all_query_stats:
 num_running=4,
 min=5.00 MB,
 max=20.00 MB,
 pool_total_mem=100.00 MB,
 average=25.00 MB
   pool_name=root.queueC:
  topN_query_stats:
 queries=[
id=0002:, consumed=18.00 MB,
id=0002:0001, consumed=12.00 MB
 ],
 total_consumed=30.00 MB
 fraction_of_pool_total_mem=0.06
  all_query_stats:
 num_running=40,
 min=10.00 MB,
 max=200.00 MB,
 pool_total_mem=500.00 MB,
 average=12.50 MB

Example of aggregated_pool_stats over all hosts for pool root.queueC:

Aggregated stats for pool root.queueC:
   topN_query_stats:
  queries=[
 id=0002:0001, consumed=32.00 MB,
 id=0002:0004, consumed=26.00 MB,
 id=0002:, consumed=21.00 MB,
 id=0002:0002, consumed=17.00 MB,
 id=0002:000e, consumed=9.00 MB
  ],
  total_consumed=105.00 MB
  fraction_of_pool_total_mem=0.82

When a query request is queued due to memory exhaustion, the above
memory_consumption_statistics is logged when the logging is set
at level 2 or higher.

When a query request is timed out due to memory exhaustion, the above
memory_consumption_statistics is reported when the logging is set
at level 1 or higher.

Testing:
1. Added a new test TopNQueryCheck in admission-controller-test.cc to
   verify that the topN query memory consumption details are reported
   correctly.
2. Add two new tests in test_admission_controller.py to simulate
   queries being queued and then timed out due to pool or host memory
   pressure.
3. Added a new test TopN in mem-tracker-test.cc to
   verify that the topN query memory consumption details are computed
   correctly from a mem tracker hierarchy.
4. Ran Core tests successfully.

Change-Id: Id995a9d044082c3b8f044e1ec25bb4c64347f781
---
M be/src/runtime/mem-tracker-test.cc
M be/src/runtime/mem-tracker.cc
M be/src/runtime/mem-tracker.h
M be/src/scheduling/admission-controller-test.cc
M be/src/scheduling/admission-controller.cc
M be/src/scheduling/admission-controller.h
M be/src/util/container-util.h
M common/thrift/StatestoreService.thrift
M common/thrift/generate_error_codes.py
M tests/custom_cluster/test_admission_controller.py
10 files changed, 916 insertions(+), 47 

[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated HMS partitions

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated HMS partitions
..


Patch Set 7:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6938/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 7
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Comment-Date: Fri, 14 Aug 2020 21:09:47 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated HMS partitions

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated HMS partitions
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6937/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 6
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Comment-Date: Fri, 14 Aug 2020 21:06:50 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated HMS partitions

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated HMS partitions
..


Patch Set 5:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6936/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 5
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Comment-Date: Fri, 14 Aug 2020 21:01:47 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated HMS partitions

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated HMS partitions
..


Patch Set 4:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6935/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 4
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Comment-Date: Fri, 14 Aug 2020 20:59:05 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated HMS partitions

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated HMS partitions
..


Patch Set 8:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6296/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 8
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Comment-Date: Fri, 14 Aug 2020 20:50:38 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated HMS partitions

2020-08-14 Thread Vihang Karajgaonkar (Code Review)
Vihang Karajgaonkar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated HMS partitions
..


Patch Set 7:

rebased to latest master.


--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 7
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Comment-Date: Fri, 14 Aug 2020 20:50:10 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated HMS partitions

2020-08-14 Thread Vihang Karajgaonkar (Code Review)
Vihang Karajgaonkar has uploaded a new patch set (#8). ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated HMS partitions
..

IMPALA-4364: Query option to refresh updated HMS partitions

This patch introduces a new boolean query option
REFRESH_UPDATED_HMS_PARTITIONS. When this query option is set
the refresh table command reloads the partitions which have been
modified in HMS in addition to adding [removing] the new [removed]
partitions.

In order to do this the refresh table command needs to fetch all
the partitions instead of the just the partition names which can
cause the performance of refresh table to degrade when the query
option is set. However for certain use-cases currently there is
no way to detect changed partitions using refresh table command.
For instance, if certain partition locations have been changed,
a refresh table will not update those partitions.

Testing:
1. Added a new test which sets the query option and makes sure
that the updated partitions from hive are reloaded after refresh
table command.
2. Ran exhaustive tests with the patch.

Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
---
M be/src/service/query-options.cc
M be/src/service/query-options.h
M common/thrift/CatalogService.thrift
M common/thrift/ImpalaInternalService.thrift
M common/thrift/ImpalaService.thrift
M fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java
M fe/src/main/java/org/apache/impala/catalog/HdfsPartition.java
M fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/main/java/org/apache/impala/service/Frontend.java
M tests/metadata/test_reset_metadata.py
11 files changed, 464 insertions(+), 106 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/08/16308/8
--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 8
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 


[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated HMS partitions

2020-08-14 Thread Vihang Karajgaonkar (Code Review)
Vihang Karajgaonkar has uploaded a new patch set (#7). ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated HMS partitions
..

IMPALA-4364: Query option to refresh updated HMS partitions

This patch introduces a new boolean query option
REFRESH_UPDATED_HMS_PARTITIONS. When this query option is set
the refresh table command reloads the partitions which have been
modified in HMS in addition to adding [removing] the new [removed]
partitions.

In order to do this the refresh table command needs to fetch all
the partitions instead of the just the partition names which can
cause the performance of refresh table to degrade when the query
option is set. However for certain use-cases currently there is
no way to detect changed partitions using refresh table command.
For instance, if certain partition locations have been changed,
a refresh table will not update those partitions.

Testing:
1. Added a new test which sets the query option and makes sure
that the updated partitions from hive are reloaded after refresh
table command.
2. Ran exhaustive tests with the patch.

Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
---
M be/src/service/query-options.cc
M be/src/service/query-options.h
M common/thrift/CatalogService.thrift
M common/thrift/ImpalaInternalService.thrift
M common/thrift/ImpalaService.thrift
M fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java
M fe/src/main/java/org/apache/impala/catalog/HdfsPartition.java
M fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/main/java/org/apache/impala/service/Frontend.java
M tests/metadata/test_reset_metadata.py
11 files changed, 464 insertions(+), 106 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/08/16308/7
--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 7
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 


[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated HMS partitions

2020-08-14 Thread Vihang Karajgaonkar (Code Review)
Vihang Karajgaonkar has uploaded a new patch set (#6). ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated HMS partitions
..

IMPALA-4364: Query option to refresh updated HMS partitions

This patch introduces a new boolean query option
REFRESH_UPDATED_HMS_PARTITIONS. When this query option is set
the refresh table command reloads the partitions which have been
modified in HMS in addition to adding [removing] the new [removed]
partitions.

In order to do this the refresh table command needs to fetch all
the partitions instead of the just the partition names which can
cause the performance of refresh table to degrade when the query
option is set. However for certain use-cases currently there is
no way to detect changed partitions using refresh table command.
For instance, if certain partition locations have been changed,
a refresh table will not update those partitions.

Testing:
1. Added a new test which sets the query option and makes sure
that the updated partitions from hive are reloaded after refresh
table command.
2. Ran exhaustive tests with the patch.

Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
---
M be/src/service/query-options.cc
M be/src/service/query-options.h
M common/thrift/CatalogService.thrift
M common/thrift/ImpalaInternalService.thrift
M common/thrift/ImpalaService.thrift
M fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java
M fe/src/main/java/org/apache/impala/catalog/HdfsPartition.java
M fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/main/java/org/apache/impala/service/Frontend.java
M tests/metadata/test_reset_metadata.py
11 files changed, 460 insertions(+), 106 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/08/16308/6
--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 6
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 


[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated HMS partitions

2020-08-14 Thread Vihang Karajgaonkar (Code Review)
Vihang Karajgaonkar has uploaded a new patch set (#5). ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated HMS partitions
..

IMPALA-4364: Query option to refresh updated HMS partitions

This patch introduces a new boolean query option
REFRESH_UPDATED_HMS_PARTITIONS. When this query option is set
the refresh table command reloads the partitions which have been
modified in HMS in addition to adding [removing] the new [removed]
partitions.

In order to do this the refresh table command needs to fetch all
the partitions instead of the just the partition names which can
cause the performance of refresh table to degrade when the query
option is set. However for certain use-cases currently there is
no way to detect changed partitions using refresh table command.
For instance, if certain partition locations have been changed,
a refresh table will not update those partitions.

Testing:
1. Added a new test which sets the query option and makes sure
that the updated partitions from hive are reloaded after refresh
table command.
2. Ran exhaustive tests with the patch.

Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
---
M be/src/service/client-request-state.cc
M be/src/service/query-options.cc
M be/src/service/query-options.h
M common/thrift/CatalogService.thrift
M common/thrift/ImpalaInternalService.thrift
M common/thrift/ImpalaService.thrift
M fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java
M fe/src/main/java/org/apache/impala/catalog/HdfsPartition.java
M fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/main/java/org/apache/impala/service/Frontend.java
M tests/metadata/test_reset_metadata.py
12 files changed, 461 insertions(+), 107 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/08/16308/5
--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 5
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 


[Impala-ASF-CR] IMPALA-4364: Query option to refresh updated partitions

2020-08-14 Thread Vihang Karajgaonkar (Code Review)
Vihang Karajgaonkar has uploaded a new patch set (#4). ( 
http://gerrit.cloudera.org:8080/16308 )

Change subject: IMPALA-4364: Query option to refresh updated partitions
..

IMPALA-4364: Query option to refresh updated partitions

This patch introduces a new boolean query option
REFRESH_UPDATED_HMS_PARTITIONS. When this query option is set
the refresh table command reloads the partitions which have been
modified in HMS in addition to adding [removing] the new [removed]
partitions.

In order to do this the refresh table command needs to fetch all
the partitions instead of the just the partition names which can
cause the performance of refresh table to degrade when the query
option is set. However for certain use-cases currently there is
no way to detect changed partitions using refresh table command.
For instance, if certain partition locations have been changed,
a refresh table will not update those partitions.

Testing:
1. Added a new test which sets the query option and makes sure
that the updated partitions from hive are reloaded after refresh
table command.
2. Ran exhaustive tests with the patch.

Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
---
M be/src/service/client-request-state.cc
M be/src/service/query-options.cc
M be/src/service/query-options.h
M common/thrift/CatalogService.thrift
M common/thrift/ImpalaInternalService.thrift
M common/thrift/ImpalaService.thrift
M fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java
M fe/src/main/java/org/apache/impala/catalog/HdfsPartition.java
M fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/main/java/org/apache/impala/service/Frontend.java
M tests/metadata/test_reset_metadata.py
12 files changed, 461 insertions(+), 107 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/08/16308/4
--
To view, visit http://gerrit.cloudera.org:8080/16308
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I50e8680509f4eb0712e7bb3de44df5f2952179af
Gerrit-Change-Number: 16308
Gerrit-PatchSet: 4
Gerrit-Owner: Vihang Karajgaonkar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vihang Karajgaonkar 


[Impala-ASF-CR] IMPALA-9955,IMPALA-9957: Fix not enough reservation for large pages in GroupingAggregator

2020-08-14 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16240 )

Change subject: IMPALA-9955,IMPALA-9957: Fix not enough reservation for large 
pages in GroupingAggregator
..


Patch Set 9:

(3 comments)

Ok, I think I'm nearly happy with this.

I thought a bit more about the other operators and I don't think there are 
similar bugs. AnalyticEvalNode and the plan root sink only have single streams, 
so there's no complicated logic moving reservations between streams. 
PartitionedHashJoinNode doesn't have the same problem here - it can always 
immediately unpin a stream to spill each partition. I.e. it doesn't have the 
problem where it needs to allocate additional memory to spill a partition.

http://gerrit.cloudera.org:8080/#/c/16240/9/be/src/exec/grouping-aggregator-partition.cc
File be/src/exec/grouping-aggregator-partition.cc:

http://gerrit.cloudera.org:8080/#/c/16240/9/be/src/exec/grouping-aggregator-partition.cc@234
PS9, Line 234:   --parent->num_pinned_hash_partitions_;
If we were going to keep maintaining this, I think we'd want a wrapper that 
decremented this along with setting hash_tbl to null, to maintain the invariant 
that it's equal to the number of partitions where is_spilled() is true.


http://gerrit.cloudera.org:8080/#/c/16240/9/be/src/exec/grouping-aggregator-partition.cc@243
PS9, Line 243: void GroupingAggregator::Partition::Close(bool finalize_rows) {
Do we need to decrement num_pinned_hash_partitions here too, if it was counted 
as pinned before?

I looked and I guess by the time we're closing partitions we won't spill any 
more, so it might not matter as far as the code working, but it's confusing if 
the count can get out of sync.


http://gerrit.cloudera.org:8080/#/c/16240/9/be/src/exec/grouping-aggregator.cc
File be/src/exec/grouping-aggregator.cc:

http://gerrit.cloudera.org:8080/#/c/16240/9/be/src/exec/grouping-aggregator.cc@651
PS9, Line 651:   while (num_pinned_hash_partitions_ > 0) {
I think I actually preferred computing num_pinned_hash_partitions in this 
function because there was less state to reason about - I don't have to check 
that the counter is updated whenever a partition is spilled.. I think 
num_pinned_hash_partitions_ gets inconsistent when you close the hash 
partitions so it's a little confusing.



--
To view, visit http://gerrit.cloudera.org:8080/16240
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3d9c3a2e7f0da60071b920dec979729e86459775
Gerrit-Change-Number: 16240
Gerrit-PatchSet: 9
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 20:33:52 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10073: Create shaded dependency for S3A and aws-java-sdk-bundle

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16342 )

Change subject: IMPALA-10073: Create shaded dependency for S3A and 
aws-java-sdk-bundle
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6934/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16342
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I0939f73be986f83cc1fd07921563b4d9201780f2
Gerrit-Change-Number: 16342
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 14 Aug 2020 20:01:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON instructions

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15531 )

Change subject: IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON 
instructions
..


Patch Set 45:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6933/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15531
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id7dfe17125b2910ece54e7dd18b4e4b25d7de8b9
Gerrit-Change-Number: 15531
Gerrit-PatchSet: 45
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 19:58:05 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7714: try to avoid be test crash in statestore

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16341 )

Change subject: IMPALA-7714: try to avoid be test crash in statestore
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6932/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id4363a93addb8a808d292906cac44ebd25c16889
Gerrit-Change-Number: 16341
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 14 Aug 2020 19:48:29 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10073: Create shaded dependency for S3A and aws-java-sdk-bundle

2020-08-14 Thread Sahil Takiar (Code Review)
Sahil Takiar has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/16342


Change subject: IMPALA-10073: Create shaded dependency for S3A and 
aws-java-sdk-bundle
..

IMPALA-10073: Create shaded dependency for S3A and aws-java-sdk-bundle

The aws-java-sdk-bundle is one of the largest dependencies in the Impala
Docker images and continues to grow. The jar includes SDKs for
every single AWS service.

This patch removes most of the unnecessary SDKs from the
aws-java-sdk-bundle, thus drastically decreasing the size of the
dependency. The Maven shade plugin is used to do this, and the
implementation is similar to what is currently done for the hive-exec
jar.

This patch takes a conservative approach to removing packages from the
aws-java-sdk-bundle jar, and I ensured no direct dependencies of the S3
SDK were removed. The idea is to only remove dependencies that S3A would
never conceivably need. Given the huge number of AWS services, I only
focused on removing the largest SDKs (the size of each SDK is estimated
by the number of classes in the SDK).

This decreases the size of the Docker images by about 100 MB.

Testing:
* Ran core tests against S3

Change-Id: I0939f73be986f83cc1fd07921563b4d9201780f2
---
M CMakeLists.txt
M ext-data-source/CMakeLists.txt
M fe/CMakeLists.txt
M fe/pom.xml
D shaded-deps/.gitignore
C shaded-deps/hive-exec/CMakeLists.txt
R shaded-deps/hive-exec/pom.xml
R shaded-deps/s3a-aws-sdk/CMakeLists.txt
A shaded-deps/s3a-aws-sdk/pom.xml
9 files changed, 212 insertions(+), 9 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/42/16342/1
--
To view, visit http://gerrit.cloudera.org:8080/16342
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I0939f73be986f83cc1fd07921563b4d9201780f2
Gerrit-Change-Number: 16342
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 


[Impala-ASF-CR] IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON instructions

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15531 )

Change subject: IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON 
instructions
..


Patch Set 45:

(8 comments)

http://gerrit.cloudera.org:8080/#/c/15531/45/be/src/util/sse2neon.h
File be/src/util/sse2neon.h:

http://gerrit.cloudera.org:8080/#/c/15531/45/be/src/util/sse2neon.h@213
PS45, Line 213: // 
https://msdn.microsoft.com/en-us/library/bb514059%28v=vs.120%29.aspx?f=255=-2147217396
line too long (99 > 90)


http://gerrit.cloudera.org:8080/#/c/15531/45/be/src/util/sse2neon.h@406
PS45, Line 406: // 
https://docs.microsoft.com/en-us/previous-versions/visualstudio/visual-studio-2010/whtfzhzk(v=vs.100)
line too long (104 > 90)


http://gerrit.cloudera.org:8080/#/c/15531/45/be/src/util/sse2neon.h@413
PS45, Line 413: // 
https://software.intel.com/sites/landingpage/IntrinsicsGuide/#text=_mm_set1_epi64x=4961
line too long (97 > 90)


http://gerrit.cloudera.org:8080/#/c/15531/45/be/src/util/sse2neon.h@1054
PS45, Line 1054: // 
https://software.intel.com/sites/landingpage/IntrinsicsGuide/#text=_mm_shuffle_epi8=5146
line too long (98 > 90)


http://gerrit.cloudera.org:8080/#/c/15531/45/be/src/util/sse2neon.h@1199
PS45, Line 1199: // 
https://docs.microsoft.com/en-us/previous-versions/visualstudio/visual-studio-2010/y41dkk37(v=vs.100)
line too long (104 > 90)


http://gerrit.cloudera.org:8080/#/c/15531/45/be/src/util/sse2neon.h@1645
PS45, Line 1645: // 
https://software.intel.com/sites/landingpage/IntrinsicsGuide/#text=_mm_test_all_zeros=5871
line too long (100 > 90)


http://gerrit.cloudera.org:8080/#/c/15531/45/be/src/util/sse2neon.h@3581
PS45, Line 3581: // 
https://github.com/ColinIanKing/linux-next-mirror/blob/b5f466091e130caaf0735976648f72bd5e09aa84/crypto/aegis128-neon-inner.c#L52
line too long (131 > 90)


http://gerrit.cloudera.org:8080/#/c/15531/45/be/src/util/sse2neon.h@3681
PS45, Line 3681: // 
cpp-compiler-developer-guide-and-reference-allocating-and-freeing-aligned-memory-blocks
line too long (98 > 90)



--
To view, visit http://gerrit.cloudera.org:8080/15531
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id7dfe17125b2910ece54e7dd18b4e4b25d7de8b9
Gerrit-Change-Number: 15531
Gerrit-PatchSet: 45
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 19:37:47 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON instructions

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15531 )

Change subject: IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON 
instructions
..


Patch Set 45:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6295/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/15531
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id7dfe17125b2910ece54e7dd18b4e4b25d7de8b9
Gerrit-Change-Number: 15531
Gerrit-PatchSet: 45
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 19:37:47 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON instructions

2020-08-14 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15531 )

Change subject: IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON 
instructions
..


Patch Set 45: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/15531
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id7dfe17125b2910ece54e7dd18b4e4b25d7de8b9
Gerrit-Change-Number: 15531
Gerrit-PatchSet: 45
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 19:37:39 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON instructions

2020-08-14 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15531 )

Change subject: IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON 
instructions
..


Patch Set 44:

(1 comment)

I looked at the startup failure and the problem was pretty clear, so I will 
rerun with that part of the change removed.

http://gerrit.cloudera.org:8080/#/c/15531/44/be/src/codegen/gen_ir_descriptions.py
File be/src/codegen/gen_ir_descriptions.py:

http://gerrit.cloudera.org:8080/#/c/15531/44/be/src/codegen/gen_ir_descriptions.py@245
PS44, Line 245: if platform.processor() != "aarch64":
I missed that this function was removed in 
http://gerrit.cloudera.org:8080/15683. So I don't think we need to modify this 
file. That's causing the startup failure cause the file no longer exists



--
To view, visit http://gerrit.cloudera.org:8080/15531
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id7dfe17125b2910ece54e7dd18b4e4b25d7de8b9
Gerrit-Change-Number: 15531
Gerrit-PatchSet: 44
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 19:37:23 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON instructions

2020-08-14 Thread Tim Armstrong (Code Review)
Tim Armstrong has uploaded a new patch set (#45) to the change originally 
created by zhaoren...@hotmail.com. ( http://gerrit.cloudera.org:8080/15531 )

Change subject: IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON 
instructions
..

IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON instructions

Replace Intel's SSE instructions with ARM's NEON instructions
Replace Intel's crc32 instructions with ARM's instructions
Replace Intel's popcntq instruction with ARM's mechanism
Replace Intel's pcmpestri and pcmpestrm instructions
with ARM mechanism

Change-Id: Id7dfe17125b2910ece54e7dd18b4e4b25d7de8b9
---
M CMakeLists.txt
M be/CMakeLists.txt
M be/src/benchmarks/bswap-benchmark.cc
M be/src/benchmarks/int-hash-benchmark.cc
M be/src/codegen/CMakeLists.txt
M be/src/codegen/llvm-codegen-test.cc
M be/src/exec/delimited-text-parser.inline.h
M be/src/kudu/util/block_bloom_filter.cc
M be/src/kudu/util/group_varint-inl.h
M be/src/kudu/util/group_varint-test.cc
A be/src/kudu/util/sse2neon.h
M be/src/util/bit-util-test.cc
M be/src/util/bit-util.cc
M be/src/util/bit-util.h
M be/src/util/bloom-filter.cc
M be/src/util/bloom-filter.h
M be/src/util/hash-util-ir.cc
M be/src/util/hash-util.h
M be/src/util/sse-util.h
A be/src/util/sse2neon.h
M bin/rat_exclude_files.txt
21 files changed, 3,966 insertions(+), 35 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/31/15531/45
--
To view, visit http://gerrit.cloudera.org:8080/15531
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Id7dfe17125b2910ece54e7dd18b4e4b25d7de8b9
Gerrit-Change-Number: 15531
Gerrit-PatchSet: 45
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-7714: try to avoid be test crash in statestore

2020-08-14 Thread Tim Armstrong (Code Review)
Tim Armstrong has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/16341


Change subject: IMPALA-7714: try to avoid be test crash in statestore
..

IMPALA-7714: try to avoid be test crash in statestore

We didn't get to a clear root cause for this, so I'm going
to try two things.

First, under the theory that the problem is somehow the
destruction of the strings, convert them to char char*
which does not require destruction on process teardown.

Second, add some logging if the map lookup fails so
we can better understand what may have happened.

Change-Id: Id4363a93addb8a808d292906cac44ebd25c16889
---
M be/src/statestore/statestore.cc
M be/src/statestore/statestore.h
2 files changed, 8 insertions(+), 4 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/41/16341/1
--
To view, visit http://gerrit.cloudera.org:8080/16341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Id4363a93addb8a808d292906cac44ebd25c16889
Gerrit-Change-Number: 16341
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9989 Improve admission control pool stats logging

2020-08-14 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16220 )

Change subject: IMPALA-9989 Improve admission control pool stats logging
..


Patch Set 36:

(4 comments)

http://gerrit.cloudera.org:8080/#/c/16220/34/be/src/runtime/mem-tracker.cc
File be/src/runtime/mem-tracker.cc:

http://gerrit.cloudera.org:8080/#/c/16220/34/be/src/runtime/mem-tracker.cc@461
PS34, Line 461:   heavMemoryQuery.__set_queryId(tracker->query_id_);
> nit: can you print the mem-tracker->label here so thats its easy to debug i
can you address this too


http://gerrit.cloudera.org:8080/#/c/16220/36/be/src/scheduling/admission-controller.h
File be/src/scheduling/admission-controller.h:

http://gerrit.cloudera.org:8080/#/c/16220/36/be/src/scheduling/admission-controller.h@640
PS36, Line 640: friend class MemTracker;
why do we need to add this as a friend class?


http://gerrit.cloudera.org:8080/#/c/16220/36/be/src/scheduling/admission-controller.cc
File be/src/scheduling/admission-controller.cc:

http://gerrit.cloudera.org:8080/#/c/16220/36/be/src/scheduling/admission-controller.cc@1626
PS36, Line 1626: << " Details:" << queue_node->not_admitted_details;
see comment in generate_error_codes.py


http://gerrit.cloudera.org:8080/#/c/16220/36/common/thrift/generate_error_codes.py
File common/thrift/generate_error_codes.py:

http://gerrit.cloudera.org:8080/#/c/16220/36/common/thrift/generate_error_codes.py@337
PS36, Line 337:  Details:
nit: details might not exist. So can get confusing if its left empty. You can 
probably just add $3 and append an empty string if it doesnt exist.



--
To view, visit http://gerrit.cloudera.org:8080/16220
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id995a9d044082c3b8f044e1ec25bb4c64347f781
Gerrit-Change-Number: 16220
Gerrit-PatchSet: 36
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 18:48:51 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9225: Query option for retryable queries to spool all results before returning any to the client

2020-08-14 Thread Sahil Takiar (Code Review)
Sahil Takiar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16323 )

Change subject: IMPALA-9225: Query option for retryable queries to spool all 
results before returning any to the client
..


Patch Set 6:

(10 comments)

starting to look good

http://gerrit.cloudera.org:8080/#/c/16323/6//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/16323/6//COMMIT_MSG@20
PS6, Line 20: spool_all_results_for_retries
might be worth mentioning that if retry_failed_queries = false or if 
spool_query_results = false, this option won't take any effect either.


http://gerrit.cloudera.org:8080/#/c/16323/6//COMMIT_MSG@23
PS6, Line 23: To implement this, this patch defers the time when a retryable 
query
: runs into the FINISHED stat
I think this merits a bit more clarification. in the normal case, a query is in 
the FINISHED state as soon as rows are ready to be fetched. conceptually, we 
aren't really changing anything about this state, even when this feature is 
triggered , a transition to the FINISHED state indicates that rows are ready to 
be fetched.

the only difference is what it means for "rows are ready to be fetched". when 
this feature is *not* triggered, rows are ready to be fetched as soon as *any* 
rows are available. when this feature is triggered, rows are only ready to be 
fetched when *all* rows are available.


http://gerrit.cloudera.org:8080/#/c/16323/6//COMMIT_MSG@25
PS6, Line 25: results spooling will be
: implicitly enabled
I think this feature should also be dependent on result spooling being enabled 
by the user (e.g. spool_query_results = true). otherwise users might find it a 
bit confusing that result spooling is used even when spool_query_results is 
false.

I think users might want to set spool_all_results_for_retries to false if they 
find that spooling all results introduces too much overhead.

I think it would be rare that they set spool_query_results = false, the only 
reason I can see that happening is if there is a bug in the implementation.


http://gerrit.cloudera.org:8080/#/c/16323/6/be/src/exec/buffered-plan-root-sink.cc
File be/src/exec/buffered-plan-root-sink.cc:

http://gerrit.cloudera.org:8080/#/c/16323/6/be/src/exec/buffered-plan-root-sink.cc@82
PS6, Line 82: if (state->query_options().retry_failed_queries
: && 
state->query_options().spool_all_results_for_retries) {
:   state->LogError(ErrorMsg(TErrorCode::GENERAL, "Cannot 
spool all results. "
:   "Query retry will be skipped if any results have 
been returned."));
it would be nice if this log statement was moved elsewhere (perhaps the 
coordinator?) to avoid leaking the query retry feature into the result spooling 
implementation. right now the two are separate, and it would be nice to keep it 
that way.


http://gerrit.cloudera.org:8080/#/c/16323/6/be/src/exec/buffered-plan-root-sink.cc@87
PS6, Line 87: discard_result(spooled_all_results_.Set(Status::OK()));
so I think what is happening here is that if the queue is full, and no more 
results can be spooled. we set spooled_all_results_ in order to unblock the 
coordinator, and as long as no results have been fetched, the query can still 
be retried, is that correct?

that is definitely a nice property to have here. the one nit is that perhaps we 
should rename spooled_all_results_ to something else then? technically this 
line is setting spooled_all_results_ to true, even though all results have not 
been spooled.


http://gerrit.cloudera.org:8080/#/c/16323/6/be/src/runtime/coordinator.cc
File be/src/runtime/coordinator.cc:

http://gerrit.cloudera.org:8080/#/c/16323/6/be/src/runtime/coordinator.cc@803
PS6, Line 803: Status status = 
UpdateExecState(coord_instance_->WaitForOpen(),
nit: the typical pattern is to wrap the call to `UpdateExecState` in 
`RETURN_IF_ERROR`, and then return Status::OK() on line 814


http://gerrit.cloudera.org:8080/#/c/16323/6/be/src/runtime/fragment-instance-state.cc
File be/src/runtime/fragment-instance-state.cc:

http://gerrit.cloudera.org:8080/#/c/16323/6/be/src/runtime/fragment-instance-state.cc@91
PS6, Line 91: VLOG_QUERY << "Set opened_promise_ after Prepare()";
nit: delete?


http://gerrit.cloudera.org:8080/#/c/16323/6/be/src/runtime/fragment-instance-state.cc@100
PS6, Line 100:   VLOG_QUERY << "Set opended_promise_ after Open";
nit: delete?


http://gerrit.cloudera.org:8080/#/c/16323/6/be/src/runtime/fragment-instance-state.cc@112
PS6, Line 112:   
reinterpret_cast(sink_)->MarkSpooledAllResults(status);
can you just do this in the FlushFinal method of BufferedPlanRootSink? I don't 
think setting the spooled_all_results_ flag has to be done specifically when 
query retries are enabled either


http://gerrit.cloudera.org:8080/#/c/16323/6/be/src/runtime/fragment-instance-state.cc@438
PS6, Line 438: 

[Impala-ASF-CR] IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON instructions

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15531 )

Change subject: IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON 
instructions
..


Patch Set 44: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/6292/


--
To view, visit http://gerrit.cloudera.org:8080/15531
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id7dfe17125b2910ece54e7dd18b4e4b25d7de8b9
Gerrit-Change-Number: 15531
Gerrit-PatchSet: 44
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 17:25:43 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10039 (part 2): Fixed Expr-test crash due to race condition

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16313 )

Change subject: IMPALA-10039 (part 2): Fixed Expr-test crash due to race 
condition
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6294/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16313
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib0d3b9c59924a25b70fa20afeb6e8ca93016eca9
Gerrit-Change-Number: 16313
Gerrit-PatchSet: 3
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 14 Aug 2020 17:19:28 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7779 Parquet Scanner can write binary data into profile

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16331 )

Change subject: IMPALA-7779 Parquet Scanner can write binary data into profile
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16331
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I281d6fa7cb2f88f04588110943e3e768678b9cf1
Gerrit-Change-Number: 16331
Gerrit-PatchSet: 4
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Comment-Date: Fri, 14 Aug 2020 17:18:33 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7779 Parquet Scanner can write binary data into profile

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16331 )

Change subject: IMPALA-7779 Parquet Scanner can write binary data into profile
..


Patch Set 4:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6293/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16331
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I281d6fa7cb2f88f04588110943e3e768678b9cf1
Gerrit-Change-Number: 16331
Gerrit-PatchSet: 4
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Comment-Date: Fri, 14 Aug 2020 17:18:34 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7779 Parquet Scanner can write binary data into profile

2020-08-14 Thread Sahil Takiar (Code Review)
Sahil Takiar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16331 )

Change subject: IMPALA-7779 Parquet Scanner can write binary data into profile
..


Patch Set 3: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16331
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I281d6fa7cb2f88f04588110943e3e768678b9cf1
Gerrit-Change-Number: 16331
Gerrit-PatchSet: 3
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Comment-Date: Fri, 14 Aug 2020 17:18:01 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10039 (part 2): Fixed Expr-test crash due to race condition

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16313 )

Change subject: IMPALA-10039 (part 2): Fixed Expr-test crash due to race 
condition
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6931/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16313
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib0d3b9c59924a25b70fa20afeb6e8ca93016eca9
Gerrit-Change-Number: 16313
Gerrit-PatchSet: 3
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 14 Aug 2020 17:17:31 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10039 (part 2): Fixed Expr-test crash due to race condition

2020-08-14 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has uploaded a new patch set (#3). ( 
http://gerrit.cloudera.org:8080/16313 )

Change subject: IMPALA-10039 (part 2): Fixed Expr-test crash due to race 
condition
..

IMPALA-10039 (part 2): Fixed Expr-test crash due to race condition

The root cause for crash is that QueryState::Cancel() was called
before thread unsafe function QueryState::Init() was completed.
This patch fixs the race condition between QueryState::Cancel()
and QueryState::Init(). QueryState::Init() is safe to be called
at any time.

Testing:
 - The issue could be reproduced by running expr-test for 10-20
   iterations. Verified the fixing by running expr-test over 1000
   iterations without crash.
 - Passed TestProcessFailures::test_kill_coordinator.
 - Passed core tests.

Change-Id: Ib0d3b9c59924a25b70fa20afeb6e8ca93016eca9
---
M be/src/runtime/query-state.cc
M be/src/runtime/query-state.h
2 files changed, 32 insertions(+), 7 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/13/16313/3
--
To view, visit http://gerrit.cloudera.org:8080/16313
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ib0d3b9c59924a25b70fa20afeb6e8ca93016eca9
Gerrit-Change-Number: 16313
Gerrit-PatchSet: 3
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-7779 Parquet Scanner can write binary data into profile

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16331 )

Change subject: IMPALA-7779 Parquet Scanner can write binary data into profile
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6930/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16331
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I281d6fa7cb2f88f04588110943e3e768678b9cf1
Gerrit-Change-Number: 16331
Gerrit-PatchSet: 3
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Comment-Date: Fri, 14 Aug 2020 16:45:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7782: fix constant NOT IN subqueries that can return 0 rows

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16338 )

Change subject: IMPALA-7782: fix constant NOT IN subqueries that can return 0 
rows
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6929/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16338
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I66c726f0f66ce2f609e6ba44057191f5929a67fc
Gerrit-Change-Number: 16338
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Shant Hovsepian 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Fri, 14 Aug 2020 16:31:22 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7779 Parquet Scanner can write binary data into profile

2020-08-14 Thread Qifan Chen (Code Review)
Qifan Chen has uploaded a new patch set (#3). ( 
http://gerrit.cloudera.org:8080/16331 )

Change subject: IMPALA-7779 Parquet Scanner can write binary data into profile
..

IMPALA-7779 Parquet Scanner can write binary data into profile

This fix addresses the current limitation in that an ill-formatted
Parquet version string is not properly formatted before appearing
in an error message or impalad.INFO. With the fix, any such string is
converted to a hex string first. The hex string is a sequence of
four hex digit groups separated by spaces and each group is one or
two hex digits, such as "6c 65 2e a".

Testing:
 Ran "core" tests successfully.

Change-Id: I281d6fa7cb2f88f04588110943e3e768678b9cf1
---
M be/src/exec/parquet/hdfs-parquet-scanner.cc
M common/thrift/generate_error_codes.py
M testdata/workloads/functional-query/queries/QueryTest/parquet.test
3 files changed, 4 insertions(+), 3 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/31/16331/3
--
To view, visit http://gerrit.cloudera.org:8080/16331
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I281d6fa7cb2f88f04588110943e3e768678b9cf1
Gerrit-Change-Number: 16331
Gerrit-PatchSet: 3
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 


[Impala-ASF-CR] IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON instructions

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15531 )

Change subject: IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON 
instructions
..


Patch Set 44:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6292/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/15531
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id7dfe17125b2910ece54e7dd18b4e4b25d7de8b9
Gerrit-Change-Number: 15531
Gerrit-PatchSet: 44
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 16:16:43 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON instructions

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15531 )

Change subject: IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON 
instructions
..


Patch Set 44: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/15531
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id7dfe17125b2910ece54e7dd18b4e4b25d7de8b9
Gerrit-Change-Number: 15531
Gerrit-PatchSet: 44
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 16:16:42 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON instructions

2020-08-14 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15531 )

Change subject: IMPALA-9544 Replace Intel's SSE instructions with ARM's NEON 
instructions
..


Patch Set 43: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/15531
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id7dfe17125b2910ece54e7dd18b4e4b25d7de8b9
Gerrit-Change-Number: 15531
Gerrit-PatchSet: 43
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 16:16:29 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7782: fix constant NOT IN subqueries that can return 0 rows

2020-08-14 Thread Tim Armstrong (Code Review)
Tim Armstrong has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/16338


Change subject: IMPALA-7782: fix constant NOT IN subqueries that can return 0 
rows
..

IMPALA-7782: fix constant NOT IN subqueries that can return 0 rows

The bug was the the statement rewriter converted NOT IN 
predicates to !=  predicates when the subquery could
be an empty set. This was invalid, because NOT IN ()
is true, but != () is false.

Testing:
Added targeted planner and end-to-end tests.

Ran exhaustive tests.

Change-Id: I66c726f0f66ce2f609e6ba44057191f5929a67fc
---
M fe/src/main/java/org/apache/impala/analysis/StmtRewriter.java
M 
testdata/workloads/functional-planner/queries/PlannerTest/subquery-rewrite.test
M 
testdata/workloads/functional-query/queries/QueryTest/subquery-in-constant-lhs.test
3 files changed, 115 insertions(+), 5 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/38/16338/1
--
To view, visit http://gerrit.cloudera.org:8080/16338
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I66c726f0f66ce2f609e6ba44057191f5929a67fc
Gerrit-Change-Number: 16338
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9989 Improve admission control pool stats logging

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16220 )

Change subject: IMPALA-9989 Improve admission control pool stats logging
..


Patch Set 36:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6928/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16220
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id995a9d044082c3b8f044e1ec25bb4c64347f781
Gerrit-Change-Number: 16220
Gerrit-PatchSet: 36
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 14:26:26 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9989 Improve admission control pool stats logging

2020-08-14 Thread Qifan Chen (Code Review)
Qifan Chen has uploaded a new patch set (#36). ( 
http://gerrit.cloudera.org:8080/16220 )

Change subject: IMPALA-9989 Improve admission control pool stats logging
..

IMPALA-9989 Improve admission control pool stats logging

This work addresses the current limitation in admission controller by
appending the last known memory consumption statistics about a pool
to the existing memory exhaustion message. The statistics is
logged in impalad.INFO when a query is queued or timed out due to
memory pressure in the pool or on the host. The statistics can also be
part of the query profile.

The BNF of the new memory consumption statistics is as follows.

  topN_query_stats ::=
queries: a list of query Ids and memory consumed for up to 5 queries
 with top memory consumptions
total_consumed: total memory consumed by these topN queries
fraction_of_pool_total_mem: total memory consumed divided
  by pool memory usage (if
  feasible to report)

  all_query_stats ::=
num_running: the total number of queries running
min: the minimal memory consumption of all running queries
max: the maximal memory consumption of all running queries
pool_total_mem: the total memory consumption of all running queries
average: the average memory consumption of all running queries
 (if feasible to report)

  pool_stats ::=
  ":"
 
 

  stats_on_host ::=
 "Stats for host " 
 List of 

  aggregated_pool_stats ::=
"Aggregated stats for pool " 


  memory_consumption_statistics ::=
  | 

The stats_on_host describes memory consumption for every pool on
a host and is useful in analyzing memory exhaustion on that host.
The aggregated_pool_stats describes the aggregated memory consumption
on all hosts for a pool for a set of queries and is useful in analyzing
memory exhaustion in that pool.

Example of stats_on_host for pool root.queueB and root.queueC on
host host1:25000.

Stats for host host1:25000
   pool_name=root.queueB:
  topN_query_stats:
 queries=[
id=0001:0004, consumed=20.00 MB,
id=0001:0003, consumed=19.00 MB,
id=0001:0002, consumed=8.00 MB
 ],
 total_consumed=47.00 MB
 fraction_of_pool_total_mem=0.47
  all_query_stats:
 num_running=4,
 min=5.00 MB,
 max=20.00 MB,
 pool_total_mem=100.00 MB,
 average=25.00 MB
   pool_name=root.queueC:
  topN_query_stats:
 queries=[
id=0002:, consumed=18.00 MB,
id=0002:0001, consumed=12.00 MB
 ],
 total_consumed=30.00 MB
 fraction_of_pool_total_mem=0.06
  all_query_stats:
 num_running=40,
 min=10.00 MB,
 max=200.00 MB,
 pool_total_mem=500.00 MB,
 average=12.50 MB

Example of aggregated_pool_stats over all hosts for pool root.queueC:

Aggregated stats for pool root.queueC:
   topN_query_stats:
  queries=[
 id=0002:0001, consumed=32.00 MB,
 id=0002:0004, consumed=26.00 MB,
 id=0002:, consumed=21.00 MB,
 id=0002:0002, consumed=17.00 MB,
 id=0002:000e, consumed=9.00 MB
  ],
  total_consumed=105.00 MB
  fraction_of_pool_total_mem=0.82

When a query request is queued due to memory exhaustion, the above
memory_consumption_statistics is logged when the logging is set
at level 2 or higher.

When a query request is timed out due to memory exhaustion, the above
memory_consumption_statistics is reported when the logging is set
at level 1 or higher.

Testing:
1. Added a new test TopNQueryCheck in admission-controller-test.cc to
   verify that the topN query memory consumption details are reported
   correctly.
2. Add two new tests in test_admission_controller.py to simulate
   queries being queued and then timed out due to pool or host memory
   pressure.
3. Added a new test TopN in mem-tracker-test.cc to
   verify that the topN query memory consumption details are computed
   correctly from a mem tracker hierarchy.
4. Ran Core tests successfully.

Change-Id: Id995a9d044082c3b8f044e1ec25bb4c64347f781
---
M be/src/runtime/mem-tracker-test.cc
M be/src/runtime/mem-tracker.cc
M be/src/runtime/mem-tracker.h
M be/src/scheduling/admission-controller-test.cc
M be/src/scheduling/admission-controller.cc
M be/src/scheduling/admission-controller.h
M be/src/util/container-util.h
M common/thrift/StatestoreService.thrift
M common/thrift/generate_error_codes.py
M tests/custom_cluster/test_admission_controller.py
10 files changed, 914 insertions(+), 47 

[Impala-ASF-CR] IMPALA-7658: Proper codegen for HiveUdfCall

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16314 )

Change subject: IMPALA-7658: Proper codegen for HiveUdfCall
..


Patch Set 7:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/6927/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16314
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2f994dac550f297ed3c88491816403f237d4d747
Gerrit-Change-Number: 16314
Gerrit-PatchSet: 7
Gerrit-Owner: Daniel Becker 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 09:45:13 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7658: Proper codegen for HiveUdfCall

2020-08-14 Thread Daniel Becker (Code Review)
Daniel Becker has uploaded a new patch set (#7). ( 
http://gerrit.cloudera.org:8080/16314 )

Change subject: IMPALA-7658: Proper codegen for HiveUdfCall
..

IMPALA-7658: Proper codegen for HiveUdfCall

Implementing codegen for HiveUdfCall.

TODO: Testing
TODO: Benchmarks

Change-Id: I2f994dac550f297ed3c88491816403f237d4d747
---
M be/src/codegen/gen_ir_descriptions.py
M be/src/codegen/impala-ir.cc
M be/src/codegen/llvm-codegen.cc
M be/src/codegen/llvm-codegen.h
M be/src/exprs/CMakeLists.txt
A be/src/exprs/hive-udf-call-ir.cc
M be/src/exprs/hive-udf-call.cc
M be/src/exprs/hive-udf-call.h
8 files changed, 471 insertions(+), 39 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/14/16314/7
--
To view, visit http://gerrit.cloudera.org:8080/16314
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I2f994dac550f297ed3c88491816403f237d4d747
Gerrit-Change-Number: 16314
Gerrit-PatchSet: 7
Gerrit-Owner: Daniel Becker 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-7658: Proper codegen for HiveUdfCall

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16314 )

Change subject: IMPALA-7658: Proper codegen for HiveUdfCall
..


Patch Set 6:

Build Failed

https://jenkins.impala.io/job/gerrit-code-review-checks/6926/ : Initial code 
review checks failed. See linked job for details on the failure.


--
To view, visit http://gerrit.cloudera.org:8080/16314
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2f994dac550f297ed3c88491816403f237d4d747
Gerrit-Change-Number: 16314
Gerrit-PatchSet: 6
Gerrit-Owner: Daniel Becker 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Fri, 14 Aug 2020 08:25:17 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7658: Proper codegen for HiveUdfCall

2020-08-14 Thread Daniel Becker (Code Review)
Daniel Becker has uploaded a new patch set (#6). ( 
http://gerrit.cloudera.org:8080/16314 )

Change subject: IMPALA-7658: Proper codegen for HiveUdfCall
..

IMPALA-7658: Proper codegen for HiveUdfCall

Implementing codegen for HiveUdfCall.

TODO: Testing
TODO: Benchmarks

Change-Id: I2f994dac550f297ed3c88491816403f237d4d747
---
M be/src/codegen/gen_ir_descriptions.py
M be/src/codegen/impala-ir.cc
M be/src/codegen/llvm-codegen.cc
M be/src/codegen/llvm-codegen.h
M be/src/exprs/CMakeLists.txt
A be/src/exprs/hive-udf-call-ir.cc
M be/src/exprs/hive-udf-call.cc
M be/src/exprs/hive-udf-call.h
8 files changed, 470 insertions(+), 39 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/14/16314/6
--
To view, visit http://gerrit.cloudera.org:8080/16314
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I2f994dac550f297ed3c88491816403f237d4d747
Gerrit-Change-Number: 16314
Gerrit-PatchSet: 6
Gerrit-Owner: Daniel Becker 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-5022: Outer join simplification

2020-08-14 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16266 )

Change subject: IMPALA-5022: Outer join simplification
..


Patch Set 8:

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/6291/


--
To view, visit http://gerrit.cloudera.org:8080/16266
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iaa7804033fac68e93f33c387dc68ef67f803e93e
Gerrit-Change-Number: 16266
Gerrit-PatchSet: 8
Gerrit-Owner: Xianqing He 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Shant Hovsepian 
Gerrit-Reviewer: Xianqing He 
Gerrit-Comment-Date: Fri, 14 Aug 2020 06:41:40 +
Gerrit-HasComments: No