[Impala-ASF-CR] IMPALA-7942 (part 2): Add query hints for predicate selectivities

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18023 )

Change subject: IMPALA-7942 (part 2): Add query hints for predicate 
selectivities
..


Patch Set 12:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/12386/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18023
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2776b9bbd878b8a21d9c866b400140a454f59e1b
Gerrit-Change-Number: 18023
Gerrit-PatchSet: 12
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Amogh Margoor 
Gerrit-Reviewer: Fucun Chu 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Thu, 16 Feb 2023 06:43:10 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7942 (part 2): Add query hints for predicate selectivities

2023-02-15 Thread wangsheng (Code Review)
wangsheng has uploaded a new patch set (#12). ( 
http://gerrit.cloudera.org:8080/18023 )

Change subject: IMPALA-7942 (part 2): Add query hints for predicate 
selectivities
..

IMPALA-7942 (part 2): Add query hints for predicate selectivities

Currently, Impala only uses simple estimation to compute selectivity
for some predicates, and this may lead to worse query plan due to CBO.
Hence, we add new hints to reduce such errors. Maybe in the future,
we can use histograms to get more precise query plan.

This patch adds another query hints: 'SELECTIVITY', we can use this
hint to original selectivity computing.

Format like this:

  select col from t where (a=1) /* +SELECTIVITY(0.5) */;

Besides, this hint is also valid for compound predicate like this:

  select col from t where (a=1 and b=2) /* +SELECTIVITY(0.5) */;

But pay attention, if we want to use 'SELECTIVITY' hint for predicate,
we need to wrap the predicate by braket, even for single binary
predicate.

Testing:
- Added new fe tests in 'PlannerTest'
- Added new fe tests in 'AnalyzeStmtsTest' for negative cases

Change-Id: I2776b9bbd878b8a21d9c866b400140a454f59e1b
---
M fe/src/main/cup/sql-parser.cup
M fe/src/main/java/org/apache/impala/analysis/BinaryPredicate.java
M fe/src/main/java/org/apache/impala/analysis/CompoundPredicate.java
M fe/src/main/java/org/apache/impala/analysis/Expr.java
M fe/src/main/java/org/apache/impala/analysis/InPredicate.java
M fe/src/main/java/org/apache/impala/analysis/IsNullPredicate.java
M fe/src/main/java/org/apache/impala/analysis/Predicate.java
M fe/src/main/java/org/apache/impala/rewrite/BetweenToCompoundRule.java
M fe/src/main/jflex/sql-scanner.flex
M fe/src/test/java/org/apache/impala/analysis/AnalyzeStmtsTest.java
M fe/src/test/java/org/apache/impala/planner/PlannerTest.java
A 
testdata/workloads/functional-planner/queries/PlannerTest/predicate-selectivity-hint.test
12 files changed, 334 insertions(+), 5 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/23/18023/12
--
To view, visit http://gerrit.cloudera.org:8080/18023
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I2776b9bbd878b8a21d9c866b400140a454f59e1b
Gerrit-Change-Number: 18023
Gerrit-PatchSet: 12
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Amogh Margoor 
Gerrit-Reviewer: Fucun Chu 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 


[Impala-ASF-CR] IMPALA-4052: CREATE TABLE LIKE for Kudu tables

2023-02-15 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18729 )

Change subject: IMPALA-4052: CREATE TABLE LIKE for Kudu tables
..


Patch Set 34: Code-Review+1


--
To view, visit http://gerrit.cloudera.org:8080/18729
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia3d276a6465301dbcfed17bb713aca06367d9a42
Gerrit-Change-Number: 18729
Gerrit-PatchSet: 34
Gerrit-Owner: Xiaoqing Gao 
Gerrit-Reviewer: Alexey Serbin 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Reviewer: Xiaoqing Gao 
Gerrit-Comment-Date: Thu, 16 Feb 2023 05:31:18 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11920: Support spill to HDFS address by service ID

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19496 )

Change subject: IMPALA-11920: Support spill to HDFS address by service ID
..


Patch Set 5: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/19496
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie069cba211df85fe90d174900b20a26fcc1f7167
Gerrit-Change-Number: 19496
Gerrit-PatchSet: 5
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Smith 
Gerrit-Reviewer: Yida Wu 
Gerrit-Comment-Date: Thu, 16 Feb 2023 04:46:38 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4052: CREATE TABLE LIKE for Kudu tables

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18729 )

Change subject: IMPALA-4052: CREATE TABLE LIKE for Kudu tables
..


Patch Set 34:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/12385/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18729
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia3d276a6465301dbcfed17bb713aca06367d9a42
Gerrit-Change-Number: 18729
Gerrit-PatchSet: 34
Gerrit-Owner: Xiaoqing Gao 
Gerrit-Reviewer: Alexey Serbin 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Reviewer: Xiaoqing Gao 
Gerrit-Comment-Date: Thu, 16 Feb 2023 04:30:54 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10408: Support build using Apache components

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18977 )

Change subject: IMPALA-10408: Support build using Apache components
..


Patch Set 7: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/18977
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8730dd182b367c9daa94303937ad249db72b1399
Gerrit-Change-Number: 18977
Gerrit-PatchSet: 7
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Thu, 16 Feb 2023 04:24:00 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4052: CREATE TABLE LIKE for Kudu tables

2023-02-15 Thread Xiaoqing Gao (Code Review)
Xiaoqing Gao has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18729 )

Change subject: IMPALA-4052: CREATE TABLE LIKE for Kudu tables
..


Patch Set 33:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/18729/33/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/18729/33/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@3934
PS33, Line 3934: assigned
> nit: created
Done


http://gerrit.cloudera.org:8080/#/c/18729/33/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@3940
PS33, Line 3940: List primaryColumnNames = new ArrayList<>();
   : for (String primaryColumnName : 
kuduTable.getPrimaryKeyColumnNames()) {
   :   if 
(KuduUtil.getAutoIncrementingColumnName().equals(primaryColumnName)) continue;
   :   primaryColumnNames.add(primaryColumnName);
   : }
> nit: a little optimization to avoid checking auto-incrementing column for t
Done



--
To view, visit http://gerrit.cloudera.org:8080/18729
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia3d276a6465301dbcfed17bb713aca06367d9a42
Gerrit-Change-Number: 18729
Gerrit-PatchSet: 33
Gerrit-Owner: Xiaoqing Gao 
Gerrit-Reviewer: Alexey Serbin 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Reviewer: Xiaoqing Gao 
Gerrit-Comment-Date: Thu, 16 Feb 2023 04:12:53 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-4052: CREATE TABLE LIKE for Kudu tables

2023-02-15 Thread Xiaoqing Gao (Code Review)
Hello Quanlong Huang, Kurt Deschler, Alexey Serbin, Wenzhe Zhou, Impala Public 
Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18729

to look at the new patch set (#34).

Change subject: IMPALA-4052: CREATE TABLE LIKE for Kudu tables
..

IMPALA-4052: CREATE TABLE LIKE for Kudu tables

This commit implements cloning between Kudu tables, including clone the
schema and hash partitions. But there is one limitation, cloning of
Kudu tables with range paritions is not supported. For cloning range
partitions, it's tracked by IMPALA-11912.

Cloning Kudu tables from other types of tables is not implemented,
because the table creation statements are different.

Testing:
 - e2e tests
 - AnalyzeDDLTest tests

Change-Id: Ia3d276a6465301dbcfed17bb713aca06367d9a42
---
M fe/src/main/java/org/apache/impala/analysis/CreateTableLikeStmt.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/main/java/org/apache/impala/util/KuduUtil.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java
M testdata/datasets/functional/functional_schema_template.sql
A 
testdata/workloads/functional-query/queries/QueryTest/kudu_create_table_like_table.test
M tests/custom_cluster/test_kudu.py
M tests/query_test/test_kudu.py
8 files changed, 232 insertions(+), 17 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/29/18729/34
--
To view, visit http://gerrit.cloudera.org:8080/18729
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ia3d276a6465301dbcfed17bb713aca06367d9a42
Gerrit-Change-Number: 18729
Gerrit-PatchSet: 34
Gerrit-Owner: Xiaoqing Gao 
Gerrit-Reviewer: Alexey Serbin 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Reviewer: Xiaoqing Gao 


[Impala-ASF-CR] IMPALA-11886: Data cache should support asynchronous writes

2023-02-15 Thread Anonymous Coward (Code Review)
18770832...@163.com has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19475 )

Change subject: IMPALA-11886: Data cache should support asynchronous writes
..


Patch Set 4:

(9 comments)

Thanks for the code review!

http://gerrit.cloudera.org:8080/#/c/19475/4//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/19475/4//COMMIT_MSG@24
PS4, Line 24: Testing:
> What circumstances show better performance?

How can I show that? Maybe post a record of the comparative tests? I have no 
relevant experience, please give me some advice.


http://gerrit.cloudera.org:8080/#/c/19475/4/be/src/runtime/io/data-cache-test.cc
File be/src/runtime/io/data-cache-test.cc:

http://gerrit.cloudera.org:8080/#/c/19475/4/be/src/runtime/io/data-cache-test.cc@106
PS4, Line 106: while (cache.current_buffer_size_.Load() != 0) continue;
> Let's add a short sleep so that this is not spinning.
Done


http://gerrit.cloudera.org:8080/#/c/19475/4/be/src/runtime/io/data-cache.h
File be/src/runtime/io/data-cache.h:

http://gerrit.cloudera.org:8080/#/c/19475/4/be/src/runtime/io/data-cache.h@271
PS4, Line 271: uint8_t* buffer_;
> I think it would be cleaner for this to be a unique_ptr.

Did you mean using unique_ptr instead of uint8_t*? I tried but did 
not reduce the amount of code, did I misunderstand something?


http://gerrit.cloudera.org:8080/#/c/19475/4/be/src/runtime/io/data-cache.h@203
PS4, Line 203:
 :   /// The key used for look up in the cache.
 :   struct CacheKey {
 :public:
 : explicit CacheKey(const string& filename, int64_t mtime, 
int64_t offset)
 :   : key_(filename.size() + sizeof(mtime) + sizeof(offset)) {
 :   DCHECK_GE(mtime, 0);
 :   DCHECK_GE(offset, 0);
 :   key_.append(, sizeof(mtime));
 :   key_.append(, sizeof(offset));
 :   key_.append(filename);
 : }
 :
 : int64_t Hash() const {
 :   return HashUtil::FastHash64(key_.data(), key_.size(), 0);
 : }
 :
 : Slice filename() const {
 :   return Slice(key_.data() + OFFSETOF_FILENAME, key_.size() 
- OFFSETOF_FILENAME);
 : }
 :
 : int64_t mtime() const {
 :   return UNALIGNED_LOAD64(key_.data() + OFFSETOF_MTIME);
 : }
 :
 : int64_t offset() const {
 :   return UNALIGNED_LOAD64(key_.data() + OFFSETOF_OFFSET);
 : }
 :
 : Slice ToSlice() const {
 :   return key_;
 : }
 :
 :private:
 : // Key encoding stored in key_:
 : //
 : //  int64_t mtime;
 : //  int64_t offset;
 : //   filename;
 : static constexpr int OFFSETOF_MTIME = 0;
 : static constexpr int OFFSETOF_OFFSET = OFFSETOF_MTIME + 
sizeof(int64_t);
 : static constexpr int OFFSETOF_FILENAME = OFFSETOF_OFFSET + 
sizeof(int64_t);
 : kudu::faststring key_;
 :   };
 :
 :   /// The class to abstruct store behavior, including copying 
the buffer and holding it
 :   /// until store complete.
 :   class StoreTask {
 :public:
 : /// Creating a store task requires the filename, mtime, 
offset that constitutes the
 : /// cache key, and the buffer and length of the cached data 
is required too. We
 : /// allocate a new buffer in the constructor and copy the 
cache data and update
 : /// total_size which keeps track of the total buffer size 
allocate by all store tasks.
 : explicit StoreTask(const std::string& filename, int64_t 
mtime, int64_t offset,
 : const uint8_t* buffer, int64_t buffer_len, AtomicInt64& 
total_size);
 :
 : /// When the store task is destroyed, the allocated buffer 
is freed and total_size is
 : /// updated.
 : ~StoreTask();
 :
 : const CacheKey& key() const { return key_; }
 : const uint8_t* buffer() const { return buffer_; }
 : int64_t buffer_len() const { return buffer_len_; }
 :
 :private:
 : DISALLOW_COPY_AND_ASSIGN(StoreTask);
 :
 : CacheKey key_;
 : uint8_t* buffer_;
 : int64_t buffer_len_;
 : AtomicInt64& total_size_;
 :   };
> Small thing: It would be nice to keep these structures defined in
 > data-cache.cc. The way to do that is to have a forward 

[Impala-ASF-CR] IMPALA-11886: Data cache should support asynchronous writes

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19475 )

Change subject: IMPALA-11886: Data cache should support asynchronous writes
..


Patch Set 5:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/12384/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19475
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I878f7486d485b6288de1a9145f49576b7155d312
Gerrit-Change-Number: 19475
Gerrit-PatchSet: 5
Gerrit-Owner: Anonymous Coward <18770832...@163.com>
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Thu, 16 Feb 2023 03:09:17 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11922 Verify JWKS URL server TLS certificate by default.

2023-02-15 Thread Alexey Serbin (Code Review)
Alexey Serbin has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19503 )

Change subject: IMPALA-11922 Verify JWKS URL server TLS certificate by default.
..


Patch Set 1:

(13 comments)

http://gerrit.cloudera.org:8080/#/c/19503/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/19503/1//COMMIT_MSG@13
PS1, Line 13: https
nit: HTTPS


http://gerrit.cloudera.org:8080/#/c/19503/1//COMMIT_MSG@46
PS1, Line 46: pem
nit: PEM


http://gerrit.cloudera.org:8080/#/c/19503/1//COMMIT_MSG@46
PS1, Line 46: certificates
Should those be CA certificates or non-CA certs (e.g., exact TLS server 
certificates without CA capability) are also accepted?


http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/kudu/util/curl_util.h
File be/src/kudu/util/curl_util.h:

http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/kudu/util/curl_util.h@74
PS1, Line 74: pem
nit: PEM


http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/kudu/util/curl_util.h@74
PS1, Line 74: certificates
Are these should be CA certificates or non-CA certs are also accepted?


http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/kudu/util/curl_util.h@75
PS1, Line 75: https
nit: HTTPS


http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/kudu/util/curl_util.cc
File be/src/kudu/util/curl_util.cc:

http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/kudu/util/curl_util.cc@122
PS1, Line 122: CHECK_EQ
Does it make sense to switch to using CURL_RETURN_NOT_OK() here instead?


http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/rpc/authentication.cc
File be/src/rpc/authentication.cc:

http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/rpc/authentication.cc@171
PS1, Line 171: jwks_insecure_tls
nit: 'jwks_insecure_tls' sounds a bit vague to me: it might be 
authentication-only TLS channel, not verifying certs on either of the sides, 
using weak ciphers for the handshake, using weak ciphers to encrypt the data 
sent over the established channel, etc.

Maybe, something like 'jwks_verify_server_tls_cert' or similar would be more 
descriptive of what this flag is actually for?


http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/util/jwt-util-internal.h
File be/src/util/jwt-util-internal.h:

http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/util/jwt-util-internal.h@374
PS1, Line 374: to trust
nit: what exactly this 'trust' covers?  Is this just to verify authenticity of 
the JWKS server's TLS certificate or the certificates in the bundle are used 
for something else as well?


http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/util/jwt-util.h
File be/src/util/jwt-util.h:

http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/util/jwt-util.h@59
PS1, Line 59: const std::string& jwks_file_path
Does it make sense to make this a parameter of one of the constructors for this 
class and have just one Init() method with the signature

  Status Init();

?


http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/util/jwt-util.h@63
PS1, Line 63: Init
ditto


http://gerrit.cloudera.org:8080/#/c/19503/1/fe/src/test/java/org/apache/impala/customcluster/JwtHttpTest.java
File fe/src/test/java/org/apache/impala/customcluster/JwtHttpTest.java:

http://gerrit.cloudera.org:8080/#/c/19503/1/fe/src/test/java/org/apache/impala/customcluster/JwtHttpTest.java@83
PS1, Line 83: the
for ?


http://gerrit.cloudera.org:8080/#/c/19503/1/fe/src/test/java/org/apache/impala/customcluster/JwtHttpTest.java@440
PS1, Line 440: webserverTLSCert
Is this certificate also has the CA capability?  If not, I'm a bit surprised a 
non-CA certificate is accepted here.

Overall, is it possible to pass here not the server's certificate as is, but 
the CA certificate that the server's cert is signed with?  I guess that would 
be the expected use case in the wild, no?



--
To view, visit http://gerrit.cloudera.org:8080/19503
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5f1e887fae39b5fb82fa9a40352e4b507b7d8d35
Gerrit-Change-Number: 19503
Gerrit-PatchSet: 1
Gerrit-Owner: Jason Fehr 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Alexey Serbin 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Thu, 16 Feb 2023 03:06:52 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11886: Data cache should support asynchronous writes

2023-02-15 Thread Anonymous Coward (Code Review)
Hello Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/19475

to look at the new patch set (#5).

Change subject: IMPALA-11886: Data cache should support asynchronous writes
..

IMPALA-11886: Data cache should support asynchronous writes

This patch implements asynchronous write to the data cache to improve
scan performance when cache miss happens.
Previously, writes to the data cache are synchronized with hdfs file
reads, and both are handled by remote hdfs IO threads. In other words,
if a cache miss occurs,  the IO thread needs to take additional
responsibility for cache writes,  which will lead to scan performance
deterioration.
This patch uses a thread pool for asynchronous writes, and the number of
threads in the pool is determined by the new configuration
'data_cache_num_write_threads'. In asynchronous write mode, the IO
thread only needs to copy data to the temporary buffer when storing data
into the data cache. The additional memory consumption caused by
temporary buffers can be limited, depending on the new configuration
'data_cache_write_buffer_limit'.

Testing:
- Add test cases for asynchronous data writing to the original
DataCacheTest using different number of threads.
- Add DataCacheTest,#OutOfWriteBufferLimit
Used to test the limit of memory consumed by temporary buffers in the
case of asynchronous writes

Change-Id: I878f7486d485b6288de1a9145f49576b7155d312
---
M be/src/runtime/io/data-cache-test.cc
M be/src/runtime/io/data-cache.cc
M be/src/runtime/io/data-cache.h
M be/src/util/impalad-metrics.cc
M be/src/util/impalad-metrics.h
M common/thrift/metrics.json
6 files changed, 366 insertions(+), 69 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/75/19475/5
--
To view, visit http://gerrit.cloudera.org:8080/19475
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I878f7486d485b6288de1a9145f49576b7155d312
Gerrit-Change-Number: 19475
Gerrit-PatchSet: 5
Gerrit-Owner: Anonymous Coward <18770832...@163.com>
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-11713: Switch to C++17

2023-02-15 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19183 )

Change subject: IMPALA-11713: Switch to C++17
..


Patch Set 5:

Rebased, will do a performance run


--
To view, visit http://gerrit.cloudera.org:8080/19183
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iadac41817fe5eaaa469a5f0e9f94056a409c14b9
Gerrit-Change-Number: 19183
Gerrit-PatchSet: 5
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Thu, 16 Feb 2023 01:29:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11922 Verify JWKS URL server TLS certificate by default.

2023-02-15 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19503 )

Change subject: IMPALA-11922 Verify JWKS URL server TLS certificate by default.
..


Patch Set 1:

(7 comments)

Thanks to work on this and added certificate verification to kudu::EasyCurl. 
curl_util.h/curl_util.cc were synced from Kudu repo, we need to port these 
changes to Kudu repo later.

http://gerrit.cloudera.org:8080/#/c/19503/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/19503/1//COMMIT_MSG@12
PS1, Line 12:
nit: one extra space


http://gerrit.cloudera.org:8080/#/c/19503/1//COMMIT_MSG@24
PS1, Line 24:
nit: one extra space


http://gerrit.cloudera.org:8080/#/c/19503/1//COMMIT_MSG@48
PS1, Line 48:
add a Testing section


http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/rpc/authentication.cc
File be/src/rpc/authentication.cc:

http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/rpc/authentication.cc@1418
PS1, Line 1418: }
Check jwks_ca_certificate is not empty if jwks_insecure_tls is set as false.


http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/util/jwt-util.h
File be/src/util/jwt-util.h:

http://gerrit.cloudera.org:8080/#/c/19503/1/be/src/util/jwt-util.h@64
PS1, Line 64: bool is_local_file
do we still need this variable?


http://gerrit.cloudera.org:8080/#/c/19503/1/fe/src/test/java/org/apache/impala/customcluster/JwtHttpTest.java
File fe/src/test/java/org/apache/impala/customcluster/JwtHttpTest.java:

http://gerrit.cloudera.org:8080/#/c/19503/1/fe/src/test/java/org/apache/impala/customcluster/JwtHttpTest.java@397
PS1, Line 397:
It's better to give a certificate which does not match the certificate returned 
from server.


http://gerrit.cloudera.org:8080/#/c/19503/1/fe/src/test/java/org/apache/impala/customcluster/JwtHttpTest.java@420
PS1, Line 420:
nit: extra spaces



--
To view, visit http://gerrit.cloudera.org:8080/19503
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5f1e887fae39b5fb82fa9a40352e4b507b7d8d35
Gerrit-Change-Number: 19503
Gerrit-PatchSet: 1
Gerrit-Owner: Jason Fehr 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Thu, 16 Feb 2023 00:05:09 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11920: [DOCS] Cleanup and update spill examples

2023-02-15 Thread Shajini Thayasingh (Code Review)
Shajini Thayasingh has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19504 )

Change subject: IMPALA-11920: [DOCS] Cleanup and update spill examples
..


Patch Set 1: Code-Review+1


--
To view, visit http://gerrit.cloudera.org:8080/19504
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I624a607bda33ab47100e1540ff1d66c8d19a7329
Gerrit-Change-Number: 19504
Gerrit-PatchSet: 1
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Smith 
Gerrit-Reviewer: Shajini Thayasingh 
Gerrit-Reviewer: Yida Wu 
Gerrit-Comment-Date: Wed, 15 Feb 2023 23:38:12 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11920: Support spill to HDFS address by service ID

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19496 )

Change subject: IMPALA-11920: Support spill to HDFS address by service ID
..


Patch Set 5:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/12383/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19496
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie069cba211df85fe90d174900b20a26fcc1f7167
Gerrit-Change-Number: 19496
Gerrit-PatchSet: 5
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Smith 
Gerrit-Reviewer: Yida Wu 
Gerrit-Comment-Date: Wed, 15 Feb 2023 23:39:47 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11920: Support spill to HDFS address by service ID

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19496 )

Change subject: IMPALA-11920: Support spill to HDFS address by service ID
..


Patch Set 5:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/9052/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/19496
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie069cba211df85fe90d174900b20a26fcc1f7167
Gerrit-Change-Number: 19496
Gerrit-PatchSet: 5
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Smith 
Gerrit-Reviewer: Yida Wu 
Gerrit-Comment-Date: Wed, 15 Feb 2023 23:34:39 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11920: [DOCS] Cleanup and update spill examples

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19504 )

Change subject: IMPALA-11920: [DOCS] Cleanup and update spill examples
..


Patch Set 1: Verified+1

Build Successful

https://jenkins.impala.io/job/gerrit-docs-auto-test/700/ : Doc tests passed.


-- 
To view, visit http://gerrit.cloudera.org:8080/19504
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I624a607bda33ab47100e1540ff1d66c8d19a7329
Gerrit-Change-Number: 19504
Gerrit-PatchSet: 1
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Smith 
Gerrit-Reviewer: Shajini Thayasingh 
Gerrit-Reviewer: Yida Wu 
Gerrit-Comment-Date: Wed, 15 Feb 2023 23:30:10 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11920: Support spill to HDFS address by service ID

2023-02-15 Thread Michael Smith (Code Review)
Michael Smith has removed Shajini Thayasingh from this change.  ( 
http://gerrit.cloudera.org:8080/19496 )

Change subject: IMPALA-11920: Support spill to HDFS address by service ID
..


Removed reviewer Shajini Thayasingh.
--
To view, visit http://gerrit.cloudera.org:8080/19496
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: deleteReviewer
Gerrit-Change-Id: Ie069cba211df85fe90d174900b20a26fcc1f7167
Gerrit-Change-Number: 19496
Gerrit-PatchSet: 5
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Smith 
Gerrit-Reviewer: Yida Wu 


[Impala-ASF-CR] IMPALA-11920: Support spill to HDFS address by service ID

2023-02-15 Thread Michael Smith (Code Review)
Hello Yida Wu, Shajini Thayasingh, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/19496

to look at the new patch set (#5).

Change subject: IMPALA-11920: Support spill to HDFS address by service ID
..

IMPALA-11920: Support spill to HDFS address by service ID

Allows addressing HDFS (and Ozone) filesystems in `scratch_dirs` by a
service identifier that doesn't include a port number. Examples
- "hdfs://hdfs1/:10G" uses the root directory of HDFS with a 10G limit
- "ofs://ozone1/tmp::" uses /tmp in Ozone with default limit/priority

Updates `scratch_dirs` parsing to allow whitespace after each specifier,
as in "hfs://hdfs1/ , /tmp". This is unambiguous and avoids failures for
simple mistakes.

Testing:
- new backend test cases run with HDFS and Ozone
- manually tested that Impala starts with
  --impalad_args=--scratch_dirs=ofs://localhost/tmp,/tmp
  creates impala-scratch in both locations

Change-Id: Ie069cba211df85fe90d174900b20a26fcc1f7167
---
M be/src/runtime/tmp-file-mgr-test.cc
M be/src/runtime/tmp-file-mgr.cc
2 files changed, 57 insertions(+), 34 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/96/19496/5
--
To view, visit http://gerrit.cloudera.org:8080/19496
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ie069cba211df85fe90d174900b20a26fcc1f7167
Gerrit-Change-Number: 19496
Gerrit-PatchSet: 5
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Smith 
Gerrit-Reviewer: Shajini Thayasingh 
Gerrit-Reviewer: Yida Wu 


[Impala-ASF-CR] IMPALA-11920: [DOCS] Cleanup and update spill examples

2023-02-15 Thread Michael Smith (Code Review)
Michael Smith has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/19504


Change subject: IMPALA-11920: [DOCS] Cleanup and update spill examples
..

IMPALA-11920: [DOCS] Cleanup and update spill examples

Updates documentation to include examples with service identifier. Also
fixes inconsistent use of ASCII quotes for example text, highlighting
code and variable names, and normalizes descriptions between
S3/HDFS/Ozone. Removes "priority" from remote descriptions as it is
optional and does nothing.

Change-Id: I624a607bda33ab47100e1540ff1d66c8d19a7329
---
M docs/topics/impala_disk_space.xml
1 file changed, 96 insertions(+), 57 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/04/19504/1
--
To view, visit http://gerrit.cloudera.org:8080/19504
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I624a607bda33ab47100e1540ff1d66c8d19a7329
Gerrit-Change-Number: 19504
Gerrit-PatchSet: 1
Gerrit-Owner: Michael Smith 


[Impala-ASF-CR] IMPALA-11920: Support spill to HDFS address by service ID

2023-02-15 Thread Michael Smith (Code Review)
Michael Smith has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19496 )

Change subject: IMPALA-11920: Support spill to HDFS address by service ID
..


Patch Set 4:

I think putting docs in the same patch as code changes is confusing the Jenkins 
auto-review jobs. Should I split them up?


--
To view, visit http://gerrit.cloudera.org:8080/19496
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie069cba211df85fe90d174900b20a26fcc1f7167
Gerrit-Change-Number: 19496
Gerrit-PatchSet: 4
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Smith 
Gerrit-Reviewer: Shajini Thayasingh 
Gerrit-Reviewer: Yida Wu 
Gerrit-Comment-Date: Wed, 15 Feb 2023 23:17:11 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11920: [DOCS] Cleanup and update spill examples

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19504 )

Change subject: IMPALA-11920: [DOCS] Cleanup and update spill examples
..


Patch Set 1:

Build Started https://jenkins.impala.io/job/gerrit-docs-auto-test/700/

Testing docs change - this change appears to modify docs/ and no code. This is 
experimental - please report any issues to tarmstr...@cloudera.com or on this 
JIRA: IMPALA-7317


--
To view, visit http://gerrit.cloudera.org:8080/19504
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I624a607bda33ab47100e1540ff1d66c8d19a7329
Gerrit-Change-Number: 19504
Gerrit-PatchSet: 1
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 15 Feb 2023 23:19:29 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11922 Verify JWKS URL server TLS certificate by default.

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19503 )

Change subject: IMPALA-11922 Verify JWKS URL server TLS certificate by default.
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/12382/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19503
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5f1e887fae39b5fb82fa9a40352e4b507b7d8d35
Gerrit-Change-Number: 19503
Gerrit-PatchSet: 1
Gerrit-Owner: Jason Fehr 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Wed, 15 Feb 2023 23:14:44 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10408: Support build using Apache components

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18977 )

Change subject: IMPALA-10408: Support build using Apache components
..


Patch Set 7:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/9051/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/18977
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8730dd182b367c9daa94303937ad249db72b1399
Gerrit-Change-Number: 18977
Gerrit-PatchSet: 7
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Wed, 15 Feb 2023 23:14:17 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11922 Verify JWKS URL server TLS certificate by default.

2023-02-15 Thread Jason Fehr (Code Review)
Jason Fehr has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/19503


Change subject: IMPALA-11922 Verify JWKS URL server TLS certificate by default.
..

IMPALA-11922 Verify JWKS URL server TLS certificate by default.

 BREAKING CHANGE 

JWT Auth has an option to specify the location of the
JSON Web Key Set (JWKS) using a URL.  If that URL is
accessed over https, the TLS certificate presented by the
server is not verified.

This means that Impala only requires the server to return
a TLS certificate, whether or not Impala trusts the signing
certificate chain.

The implications of this setup is that a fully secure chain
of trust cannot be established throughout the entire JWT
authentication lifecycle and thus creates an attack vector
where a bad actor could trick Impala into trusting an
actor-controlled JWKS.  The bad actor can then generate
a JWT with any claims they chose and Impala will accept it.

This change introduces:
  1. verification of JWKS server TLS certificate by default
  2. jwks_insecure_tls Impala startup flag
  3. jwks_ca_certificate Impala startup flag

1. While previously, the JWKS URL was always called without
   verifying its TLS certificate, the default is to now to
   verify that cert.  Thus, any cases where the JWKS was
   retrieved from an untrusted URL will now cause Impala
   to fail to start.

2. The new flag jwks_insecure_tls controls whether or not
   Impala verifies the TLS certificate presented by the
   JWKS server.  It defaults to "false" meaning that the
   certificate will be verified.  Setting this value to
   "true" will restore the previous behavior where
   untrusted TLS certificates are accepted.

3. The new flag jwks_ca_certificate enables specifying
   a pem bundle of certificates to trust when calling to
   the JWKS URL.

Change-Id: I5f1e887fae39b5fb82fa9a40352e4b507b7d8d35
---
M be/src/kudu/util/curl_util.cc
M be/src/kudu/util/curl_util.h
M be/src/rpc/authentication.cc
M be/src/service/impala-server.cc
M be/src/util/jwt-util-internal.h
M be/src/util/jwt-util-test.cc
M be/src/util/jwt-util.cc
M be/src/util/jwt-util.h
M fe/src/test/java/org/apache/impala/customcluster/JwtHttpTest.java
9 files changed, 200 insertions(+), 56 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/03/19503/1
--
To view, visit http://gerrit.cloudera.org:8080/19503
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I5f1e887fae39b5fb82fa9a40352e4b507b7d8d35
Gerrit-Change-Number: 19503
Gerrit-PatchSet: 1
Gerrit-Owner: Jason Fehr 


[Impala-ASF-CR] IMPALA-11604 Planner changes for CPU usage

2023-02-15 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19033 )

Change subject: IMPALA-11604 Planner changes for CPU usage
..


Patch Set 48:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/19033/48/fe/src/main/java/org/apache/impala/planner/Planner.java
File fe/src/main/java/org/apache/impala/planner/Planner.java:

http://gerrit.cloudera.org:8080/#/c/19033/48/fe/src/main/java/org/apache/impala/planner/Planner.java@470
PS48, Line 470: blockingAwareCost
nit: this result seems not used now. Add "TODO" comment



--
To view, visit http://gerrit.cloudera.org:8080/19033
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If32dc770dfffcdd0be2ba789a7720952c68a
Gerrit-Change-Number: 19033
Gerrit-PatchSet: 48
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Wed, 15 Feb 2023 21:00:49 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10408: Support build using Apache components

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18977 )

Change subject: IMPALA-10408: Support build using Apache components
..


Patch Set 7:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/12381/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18977
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8730dd182b367c9daa94303937ad249db72b1399
Gerrit-Change-Number: 18977
Gerrit-PatchSet: 7
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Wed, 15 Feb 2023 20:11:55 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10408: Support build using Apache components

2023-02-15 Thread Anonymous Coward (Code Review)
eubn...@gmail.com has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18977 )

Change subject: IMPALA-10408: Support build using Apache components
..


Patch Set 7:

I didn't notice that "commons-io.version" has been missing since IMPALA-11670.
(https://github.com/apache/impala/commit/83c5e6e4098d8ed75de09a7e228d6ef10de2ee12)
I fixed this error by adding it.


--
To view, visit http://gerrit.cloudera.org:8080/18977
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8730dd182b367c9daa94303937ad249db72b1399
Gerrit-Change-Number: 18977
Gerrit-PatchSet: 7
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Wed, 15 Feb 2023 20:09:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10408: Support build using Apache components

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18977 )

Change subject: IMPALA-10408: Support build using Apache components
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/12380/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18977
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8730dd182b367c9daa94303937ad249db72b1399
Gerrit-Change-Number: 18977
Gerrit-PatchSet: 6
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Wed, 15 Feb 2023 20:04:49 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10408: Support build using Apache components

2023-02-15 Thread Anonymous Coward (Code Review)
eubn...@gmail.com has uploaded a new patch set (#7). ( 
http://gerrit.cloudera.org:8080/18977 )

Change subject: IMPALA-10408: Support build using Apache components
..

IMPALA-10408: Support build using Apache components

Change-Id: I8730dd182b367c9daa94303937ad249db72b1399
---
M README-build.md
M bin/bootstrap_toolchain.py
M bin/impala-config.sh
M buildall.sh
M fe/pom.xml
M 
fe/src/main/java/org/apache/impala/authorization/ranger/RangerAuthorizationChecker.java
M 
fe/src/main/java/org/apache/impala/authorization/ranger/RangerImpaladAuthorizationManager.java
M java/TableFlattener/pom.xml
M java/pom.xml
9 files changed, 157 insertions(+), 47 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/77/18977/7
--
To view, visit http://gerrit.cloudera.org:8080/18977
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8730dd182b367c9daa94303937ad249db72b1399
Gerrit-Change-Number: 18977
Gerrit-PatchSet: 7
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 


[Impala-ASF-CR] IMPALA-10408: Support build using Apache components

2023-02-15 Thread Anonymous Coward (Code Review)
eubn...@gmail.com has uploaded a new patch set (#6). ( 
http://gerrit.cloudera.org:8080/18977 )

Change subject: IMPALA-10408: Support build using Apache components
..

IMPALA-10408: Support build using Apache components

Change-Id: I8730dd182b367c9daa94303937ad249db72b1399
---
M README-build.md
M bin/bootstrap_toolchain.py
M bin/impala-config.sh
M buildall.sh
M fe/pom.xml
M 
fe/src/main/java/org/apache/impala/authorization/ranger/RangerAuthorizationChecker.java
M 
fe/src/main/java/org/apache/impala/authorization/ranger/RangerImpaladAuthorizationManager.java
M java/TableFlattener/pom.xml
M java/pom.xml
9 files changed, 157 insertions(+), 47 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/77/18977/6
--
To view, visit http://gerrit.cloudera.org:8080/18977
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8730dd182b367c9daa94303937ad249db72b1399
Gerrit-Change-Number: 18977
Gerrit-PatchSet: 6
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 


[Impala-ASF-CR] IMPALA-10408: Support build using Apache components

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18977 )

Change subject: IMPALA-10408: Support build using Apache components
..


Patch Set 6:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18977/6/bin/bootstrap_toolchain.py
File bin/bootstrap_toolchain.py:

http://gerrit.cloudera.org:8080/#/c/18977/6/bin/bootstrap_toolchain.py@614
PS6, Line 614: o
flake8: W504 line break after binary operator



--
To view, visit http://gerrit.cloudera.org:8080/18977
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8730dd182b367c9daa94303937ad249db72b1399
Gerrit-Change-Number: 18977
Gerrit-PatchSet: 6
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Wed, 15 Feb 2023 19:45:13 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11604 Planner changes for CPU usage

2023-02-15 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19033 )

Change subject: IMPALA-11604 Planner changes for CPU usage
..


Patch Set 48:

(4 comments)

http://gerrit.cloudera.org:8080/#/c/19033/48//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/19033/48//COMMIT_MSG@325
PS48, Line 325: IMPALA-2805
As the comments in 
https://github.com/apache/impala/blob/master/fe/src/main/java/org/apache/impala/analysis/Expr.java#L79-L81,
 it seems relative costs defined in IMPALA-2805 are not accurate. We may need 
to tune the numbers a little.


http://gerrit.cloudera.org:8080/#/c/19033/48//COMMIT_MSG@346
PS48, Line 346: Testing:
Could you attach the bench mark which show effective parallelism improvement?


http://gerrit.cloudera.org:8080/#/c/19033/43/fe/src/main/java/org/apache/impala/planner/ExchangeNode.java
File fe/src/main/java/org/apache/impala/planner/ExchangeNode.java:

http://gerrit.cloudera.org:8080/#/c/19033/43/fe/src/main/java/org/apache/impala/planner/ExchangeNode.java@263
PS43, Line 263: return deferredBatchQueueSize;
> I intended this to be a serialization/deserialization cost per row.
This assume the total cost for a row batch is 1. Is it right estimation?


http://gerrit.cloudera.org:8080/#/c/19033/48/fe/src/main/java/org/apache/impala/planner/ScanNode.java
File fe/src/main/java/org/apache/impala/planner/ScanNode.java:

http://gerrit.cloudera.org:8080/#/c/19033/48/fe/src/main/java/org/apache/impala/planner/ScanNode.java@359
PS48, Line 359: 1.0f / getRowBatchSize(queryOptions);
In ExchangeNode.estimateProcessingCostPerRow(), the cost per row is calculated 
as 1 / (getRowBatchSize(queryOptions) / avg-row-size). Show we do same?



--
To view, visit http://gerrit.cloudera.org:8080/19033
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If32dc770dfffcdd0be2ba789a7720952c68a
Gerrit-Change-Number: 19033
Gerrit-PatchSet: 48
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Wed, 15 Feb 2023 19:39:52 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11911: Fix NULL argument handling in Hive GenericUDFs

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19499 )

Change subject: IMPALA-11911: Fix NULL argument handling in Hive GenericUDFs
..


Patch Set 3: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/19499
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I53238612f4037572abb6d2cc913dd74ee830a9c9
Gerrit-Change-Number: 19499
Gerrit-PatchSet: 3
Gerrit-Owner: Csaba Ringhofer 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Peter Rozsa 
Gerrit-Reviewer: Steve Carlin 
Gerrit-Comment-Date: Wed, 15 Feb 2023 19:22:15 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4052: CREATE TABLE LIKE for Kudu tables

2023-02-15 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18729 )

Change subject: IMPALA-4052: CREATE TABLE LIKE for Kudu tables
..


Patch Set 33:

(2 comments)

Thanks to fix the issue for non unique primary key. It looks good to me, only a 
few nits.

http://gerrit.cloudera.org:8080/#/c/18729/33/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/18729/33/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@3934
PS33, Line 3934: assigned
nit: created


http://gerrit.cloudera.org:8080/#/c/18729/33/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@3940
PS33, Line 3940: List primaryColumnNames = new ArrayList<>();
   : for (String primaryColumnName : 
kuduTable.getPrimaryKeyColumnNames()) {
   :   if 
(KuduUtil.getAutoIncrementingColumnName().equals(primaryColumnName)) continue;
   :   primaryColumnNames.add(primaryColumnName);
   : }
nit: a little optimization to avoid checking auto-incrementing column for table 
with unique key.
List primaryColumnNames = new 
ArrayList(kuduTable.getPrimaryKeyColumnNames());
if (kuduTable.hasAutoIncrementingColumn()) {
  primaryColumnNames.remove(KuduUtil.getAutoIncrementingColumnName());
}



--
To view, visit http://gerrit.cloudera.org:8080/18729
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia3d276a6465301dbcfed17bb713aca06367d9a42
Gerrit-Change-Number: 18729
Gerrit-PatchSet: 33
Gerrit-Owner: Xiaoqing Gao 
Gerrit-Reviewer: Alexey Serbin 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Reviewer: Xiaoqing Gao 
Gerrit-Comment-Date: Wed, 15 Feb 2023 18:20:23 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11920: Support spill to HDFS address by service ID

2023-02-15 Thread Michael Smith (Code Review)
Hello Yida Wu, Shajini Thayasingh, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/19496

to look at the new patch set (#4).

Change subject: IMPALA-11920: Support spill to HDFS address by service ID
..

IMPALA-11920: Support spill to HDFS address by service ID

Allows addressing HDFS (and Ozone) filesystems in `scratch_dirs` by a
service identifier that doesn't include a port number. Examples
- "hdfs://hdfs1/:10G" uses the root directory of HDFS with a 10G limit
- "ofs://ozone1/tmp::" uses /tmp in Ozone with default limit/priority

Updates `scratch_dirs` parsing to allow whitespace after each specifier,
as in "hfs://hdfs1/ , /tmp". This is unambiguous and avoids failures for
simple mistakes.

Updates documentation to include examples with service identifier. Also
fixes inconsistent use of ASCII quotes for example text, highlighting
code and variable names, and normalizes descriptions between
S3/HDFS/Ozone. Removes "priority" from remote descriptions as it is
optional and does nothing.

Testing:
- new backend test cases run with HDFS and Ozone
- manually tested that Impala starts with
  --impalad_args=--scratch_dirs=ofs://localhost/tmp,/tmp
  creates impala-scratch in both locations

Change-Id: Ie069cba211df85fe90d174900b20a26fcc1f7167
---
M be/src/runtime/tmp-file-mgr-test.cc
M be/src/runtime/tmp-file-mgr.cc
M docs/topics/impala_disk_space.xml
3 files changed, 153 insertions(+), 91 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/96/19496/4
--
To view, visit http://gerrit.cloudera.org:8080/19496
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ie069cba211df85fe90d174900b20a26fcc1f7167
Gerrit-Change-Number: 19496
Gerrit-PatchSet: 4
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Smith 
Gerrit-Reviewer: Shajini Thayasingh 
Gerrit-Reviewer: Yida Wu 


[Impala-ASF-CR] IMPALA-11920: Support spill to HDFS address by service ID

2023-02-15 Thread Michael Smith (Code Review)
Michael Smith has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19496 )

Change subject: IMPALA-11920: Support spill to HDFS address by service ID
..


Patch Set 3:

(5 comments)

http://gerrit.cloudera.org:8080/#/c/19496/3//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/19496/3//COMMIT_MSG@17
PS3, Line 17: priority
> Just to confirm that the priority is removed in the document for the remote
Yes, it'll still be parsed in the code.


http://gerrit.cloudera.org:8080/#/c/19496/3/be/src/runtime/tmp-file-mgr-test.cc
File be/src/runtime/tmp-file-mgr-test.cc:

http://gerrit.cloudera.org:8080/#/c/19496/3/be/src/runtime/tmp-file-mgr-test.cc@2240
PS3, Line 2240: hdfs
> Maybe it is not related, but when I test with a space behind the url like -
Ok. That happens because 
https://github.com/apache/impala/blob/feb4a76ed4cb5b688143eb21370f78ec93133c56/be/src/runtime/tmp-file-mgr.cc#L290
 only does trim left. trim_copy would be fine there.


http://gerrit.cloudera.org:8080/#/c/19496/3/docs/topics/impala_disk_space.xml
File docs/topics/impala_disk_space.xml:

http://gerrit.cloudera.org:8080/#/c/19496/3/docs/topics/impala_disk_space.xml@298
PS3, Line 298: /dir1::0
> The alignment looks weird compared to the previous version, does this part
It fixes the indentation in 
https://impala.apache.org/docs/build/asf-site-html/topics/impala_disk_space.html.
 All whitespace is included in a codeblock, and having the examples indented 
looks wrong to me.


http://gerrit.cloudera.org:8080/#/c/19496/3/docs/topics/impala_disk_space.xml@486
PS3, Line 486: ozone1
> Just a question that what is the difference between Ozone service identifie
Ozone Manager might be addressed via a hostname. 
https://ci-hadoop.apache.org/view/Hadoop%20Ozone/job/ozone-doc-master/lastSuccessfulBuild/artifact/hadoop-hdds/docs/public/feature/om-ha.html
 discusses service IDs for an HA deployment.


http://gerrit.cloudera.org:8080/#/c/19496/3/docs/topics/impala_disk_space.xml@488
PS3, Line 488:
> nit. looks good to have a comma here
Ack



--
To view, visit http://gerrit.cloudera.org:8080/19496
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie069cba211df85fe90d174900b20a26fcc1f7167
Gerrit-Change-Number: 19496
Gerrit-PatchSet: 3
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Smith 
Gerrit-Reviewer: Shajini Thayasingh 
Gerrit-Reviewer: Yida Wu 
Gerrit-Comment-Date: Wed, 15 Feb 2023 17:50:06 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11802: Optimize count(*) queries for Iceberg V2 position delete tables

2023-02-15 Thread Zoltan Borok-Nagy (Code Review)
Zoltan Borok-Nagy has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19494 )

Change subject: IMPALA-11802: Optimize count(*) queries for Iceberg V2 position 
delete tables
..


Patch Set 2: Code-Review+1

(2 comments)

Awesome work, Penglin! Thank you so much?

http://gerrit.cloudera.org:8080/#/c/19494/2//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/19494/2//COMMIT_MSG@31
PS2, Line 31: ArithmeticExpr(ADD)
Nice! This plan is much better than the one that I proposed in the description!


http://gerrit.cloudera.org:8080/#/c/19494/2//COMMIT_MSG@44
PS2, Line 44: Testing:
Could you please add a planner test for

 select count(*) from 
iceberg_v2_positional_not_all_data_files_have_delete_files;



--
To view, visit http://gerrit.cloudera.org:8080/19494
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8172c805121bf91d23fe063f806493afe2f03d41
Gerrit-Change-Number: 19494
Gerrit-PatchSet: 2
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Xiaoqing Gao 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Wed, 15 Feb 2023 16:21:05 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11911: Fix NULL argument handling in Hive GenericUDFs

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19499 )

Change subject: IMPALA-11911: Fix NULL argument handling in Hive GenericUDFs
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/12379/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19499
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I53238612f4037572abb6d2cc913dd74ee830a9c9
Gerrit-Change-Number: 19499
Gerrit-PatchSet: 2
Gerrit-Owner: Csaba Ringhofer 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Peter Rozsa 
Gerrit-Reviewer: Steve Carlin 
Gerrit-Comment-Date: Wed, 15 Feb 2023 14:38:55 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11911: Fix NULL argument handling in Hive GenericUDFs

2023-02-15 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19499 )

Change subject: IMPALA-11911: Fix NULL argument handling in Hive GenericUDFs
..


Patch Set 2:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/19499/1/fe/src/test/java/org/apache/impala/hive/executor/UdfExecutorTest.java
File fe/src/test/java/org/apache/impala/hive/executor/UdfExecutorTest.java:

http://gerrit.cloudera.org:8080/#/c/19499/1/fe/src/test/java/org/apache/impala/hive/executor/UdfExecutorTest.java@232
PS1, Line 232:   UdfExecutor createUdfExecutor(String jarFile, String 
udfClassPath, Type retType,
> originalArgs is missing from the Javadoc comment
Done



--
To view, visit http://gerrit.cloudera.org:8080/19499
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I53238612f4037572abb6d2cc913dd74ee830a9c9
Gerrit-Change-Number: 19499
Gerrit-PatchSet: 2
Gerrit-Owner: Csaba Ringhofer 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Peter Rozsa 
Gerrit-Reviewer: Steve Carlin 
Gerrit-Comment-Date: Wed, 15 Feb 2023 14:21:40 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11911: Fix NULL argument handling in Hive GenericUDFs

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19499 )

Change subject: IMPALA-11911: Fix NULL argument handling in Hive GenericUDFs
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/9050/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/19499
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I53238612f4037572abb6d2cc913dd74ee830a9c9
Gerrit-Change-Number: 19499
Gerrit-PatchSet: 3
Gerrit-Owner: Csaba Ringhofer 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Peter Rozsa 
Gerrit-Reviewer: Steve Carlin 
Gerrit-Comment-Date: Wed, 15 Feb 2023 14:22:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11920: Support spill to HDFS address by service ID

2023-02-15 Thread Yida Wu (Code Review)
Yida Wu has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19496 )

Change subject: IMPALA-11920: Support spill to HDFS address by service ID
..


Patch Set 3:

(6 comments)

Thanks for the fix, it looks good, just some questions to understand the change.

http://gerrit.cloudera.org:8080/#/c/19496/3//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/19496/3//COMMIT_MSG@17
PS3, Line 17: priority
Just to confirm that the priority is removed in the document for the remote 
scratch directory, however, can still be parsed in the code.


http://gerrit.cloudera.org:8080/#/c/19496/3/be/src/runtime/tmp-file-mgr-test.cc
File be/src/runtime/tmp-file-mgr-test.cc:

http://gerrit.cloudera.org:8080/#/c/19496/3/be/src/runtime/tmp-file-mgr-test.cc@2240
PS3, Line 2240: hdfs
Maybe it is not related, but when I test with a space behind the url like 
--impalad_args='--scratch_dirs="hdfs://localhost:8020/ , /tmp"', it will try to 
create a path on " hdfs://localhost:8020/ /impala-scratch", could you please 
take a look on the rtrim of the path and add a testcase for it if it is an 
issue?


http://gerrit.cloudera.org:8080/#/c/19496/3/docs/topics/impala_disk_space.xml
File docs/topics/impala_disk_space.xml:

http://gerrit.cloudera.org:8080/#/c/19496/3/docs/topics/impala_disk_space.xml@298
PS3, Line 298: /dir1::0
The alignment looks weird compared to the previous version, does this part show 
normally?


http://gerrit.cloudera.org:8080/#/c/19496/3/docs/topics/impala_disk_space.xml@303
PS3, Line 303: /dir1:200GB
 : /dir1:200GB:
Same as above


http://gerrit.cloudera.org:8080/#/c/19496/3/docs/topics/impala_disk_space.xml@486
PS3, Line 486: ozone1
Just a question that what is the difference between Ozone service identifier 
and Ozone Manager? Is it no difference for the Impala to parse the Ozone 
scratch path?


http://gerrit.cloudera.org:8080/#/c/19496/3/docs/topics/impala_disk_space.xml@488
PS3, Line 488:
nit. looks good to have a comma here



--
To view, visit http://gerrit.cloudera.org:8080/19496
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie069cba211df85fe90d174900b20a26fcc1f7167
Gerrit-Change-Number: 19496
Gerrit-PatchSet: 3
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Smith 
Gerrit-Reviewer: Shajini Thayasingh 
Gerrit-Reviewer: Yida Wu 
Gerrit-Comment-Date: Wed, 15 Feb 2023 10:53:25 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-4052: CREATE TABLE LIKE for Kudu tables

2023-02-15 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18729 )

Change subject: IMPALA-4052: CREATE TABLE LIKE for Kudu tables
..


Patch Set 33:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/12378/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18729
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia3d276a6465301dbcfed17bb713aca06367d9a42
Gerrit-Change-Number: 18729
Gerrit-PatchSet: 33
Gerrit-Owner: Xiaoqing Gao 
Gerrit-Reviewer: Alexey Serbin 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Reviewer: Xiaoqing Gao 
Gerrit-Comment-Date: Wed, 15 Feb 2023 10:15:27 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4052: CREATE TABLE LIKE for Kudu tables

2023-02-15 Thread Xiaoqing Gao (Code Review)
Xiaoqing Gao has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18729 )

Change subject: IMPALA-4052: CREATE TABLE LIKE for Kudu tables
..


Patch Set 32:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18729/32/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/18729/32/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@3930
PS32, Line 3930: 
createTableParams.setPrimary_key_column_names(kuduTable.getPrimaryKeyColumnNames());
   : List columns = new ArrayList<>();
   : for (Column col : kuduTable.getColumns()) {
   :   columns.add(col.toThrift());
   : }
> IMPALA-11809 was merged recently, which added support for non unique primar
I fixed it and added some tests. Thanks for the reminder.



--
To view, visit http://gerrit.cloudera.org:8080/18729
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia3d276a6465301dbcfed17bb713aca06367d9a42
Gerrit-Change-Number: 18729
Gerrit-PatchSet: 32
Gerrit-Owner: Xiaoqing Gao 
Gerrit-Reviewer: Alexey Serbin 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Reviewer: Xiaoqing Gao 
Gerrit-Comment-Date: Wed, 15 Feb 2023 09:59:45 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-4052: CREATE TABLE LIKE for Kudu tables

2023-02-15 Thread Xiaoqing Gao (Code Review)
Hello Quanlong Huang, Kurt Deschler, Alexey Serbin, Wenzhe Zhou, Impala Public 
Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18729

to look at the new patch set (#33).

Change subject: IMPALA-4052: CREATE TABLE LIKE for Kudu tables
..

IMPALA-4052: CREATE TABLE LIKE for Kudu tables

This commit implements cloning between Kudu tables, including clone the
schema and hash partitions. But there is one limitation, cloning of
Kudu tables with range paritions is not supported. For cloning range
partitions, it's tracked by IMPALA-11912.

Cloning Kudu tables from other types of tables is not implemented,
because the table creation statements are different.

Testing:
 - e2e tests
 - AnalyzeDDLTest tests

Change-Id: Ia3d276a6465301dbcfed17bb713aca06367d9a42
---
M fe/src/main/java/org/apache/impala/analysis/CreateTableLikeStmt.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/main/java/org/apache/impala/util/KuduUtil.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java
M testdata/datasets/functional/functional_schema_template.sql
A 
testdata/workloads/functional-query/queries/QueryTest/kudu_create_table_like_table.test
M tests/custom_cluster/test_kudu.py
M tests/query_test/test_kudu.py
8 files changed, 232 insertions(+), 17 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/29/18729/33
--
To view, visit http://gerrit.cloudera.org:8080/18729
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ia3d276a6465301dbcfed17bb713aca06367d9a42
Gerrit-Change-Number: 18729
Gerrit-PatchSet: 33
Gerrit-Owner: Xiaoqing Gao 
Gerrit-Reviewer: Alexey Serbin 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Reviewer: Xiaoqing Gao 


[Impala-ASF-CR] IMPALA-11803: Impalad hit DCHECK when running union on empty table with MT DOP>1

2023-02-15 Thread Quanlong Huang (Code Review)
Quanlong Huang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19474 )

Change subject: IMPALA-11803: Impalad hit DCHECK when running union on  empty 
table with MT_DOP>1
..


Patch Set 3:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/19474/2//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/19474/2//COMMIT_MSG@7
PS2, Line 7: Impalad hit DCHECK when running union on
   :  empty table with MT_DOP>1
> I've updated the description, should I change the title of the jira as well
No, we just need to change the commit title. The JIRA title is ok to just 
describe the problem. The commit title would be nice to focus on how the issue 
is fixed, e.g. "Fix hitting DCHECK when running union on empty table with 
MT_DOP>1"

BTW, the title should be in one line.


http://gerrit.cloudera.org:8080/#/c/19474/2//COMMIT_MSG@11
PS2, Line 11: w
> Done
Not done yet in other lines..


http://gerrit.cloudera.org:8080/#/c/19474/3/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
File fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java:

http://gerrit.cloudera.org:8080/#/c/19474/3/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java@2095
PS3, Line 2095: useMtScanNode_ = queryOptions.mt_dop > 0;
Could you add a comment like this? So we won't make the same mistake when 
adding codes in the future.

 // update 'useMtScanNode_' before any return cases. It's used in BE.



--
To view, visit http://gerrit.cloudera.org:8080/19474
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Idbae5e1a78211327a214b2d936743bda767ae3c4
Gerrit-Change-Number: 19474
Gerrit-PatchSet: 3
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Wed, 15 Feb 2023 08:56:47 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11911: Fix NULL argument handling in Hive GenericUDFs

2023-02-15 Thread Peter Rozsa (Code Review)
Peter Rozsa has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19499 )

Change subject: IMPALA-11911: Fix NULL argument handling in Hive GenericUDFs
..


Patch Set 1: Code-Review+1

(1 comment)

Nice!

http://gerrit.cloudera.org:8080/#/c/19499/1/fe/src/test/java/org/apache/impala/hive/executor/UdfExecutorTest.java
File fe/src/test/java/org/apache/impala/hive/executor/UdfExecutorTest.java:

http://gerrit.cloudera.org:8080/#/c/19499/1/fe/src/test/java/org/apache/impala/hive/executor/UdfExecutorTest.java@232
PS1, Line 232:   Object originalArgs[], Object args[]) throws 
ImpalaException, TException {
originalArgs is missing from the Javadoc comment



--
To view, visit http://gerrit.cloudera.org:8080/19499
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I53238612f4037572abb6d2cc913dd74ee830a9c9
Gerrit-Change-Number: 19499
Gerrit-PatchSet: 1
Gerrit-Owner: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Peter Rozsa 
Gerrit-Reviewer: Steve Carlin 
Gerrit-Comment-Date: Wed, 15 Feb 2023 08:27:17 +
Gerrit-HasComments: Yes