[Impala-ASF-CR] IMPALA-6625: Skip computing parquet conjuncts for non-Parquet scans

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10704 )

Change subject: IMPALA-6625: Skip computing parquet conjuncts for non-Parquet 
scans
..


Patch Set 14: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/2772/


--
To view, visit http://gerrit.cloudera.org:8080/10704
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9d6c26d42db090c8a15c602f6419ad6399c329e7
Gerrit-Change-Number: 10704
Gerrit-PatchSet: 14
Gerrit-Owner: Pooja Nilangekar 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Pooja Nilangekar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 04 Jul 2018 04:46:01 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7240: Fix missing QueryMaintenance call in AddBatchStreaming

2018-07-03 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10863 )

Change subject: IMPALA-7240: Fix missing QueryMaintenance call in 
AddBatchStreaming
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/10863/1/be/src/exec/grouping-aggregator.cc
File be/src/exec/grouping-aggregator.cc:

http://gerrit.cloudera.org:8080/#/c/10863/1/be/src/exec/grouping-aggregator.cc@407
PS1, Line 407:   SCOPED_TIMER(build_timer_);
Do we have a similar bug here? I'm not sure how expr_results_pool_ gets cleared 
in the aggregator when driven from Open(). Same for the non-grouping aggregator.

It would be nice to avoid the need for these two sets of QueryMaintenance() 
calls to clear expr_results_pool_ at both levels. We could consider using 
ExecNode::expr_results_pool_, which reduces the granularity of some of the 
memory tracking but I think should be ok - the "result" allocations are 
expected to be small unless an expr misbehaves.

Otherwise we could have the parent ExecNode() clear the results pools on the 
aggregators at the same points as it's clearing its own pools to avoid this 
duplication.



--
To view, visit http://gerrit.cloudera.org:8080/10863
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I150a46d00acad139d186a150d9706ef47a10ac36
Gerrit-Change-Number: 10863
Gerrit-PatchSet: 1
Gerrit-Owner: Thomas Marshall 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 04 Jul 2018 02:37:36 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7240: Fix missing QueryMaintenance call in AddBatchStreaming

2018-07-03 Thread Michael Ho (Code Review)
Michael Ho has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10863 )

Change subject: IMPALA-7240: Fix missing QueryMaintenance call in 
AddBatchStreaming
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/10863/1/be/src/exec/grouping-aggregator.cc
File be/src/exec/grouping-aggregator.cc:

http://gerrit.cloudera.org:8080/#/c/10863/1/be/src/exec/grouping-aggregator.cc@423
PS1, Line 423: RETURN_IF_ERROR(QueryMaintenance(state));
expr_results_pool_ is used in AddBatchStreamingImpl(), right ?



--
To view, visit http://gerrit.cloudera.org:8080/10863
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I150a46d00acad139d186a150d9706ef47a10ac36
Gerrit-Change-Number: 10863
Gerrit-PatchSet: 1
Gerrit-Owner: Thomas Marshall 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 04 Jul 2018 02:01:37 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7240: Fix missing QueryMaintenance call in AddBatchStreaming

2018-07-03 Thread Michael Ho (Code Review)
Michael Ho has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10863 )

Change subject: IMPALA-7240: Fix missing QueryMaintenance call in 
AddBatchStreaming
..


Patch Set 1: Code-Review+1


--
To view, visit http://gerrit.cloudera.org:8080/10863
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I150a46d00acad139d186a150d9706ef47a10ac36
Gerrit-Change-Number: 10863
Gerrit-PatchSet: 1
Gerrit-Owner: Thomas Marshall 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 04 Jul 2018 02:01:49 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7163: Implement a state machine for the QueryState class

2018-07-03 Thread Michael Ho (Code Review)
Michael Ho has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10813 )

Change subject: IMPALA-7163: Implement a state machine for the QueryState class
..


Patch Set 4:

(15 comments)

http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-exec-mgr.cc
File be/src/runtime/query-exec-mgr.cc:

http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-exec-mgr.cc@128
PS4, Line 128:   qs->MonitorFInstances();
This seems to fit into IMPALA-4063 better. As the code stands in this patch, we 
are unnecessarily blocking this thread from exiting after starting all fragment 
instances. It makes sense in IMPALA-4063 as this is the thread responsible for 
aggregating and sending the profiles for all fragment instances.


http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.h
File be/src/runtime/query-state.h:

http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.h@159
PS4, Line 159: IMPALA-2990
IMPALA-4063


http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.h@278
PS4, Line 278: CANCELLED,
No implication on whether all fragment instances have finished cancelling 
themselves, right ?


http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.h@292
PS4, Line 292: IMPALA-2990
Did you mean IMPALA-4063 ? Does it mean this comment needs to be removed. May 
be convert the stuff which needs to be added in IMPALA-4063 as a TODO instead


http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.h@350
PS4, Line 350: boost::scoped_ptr
std::unique_ptr


http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.h@355
PS4, Line 355: boost::scoped_ptr
std::unique_ptr


http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.cc
File be/src/runtime/query-state.cc:

http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.cc@a350
PS4, Line 350:
 :
Was this a bug ?! Will we hang in the for loop at line 363 below if we fail to 
create a thread ?


http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.cc@141
PS4, Line 141: instances_prepared_barrier_
As discussed offline, the behavior during "preparation" phase of the query is 
that we will wait for fragment instances to finish preparing even if some of 
them run into errors. So, in that sense, the existing code seems to suffice 
unless I misunderstood something.


http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.cc@225
PS4, Line 225: DCHECK(true)
DCHECK(false) ?! Does this warrant a targeted BE test ?


http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.cc@230
PS4, Line 230: QueryState::HandleExecStateTransition
This function seems like a no-op in this patch. Can this be added in the next 
patch when necessary ?


http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.cc@278
PS4, Line 278:   if (!status.ok() && (!status.IsCancelled() || 
!IsTerminalState(old_state))) {
 : VLOG_QUERY << Substitute("BackendExecState: query id=$0 
finstance=$1  ($2 -> $3) "
 :  "status=$4",
 : PrintId(query_id()),
 : failed_instance != TUniqueId() ? 
PrintId(failed_instance) : "N/A",
 : BackendExecStateToString(old_state), 
BackendExecStateToString(new_state),
 : status.GetDetail());
Will this actually be duplicating the error message which we usually log 
already when there is an error ?


http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.cc@391
PS4, Line 391: return WaitForPrepareInternal().status_;
As discussed offline, there is no need to have a different implementation for 
waiting for all fis to finish preparing unless we can simplify it further 
somehow.


http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.cc@434
PS4, Line 434: IMPALA-2990
IMPALA-4063 ?


http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.cc@502
PS4, Line 502: QueryState::MonitorFInstances()
Please see comments in query-exec-mgr.cc


http://gerrit.cloudera.org:8080/#/c/10813/4/be/src/runtime/query-state.cc@542
PS4, Line 542:   // initiate cancellation if nobody has done so yet
 :   if (!status.ok()) Cancel();
Not sure what you think about the following. It appears to me that we may not 
necessarily need to UpdateBackendState() above to track the explicit state 
switching. Instead we can do something like below in this function and some add 
on to Cancel():

 if (!status.ok()) {
 Cancel();
 instances_finished_barrier_->NotifyRemaining(status);
 } else {
 instances_finished_barrier_->Notify();
 }

So, in IMPALA-4063, the query state thread will wait on this 
instances_finished_barrier_ after all fragment instances are done preparing. 
Once the wait returns, it may consider sending the profile right away 

[Impala-ASF-CR] IMPALA-3040: Remove cache directives if a partition is dropped externally

2018-07-03 Thread Tianyi Wang (Code Review)
Tianyi Wang has uploaded a new patch set (#4). ( 
http://gerrit.cloudera.org:8080/10792 )

Change subject: IMPALA-3040: Remove cache directives if a partition is dropped 
externally
..

IMPALA-3040: Remove cache directives if a partition is dropped externally

HdfsTable.dropPartition() doesn't uncache the partition right now. If
the partition is dropped from Hive and refreshed in Impala, the
partition will be removed from the catalog but the cache directive
remains. Because Impala directly uses HMS client to drop a
table/database, the cache directive won't be removed even if the table
is dropped in Impala, if the backgroud loading is run concurrenty with
the HMS client RPC call. This patch removes the cache directive in
dropPartition() to fix this bug.

Change-Id: Id7701a499405e961456adea63f3592b43bd69170
---
M fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M tests/query_test/test_hdfs_caching.py
3 files changed, 28 insertions(+), 3 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/92/10792/4
--
To view, visit http://gerrit.cloudera.org:8080/10792
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Id7701a499405e961456adea63f3592b43bd69170
Gerrit-Change-Number: 10792
Gerrit-PatchSet: 4
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Tianyi Wang 


[Impala-ASF-CR] IMPALA-3040: Remove cache directives if a partition is dropped externally

2018-07-03 Thread Tianyi Wang (Code Review)
Tianyi Wang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10792 )

Change subject: IMPALA-3040: Remove cache directives if a partition is dropped 
externally
..


Patch Set 4:

(6 comments)

http://gerrit.cloudera.org:8080/#/c/10792/3//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/10792/3//COMMIT_MSG@7
PS3, Line 7: if a partition is dropped externally
> may be say external table drops?
Done


http://gerrit.cloudera.org:8080/#/c/10792/3//COMMIT_MSG@9
PS3, Line 9:
> Add some context about what happens when it is dropped from Hive.
Done


http://gerrit.cloudera.org:8080/#/c/10792/3//COMMIT_MSG@13
PS3, Line 13: table/database, the cache directive won't be removed even if the 
table
> Could you add a test for this in test_hdfs_caching?
Done


http://gerrit.cloudera.org:8080/#/c/10792/3/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
File fe/src/main/java/org/apache/impala/catalog/HdfsTable.java:

http://gerrit.cloudera.org:8080/#/c/10792/3/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java@1158
PS3, Line 1158:* HdfsPartition that was dropped or null if the partition 
does not exist.
> Update that this drops the cache directive if its cached.
Done


http://gerrit.cloudera.org:8080/#/c/10792/3/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java@1191
PS3, Line 1191:   // If there are multiple partition ids corresponding to a 
literal, remove
> I think this should only run on the Catalog server?
I checked the callers of this function and it can only be called from catalogd.


http://gerrit.cloudera.org:8080/#/c/10792/3/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java@1193
PS3, Line 1193:   if (partitionIds.size() > 1) 
partitionIds.remove(partitionId);
> Do we need to remove a similar check from CatalogOpEx#alterTableDropPartiti
Done



--
To view, visit http://gerrit.cloudera.org:8080/10792
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id7701a499405e961456adea63f3592b43bd69170
Gerrit-Change-Number: 10792
Gerrit-PatchSet: 4
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Comment-Date: Wed, 04 Jul 2018 00:54:31 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7240: Fix missing QueryMaintenance call in AddBatchStreaming

2018-07-03 Thread Thomas Marshall (Code Review)
Thomas Marshall has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/10863


Change subject: IMPALA-7240: Fix missing QueryMaintenance call in 
AddBatchStreaming
..

IMPALA-7240: Fix missing QueryMaintenance call in AddBatchStreaming

A recent change, IMPALA-110 (part 2), refactored the aggregation code.
During this refactor, a call to QueryMaintenance() was inadvertantely
dropped in the AddBatchStreaming() path, causing
test_spilling_regression_exhaustive to fail by hitting the memory
limit.

Testing:
- Ran test_spilling_regression_exhaustive

Change-Id: I150a46d00acad139d186a150d9706ef47a10ac36
---
M be/src/exec/grouping-aggregator.cc
1 file changed, 1 insertion(+), 0 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/63/10863/1
--
To view, visit http://gerrit.cloudera.org:8080/10863
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I150a46d00acad139d186a150d9706ef47a10ac36
Gerrit-Change-Number: 10863
Gerrit-PatchSet: 1
Gerrit-Owner: Thomas Marshall 


[Impala-ASF-CR] IMPALA-7236: Fix the parsing of ALLOW ERASURE CODED FILES

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10857 )

Change subject: IMPALA-7236: Fix the parsing of ALLOW_ERASURE_CODED_FILES
..


Patch Set 2: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10857
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ife1e791541e3f4fed6bec00945390c7d7681e824
Gerrit-Change-Number: 10857
Gerrit-PatchSet: 2
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Taras Bobrovytsky 
Gerrit-Comment-Date: Tue, 03 Jul 2018 23:49:43 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7236: Fix the parsing of ALLOW ERASURE CODED FILES

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10857 )

Change subject: IMPALA-7236: Fix the parsing of ALLOW_ERASURE_CODED_FILES
..

IMPALA-7236: Fix the parsing of ALLOW_ERASURE_CODED_FILES

This patch adds a missing "break" statement in a switch statement
changed by IMPALA-7102.
Also fixes an non-deterministic test case.

Change-Id: Ife1e791541e3f4fed6bec00945390c7d7681e824
Reviewed-on: http://gerrit.cloudera.org:8080/10857
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M be/src/service/query-options.cc
M testdata/workloads/functional-query/queries/QueryTest/hdfs-erasure-coding.test
2 files changed, 2 insertions(+), 1 deletion(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/10857
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Ife1e791541e3f4fed6bec00945390c7d7681e824
Gerrit-Change-Number: 10857
Gerrit-PatchSet: 3
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Taras Bobrovytsky 


[Impala-ASF-CR] IMPALA-6034: Add CPU and scanned bytes limits per query

2018-07-03 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10415 )

Change subject: IMPALA-6034: Add CPU and scanned bytes limits per query
..


Patch Set 6:

(6 comments)

http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/runtime/coordinator-backend-state.h
File be/src/runtime/coordinator-backend-state.h:

http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/runtime/coordinator-backend-state.h@198
PS8, Line 198:
> For this fragment instance
Done


http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/runtime/coordinator-backend-state.h@207
PS8, Line 207: /// Collection of BYTES_READ_COUNTERs of all scan nodes in 
this fragment instance.
> peak_mem_counter_ was removed...
Done


http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/runtime/coordinator-backend-state.cc
File be/src/runtime/coordinator-backend-state.cc:

http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/runtime/coordinator-backend-state.cc@294
PS8, Line 294:   // TODO: We're losing this profile information. Call 
ReportQuerySummary only after
> I'm considering adjusting this to use an O(1) algorithm - Update() could re
Done


http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/runtime/coordinator.h
File be/src/runtime/coordinator.h:

http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/runtime/coordinator.h@177
PS8, Line 177:
> This isn't accurate, could be fragment instances, backend or query.
Done


http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/service/query-options.h
File be/src/service/query-options.h:

http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/service/query-options.h@135
PS8, Line 135:TQueryOptionLevel::ADVANCED)\
> Need to fix tabs
Done


http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/service/query-options.cc
File be/src/service/query-options.cc:

http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/service/query-options.cc@682
PS8, Line 682: // Parse the scan bytes limit and validate it.
> Useless comment.
Done



--
To view, visit http://gerrit.cloudera.org:8080/10415
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4c6015e21da684bb9f33e236d71309dd4c178a20
Gerrit-Change-Number: 10415
Gerrit-PatchSet: 6
Gerrit-Owner: Mostafa Mokhtar 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Mostafa Mokhtar 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 23:40:55 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7238: Use custom timeout for create unique database

2018-07-03 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10862 )

Change subject: IMPALA-7238: Use custom timeout for create unique database
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/10862/1/tests/conftest.py
File tests/conftest.py:

http://gerrit.cloudera.org:8080/#/c/10862/1/tests/conftest.py@391
PS1, Line 391:   with __auto_closed_conn(timeout=timeout) as conn:
Another way to fix it is to use a higher timeout universally.



--
To view, visit http://gerrit.cloudera.org:8080/10862
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4f2beb5bc027a4bb44e854bf1dd8919807a92ea0
Gerrit-Change-Number: 10862
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Tue, 03 Jul 2018 23:30:53 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7238: Use custom timeout for create unique database

2018-07-03 Thread Joe McDonnell (Code Review)
Joe McDonnell has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/10862


Change subject: IMPALA-7238: Use custom timeout for create unique database
..

IMPALA-7238: Use custom timeout for create unique database

test_kudu.TestCreateExternalTables() saw a timeout when
creating the unique database for its tests.

__unique_conn() opens a connection, creates a unique database,
then returns another connection in that database. It takes
a custom timeout argument, but the timeout is only for the
returned connection. The first connection to create the
unique database uses the default timeout of 45 seconds.

This patch changes the first connection to use the custom
timeout. For Kudu tests, this is 5 minutes rather than 45
seconds.

Change-Id: I4f2beb5bc027a4bb44e854bf1dd8919807a92ea0
---
M tests/conftest.py
1 file changed, 1 insertion(+), 1 deletion(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/62/10862/1
--
To view, visit http://gerrit.cloudera.org:8080/10862
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I4f2beb5bc027a4bb44e854bf1dd8919807a92ea0
Gerrit-Change-Number: 10862
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 


[Impala-ASF-CR] IMPALA-6034: Add CPU and scanned bytes limits per query

2018-07-03 Thread Tim Armstrong (Code Review)
Tim Armstrong has uploaded a new patch set (#9) to the change originally 
created by Mostafa Mokhtar. ( http://gerrit.cloudera.org:8080/10415 )

Change subject: IMPALA-6034: Add CPU and scanned bytes limits per query
..

IMPALA-6034: Add CPU and scanned bytes limits per query

To protect Impala from runaway queries add per query limits for CPU and
scanned bytes, limits are enforced via query options and applied to the entire
query, not per backend like mem_limit.
If a query exceeds any of the limits it will get cancelled.

Query profile is updated to include query wide and per backend metrics for Cpu
and scanned bytes.

Testing:
Added tests for various permutations for CPU_LIMIT_S and
SCAN_BYTES_LIMIT

Change-Id: I4c6015e21da684bb9f33e236d71309dd4c178a20
---
M be/src/runtime/coordinator-backend-state.cc
M be/src/runtime/coordinator-backend-state.h
M be/src/runtime/coordinator.cc
M be/src/runtime/coordinator.h
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
M be/src/service/query-options-test.cc
M be/src/service/query-options.cc
M be/src/service/query-options.h
M common/thrift/ImpalaInternalService.thrift
M common/thrift/ImpalaService.thrift
A 
testdata/workloads/functional-query/queries/QueryTest/query-resource-limits.test
M tests/query_test/test_cancellation.py
M tests/query_test/test_resource_limits.py
14 files changed, 351 insertions(+), 43 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/15/10415/9
--
To view, visit http://gerrit.cloudera.org:8080/10415
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I4c6015e21da684bb9f33e236d71309dd4c178a20
Gerrit-Change-Number: 10415
Gerrit-PatchSet: 9
Gerrit-Owner: Mostafa Mokhtar 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Mostafa Mokhtar 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-3956: [DOCS] Escape variables with '\' in impala-shell

2018-07-03 Thread Alex Rodoni (Code Review)
Alex Rodoni has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/10861


Change subject: IMPALA-3956: [DOCS]  Escape variables with '\' in impala-shell
..

IMPALA-3956: [DOCS]  Escape variables with '\' in impala-shell

Change-Id: Ifb95785a143939a94d55d3565364afe1e26c1f3d
---
M docs/topics/impala_shell_commands.xml
M docs/topics/impala_shell_running_commands.xml
2 files changed, 125 insertions(+), 299 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/61/10861/1
--
To view, visit http://gerrit.cloudera.org:8080/10861
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ifb95785a143939a94d55d3565364afe1e26c1f3d
Gerrit-Change-Number: 10861
Gerrit-PatchSet: 1
Gerrit-Owner: Alex Rodoni 


[Impala-ASF-CR] IMPALA-6883: [DOCS] Refactor impala authorization doc

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10786 )

Change subject: IMPALA-6883: [DOCS] Refactor impala_authorization doc
..


Patch Set 6: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10786
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3df72adb25dcdcbc286934b048645f47d876b33d
Gerrit-Change-Number: 10786
Gerrit-PatchSet: 6
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 03 Jul 2018 23:21:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6883: [DOCS] Refactor impala authorization doc

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10786 )

Change subject: IMPALA-6883: [DOCS] Refactor impala_authorization doc
..

IMPALA-6883: [DOCS] Refactor impala_authorization doc

Change-Id: I3df72adb25dcdcbc286934b048645f47d876b33d
Reviewed-on: http://gerrit.cloudera.org:8080/10786
Reviewed-by: Alex Rodoni 
Tested-by: Impala Public Jenkins 
---
M docs/shared/impala_common.xml
M docs/topics/impala_authorization.xml
M docs/topics/impala_grant.xml
3 files changed, 543 insertions(+), 701 deletions(-)

Approvals:
  Alex Rodoni: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10786
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I3df72adb25dcdcbc286934b048645f47d876b33d
Gerrit-Change-Number: 10786
Gerrit-PatchSet: 7
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-6883: [DOCS] Refactor impala authorization doc

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10786 )

Change subject: IMPALA-6883: [DOCS] Refactor impala_authorization doc
..


Patch Set 6:

Build started: https://jenkins.impala.io/job/gerrit-docs-submit/339/


--
To view, visit http://gerrit.cloudera.org:8080/10786
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3df72adb25dcdcbc286934b048645f47d876b33d
Gerrit-Change-Number: 10786
Gerrit-PatchSet: 6
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 03 Jul 2018 23:12:08 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6883: [DOCS] Refactor impala authorization doc

2018-07-03 Thread Alex Rodoni (Code Review)
Alex Rodoni has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10786 )

Change subject: IMPALA-6883: [DOCS] Refactor impala_authorization doc
..


Patch Set 6: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/10786
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3df72adb25dcdcbc286934b048645f47d876b33d
Gerrit-Change-Number: 10786
Gerrit-PatchSet: 6
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 03 Jul 2018 23:11:42 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6883: [DOCS] Refactor impala authorization doc

2018-07-03 Thread Alex Rodoni (Code Review)
Hello Fredy Wijaya, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10786

to look at the new patch set (#6).

Change subject: IMPALA-6883: [DOCS] Refactor impala_authorization doc
..

IMPALA-6883: [DOCS] Refactor impala_authorization doc

Change-Id: I3df72adb25dcdcbc286934b048645f47d876b33d
---
M docs/shared/impala_common.xml
M docs/topics/impala_authorization.xml
M docs/topics/impala_grant.xml
3 files changed, 543 insertions(+), 701 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/86/10786/6
--
To view, visit http://gerrit.cloudera.org:8080/10786
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I3df72adb25dcdcbc286934b048645f47d876b33d
Gerrit-Change-Number: 10786
Gerrit-PatchSet: 6
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-3040: Remove cache directives during background partition dropping

2018-07-03 Thread Bharath Vissapragada (Code Review)
Bharath Vissapragada has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10792 )

Change subject: IMPALA-3040: Remove cache directives during background 
partition dropping
..


Patch Set 3:

(6 comments)

http://gerrit.cloudera.org:8080/#/c/10792/3//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/10792/3//COMMIT_MSG@7
PS3, Line 7: during background partition dropping
may be say external table drops?


http://gerrit.cloudera.org:8080/#/c/10792/3//COMMIT_MSG@9
PS3, Line 9:
Add some context about what happens when it is dropped from Hive.


http://gerrit.cloudera.org:8080/#/c/10792/3//COMMIT_MSG@13
PS3, Line 13:
Could you add a test for this in test_hdfs_caching?


http://gerrit.cloudera.org:8080/#/c/10792/3/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
File fe/src/main/java/org/apache/impala/catalog/HdfsTable.java:

http://gerrit.cloudera.org:8080/#/c/10792/3/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java@1158
PS3, Line 1158:* HdfsPartition that was dropped or null if the partition 
does not exist.
Update that this drops the cache directive if its cached.


http://gerrit.cloudera.org:8080/#/c/10792/3/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java@1191
PS3, Line 1191: if (partition.isMarkedCached()) {
I think this should only run on the Catalog server?


http://gerrit.cloudera.org:8080/#/c/10792/3/fe/src/main/java/org/apache/impala/catalog/HdfsTable.java@1193
PS3, Line 1193: 
HdfsCachingUtil.removePartitionCacheDirective(partition);
Do we need to remove a similar check from 
CatalogOpEx#alterTableDropPartition()? No point in doing it twice.



--
To view, visit http://gerrit.cloudera.org:8080/10792
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id7701a499405e961456adea63f3592b43bd69170
Gerrit-Change-Number: 10792
Gerrit-PatchSet: 3
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Comment-Date: Tue, 03 Jul 2018 22:39:48 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7213: Port ReportExecStatus() RPC to use KRPC

2018-07-03 Thread Michael Ho (Code Review)
Michael Ho has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10855 )

Change subject: IMPALA-7213: Port ReportExecStatus() RPC to use KRPC
..


Patch Set 2:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/10855/2/be/src/runtime/query-state.cc
File be/src/runtime/query-state.cc:

http://gerrit.cloudera.org:8080/#/c/10855/2/be/src/runtime/query-state.cc@263
PS2, Line 263: serialize_status.ok()
Need a test case for serialization failure. May be global debug action is 
useful ?


http://gerrit.cloudera.org:8080/#/c/10855/2/be/src/runtime/query-state.cc@303
PS2, Line 303: !RpcMgr::IsServerTooBusy(rpc_controller))
Need a test case for this.


http://gerrit.cloudera.org:8080/#/c/10855/2/be/src/service/control-service.cc
File be/src/service/control-service.cc:

http://gerrit.cloudera.org:8080/#/c/10855/2/be/src/service/control-service.cc@107
PS2, Line 107: (UNLIKELY(!deserialize_status.ok())
Need a test case for deserialization error.



--
To view, visit http://gerrit.cloudera.org:8080/10855
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7638583b433dcac066b87198e448743d90415ebe
Gerrit-Change-Number: 10855
Gerrit-PatchSet: 2
Gerrit-Owner: Michael Ho 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Comment-Date: Tue, 03 Jul 2018 22:30:11 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-3040: Remove cache directives during background partition dropping

2018-07-03 Thread Tianyi Wang (Code Review)
Tianyi Wang has uploaded a new patch set (#3). ( 
http://gerrit.cloudera.org:8080/10792 )

Change subject: IMPALA-3040: Remove cache directives during background 
partition dropping
..

IMPALA-3040: Remove cache directives during background partition dropping

HdfsTable.dropPartition() doesn't uncache the partition right now. If
the table is later dropped, the partition won't be uncached either
because it has been removed then. This patch removes the cache directive
in dropPartition() to fix this bug.

Change-Id: Id7701a499405e961456adea63f3592b43bd69170
---
M fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
1 file changed, 8 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/92/10792/3
--
To view, visit http://gerrit.cloudera.org:8080/10792
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Id7701a499405e961456adea63f3592b43bd69170
Gerrit-Change-Number: 10792
Gerrit-PatchSet: 3
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Tianyi Wang 


[Impala-ASF-CR] IMPALA-7202: Add width bucket() to the decimal fuzz test

2018-07-03 Thread Taras Bobrovytsky (Code Review)
Taras Bobrovytsky has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/10859


Change subject: IMPALA-7202: Add width_bucket() to the decimal fuzz test
..

IMPALA-7202: Add width_bucket() to the decimal fuzz test

In this patch we include the newly added width_bucket() function in the
decimal fuzz test.

Change-Id: I1f8a63733ebddc07f46c525ca51ad4794dd0
---
M tests/query_test/test_decimal_fuzz.py
1 file changed, 57 insertions(+), 3 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/59/10859/1
--
To view, visit http://gerrit.cloudera.org:8080/10859
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I1f8a63733ebddc07f46c525ca51ad4794dd0
Gerrit-Change-Number: 10859
Gerrit-PatchSet: 1
Gerrit-Owner: Taras Bobrovytsky 


[Impala-ASF-CR] IMPALA-6883: [DOCS] Refactor impala authorization doc

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10786 )

Change subject: IMPALA-6883: [DOCS] Refactor impala_authorization doc
..


Patch Set 5:

Build started: https://jenkins.impala.io/job/gerrit-docs-submit/338/


--
To view, visit http://gerrit.cloudera.org:8080/10786
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3df72adb25dcdcbc286934b048645f47d876b33d
Gerrit-Change-Number: 10786
Gerrit-PatchSet: 5
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 03 Jul 2018 21:14:04 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6086: Use of permanent function should require SELECT privilege on DB

2018-07-03 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10850 )

Change subject: IMPALA-6086: Use of permanent function should require SELECT 
privilege on DB
..


Patch Set 3: Code-Review+1


--
To view, visit http://gerrit.cloudera.org:8080/10850
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iee70f15e4c04f7daaed9cac2400ec626e1fb0e57
Gerrit-Change-Number: 10850
Gerrit-PatchSet: 3
Gerrit-Owner: Zoram Thanga 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Zoram Thanga 
Gerrit-Comment-Date: Tue, 03 Jul 2018 21:04:04 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6086: Use of permanent function should require SELECT privilege on DB

2018-07-03 Thread Zoram Thanga (Code Review)
Zoram Thanga has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10850 )

Change subject: IMPALA-6086: Use of permanent function should require SELECT 
privilege on DB
..


Patch Set 2:

(6 comments)

Thanks. Please see PS#3.

http://gerrit.cloudera.org:8080/#/c/10850/2/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java
File fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java:

http://gerrit.cloudera.org:8080/#/c/10850/2/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java@2084
PS2, Line 2084:
> nit: add two extra spaces for continued indentation
Done


http://gerrit.cloudera.org:8080/#/c/10850/2/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java@2087
PS2, Line 2087:   "select functional.to_lower('ABCDEF')")
> nit: move L2088 to L2087
Done


http://gerrit.cloudera.org:8080/#/c/10850/2/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java@2091
PS2, Line 2091: .ok(onDatabase("functional", TPrivilegeLevel.ALL))
> nit: move .ok(onDatabase("functional", TPrivilegeLevel.ALL)) to be at the b
Done


http://gerrit.cloudera.org:8080/#/c/10850/2/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java@2092
PS2, Line 2092: .ok(onDatabase("functional", TPrivilegeLevel.INSERT))
  : .ok(onDatabase("functional", 
TPrivilegeLevel.REFRESH))
> INSERT, REFRESH, and SELECT can be simplified to .ok(onDatabase("functional
Done. Want me to wrap ALL under viewMetadataPrivileges() as well?


http://gerrit.cloudera.org:8080/#/c/10850/2/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java@2096
PS2, Line 2096: TPrivilegeLevel.SELECT, TPrivilegeLevel.ALL,
  : TPrivilegeLevel.INSERT, 
TPrivilegeLevel.REFRESH
> can be simplified to allExcept(viewMetadataPrivileges())
Done


http://gerrit.cloudera.org:8080/#/c/10850/2/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java@2167
PS2, Line 2167: uriPath, symbolName, null, null,
> nit: move L2168 to L2167
Done



--
To view, visit http://gerrit.cloudera.org:8080/10850
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iee70f15e4c04f7daaed9cac2400ec626e1fb0e57
Gerrit-Change-Number: 10850
Gerrit-PatchSet: 2
Gerrit-Owner: Zoram Thanga 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Zoram Thanga 
Gerrit-Comment-Date: Tue, 03 Jul 2018 20:40:38 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7236: Fix the parsing of ALLOW ERASURE CODED FILES

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10857 )

Change subject: IMPALA-7236: Fix the parsing of ALLOW_ERASURE_CODED_FILES
..


Patch Set 2:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/2773/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/10857
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ife1e791541e3f4fed6bec00945390c7d7681e824
Gerrit-Change-Number: 10857
Gerrit-PatchSet: 2
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Taras Bobrovytsky 
Gerrit-Comment-Date: Tue, 03 Jul 2018 20:38:22 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7236: Fix the parsing of ALLOW ERASURE CODED FILES

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10857 )

Change subject: IMPALA-7236: Fix the parsing of ALLOW_ERASURE_CODED_FILES
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/10857
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ife1e791541e3f4fed6bec00945390c7d7681e824
Gerrit-Change-Number: 10857
Gerrit-PatchSet: 2
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Taras Bobrovytsky 
Gerrit-Comment-Date: Tue, 03 Jul 2018 20:38:21 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7190: Remove unsupported format writer support

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10823 )

Change subject: IMPALA-7190: Remove unsupported format writer support
..


Patch Set 4: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10823
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I821dc7495a901f1658daa500daf3791b386c7185
Gerrit-Change-Number: 10823
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 20:34:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7190: Remove unsupported format writer support

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10823 )

Change subject: IMPALA-7190: Remove unsupported format writer support
..

IMPALA-7190: Remove unsupported format writer support

This patch removes write support for unsupported formats like Sequence,
Avro and compressed text. Also, the related query options
ALLOW_UNSUPPORTED_FORMATS and SEQ_COMPRESSION_MODE have been migrated
to the REMOVED query options type.

Testing:
Ran exhaustive build.

Change-Id: I821dc7495a901f1658daa500daf3791b386c7185
Reviewed-on: http://gerrit.cloudera.org:8080/10823
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M be/src/exec/CMakeLists.txt
D be/src/exec/hdfs-avro-table-writer.cc
D be/src/exec/hdfs-avro-table-writer.h
D be/src/exec/hdfs-sequence-table-writer.cc
D be/src/exec/hdfs-sequence-table-writer.h
M be/src/exec/hdfs-table-sink.cc
M be/src/exec/hdfs-text-table-writer.cc
M be/src/exec/hdfs-text-table-writer.h
M be/src/service/query-options-test.cc
M be/src/service/query-options.cc
M be/src/service/query-options.h
M common/thrift/ImpalaInternalService.thrift
M common/thrift/ImpalaService.thrift
M fe/src/test/java/org/apache/impala/planner/PlannerTestBase.java
M testdata/bad_avro_snap/README
D testdata/workloads/functional-query/queries/QueryTest/avro-writer.test
D testdata/workloads/functional-query/queries/QueryTest/seq-writer.test
M testdata/workloads/functional-query/queries/QueryTest/set.test
D testdata/workloads/functional-query/queries/QueryTest/text-writer.test
A testdata/workloads/functional-query/queries/QueryTest/unsupported-writers.test
M tests/common/test_dimensions.py
M tests/hs2/test_hs2.py
M tests/metadata/test_partition_metadata.py
M tests/query_test/test_compressed_formats.py
M tests/shell/test_shell_interactive.py
25 files changed, 121 insertions(+), 1,607 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/10823
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I821dc7495a901f1658daa500daf3791b386c7185
Gerrit-Change-Number: 10823
Gerrit-PatchSet: 5
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-6086: Use of permanent function should require SELECT privilege on DB

2018-07-03 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10850 )

Change subject: IMPALA-6086: Use of permanent function should require SELECT 
privilege on DB
..


Patch Set 2:

(6 comments)

http://gerrit.cloudera.org:8080/#/c/10850/2/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java
File fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java:

http://gerrit.cloudera.org:8080/#/c/10850/2/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java@2084
PS2, Line 2084:
nit: add two extra spaces for continued indentation


http://gerrit.cloudera.org:8080/#/c/10850/2/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java@2087
PS2, Line 2087:   "select functional.to_lower('ABCDEF')")
nit: move L2088 to L2087


http://gerrit.cloudera.org:8080/#/c/10850/2/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java@2091
PS2, Line 2091: .ok(onDatabase("functional", TPrivilegeLevel.ALL))
nit: move .ok(onDatabase("functional", TPrivilegeLevel.ALL)) to be at the 
beginning (L2089) to be consistent with other code


http://gerrit.cloudera.org:8080/#/c/10850/2/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java@2092
PS2, Line 2092: .ok(onDatabase("functional", TPrivilegeLevel.INSERT))
  : .ok(onDatabase("functional", 
TPrivilegeLevel.REFRESH))
INSERT, REFRESH, and SELECT can be simplified to .ok(onDatabase("functional", 
viewMetadataPrivileges())


http://gerrit.cloudera.org:8080/#/c/10850/2/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java@2096
PS2, Line 2096: TPrivilegeLevel.SELECT, TPrivilegeLevel.ALL,
  : TPrivilegeLevel.INSERT, 
TPrivilegeLevel.REFRESH
can be simplified to allExcept(viewMetadataPrivileges())


http://gerrit.cloudera.org:8080/#/c/10850/2/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java@2167
PS2, Line 2167: uriPath, symbolName, null, null,
nit: move L2168 to L2167



--
To view, visit http://gerrit.cloudera.org:8080/10850
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iee70f15e4c04f7daaed9cac2400ec626e1fb0e57
Gerrit-Change-Number: 10850
Gerrit-PatchSet: 2
Gerrit-Owner: Zoram Thanga 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Zoram Thanga 
Gerrit-Comment-Date: Tue, 03 Jul 2018 20:15:21 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6086: Use of permanent function should require SELECT privilege on DB

2018-07-03 Thread Zoram Thanga (Code Review)
Hello Fredy Wijaya, Vuk Ercegovac,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10850

to look at the new patch set (#2).

Change subject: IMPALA-6086: Use of permanent function should require SELECT 
privilege on DB
..

IMPALA-6086: Use of permanent function should require SELECT privilege
on DB

To use a permanent UDF should require at leat SELECT privilege on the
database. Functions that have constant arguments get constant-folded
into string literals, losing their privilege requests in the process.

This patch saves the privilege requests found during the first phase
of query analysis, where all the objects and the privileges required
to access them are identified. The requests are added back to the
new analyzer created for re-analysis post expression rewrite.

New FE test cases have been added to AuthorizationStmtTest.

Manual tests were also done to identify the bug, as well as to test
the fix.

Ran private parameterized Jenkins job, exhaustive exploration and
covering all tests.

Change-Id: Iee70f15e4c04f7daaed9cac2400ec626e1fb0e57
---
M fe/src/main/java/org/apache/impala/analysis/AnalysisContext.java
M fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java
2 files changed, 56 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/50/10850/2
--
To view, visit http://gerrit.cloudera.org:8080/10850
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Iee70f15e4c04f7daaed9cac2400ec626e1fb0e57
Gerrit-Change-Number: 10850
Gerrit-PatchSet: 2
Gerrit-Owner: Zoram Thanga 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Zoram Thanga 


[Impala-ASF-CR] IMPALA-6086: Use of permanent function should require SELECT privilege on DB

2018-07-03 Thread Zoram Thanga (Code Review)
Zoram Thanga has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10850 )

Change subject: IMPALA-6086: Use of permanent function should require SELECT 
privilege on DB
..


Patch Set 1:

(1 comment)

Thanks for the review. Please have a look at PS#2

http://gerrit.cloudera.org:8080/#/c/10850/1/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java
File fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java:

http://gerrit.cloudera.org:8080/#/c/10850/1/fe/src/test/java/org/apache/impala/analysis/AuthorizationStmtTest.java@2071
PS1, Line 2071: authorize("create function to_lower(string) returns string 
location " +
> As mentioned in the previous review, this code doesn't create a function in
Thanks for the education! I've made the suggested changes.



--
To view, visit http://gerrit.cloudera.org:8080/10850
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iee70f15e4c04f7daaed9cac2400ec626e1fb0e57
Gerrit-Change-Number: 10850
Gerrit-PatchSet: 1
Gerrit-Owner: Zoram Thanga 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Reviewer: Zoram Thanga 
Gerrit-Comment-Date: Tue, 03 Jul 2018 20:02:23 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7027: Mulitple varchar cast fails with distinct

2018-07-03 Thread Adam Holley (Code Review)
Adam Holley has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/10858


Change subject: IMPALA-7027: Mulitple varchar cast fails with distinct
..

IMPALA-7027: Mulitple varchar cast fails with distinct

When AggregateInfo removes duplicates of groupingExprs during the
second pass of rewrites, the CastExprs becomes StringLiterals.
Because the localEquals checks only comparase value, which in this case
is "" for both, the second cast expr is removed.  This causes the
second castTo() to fail.  This fix adds a check to StringLiteral.localEquals
to compare the type so that different items will not be removed.

Testing:
- Added test to validate distinct with casts
- Ran all FE tests
- Ran all E2E tests

Change-Id: I2fa5890eaa89787645c7d3d2eef976f54a34e7c0
---
M fe/src/main/java/org/apache/impala/analysis/StringLiteral.java
M fe/src/test/java/org/apache/impala/analysis/ExprRewriterTest.java
2 files changed, 12 insertions(+), 0 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/58/10858/1
--
To view, visit http://gerrit.cloudera.org:8080/10858
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I2fa5890eaa89787645c7d3d2eef976f54a34e7c0
Gerrit-Change-Number: 10858
Gerrit-PatchSet: 1
Gerrit-Owner: Adam Holley 


[Impala-ASF-CR] IMPALA-3956 - Impala shell should ignore variables in comments

2018-07-03 Thread Adam Holley (Code Review)
Adam Holley has abandoned this change. ( http://gerrit.cloudera.org:8080/10784 )

Change subject: IMPALA-3956 - Impala shell should ignore variables in comments
..


Abandoned

Changed to doc fix.
--
To view, visit http://gerrit.cloudera.org:8080/10784
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: abandon
Gerrit-Change-Id: I9e442e169d512daf80c86606528300ce4e6f371c
Gerrit-Change-Number: 10784
Gerrit-PatchSet: 3
Gerrit-Owner: Adam Holley 
Gerrit-Reviewer: Adam Holley 
Gerrit-Reviewer: Fredy Wijaya 


[Impala-ASF-CR] IMPALA-6883: [DOCS] Refactor impala authorization doc

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10786 )

Change subject: IMPALA-6883: [DOCS] Refactor impala_authorization doc
..


Patch Set 5: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10786
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3df72adb25dcdcbc286934b048645f47d876b33d
Gerrit-Change-Number: 10786
Gerrit-PatchSet: 5
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 03 Jul 2018 19:27:05 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6883: [DOCS] Refactor impala authorization doc

2018-07-03 Thread Alex Rodoni (Code Review)
Alex Rodoni has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10786 )

Change subject: IMPALA-6883: [DOCS] Refactor impala_authorization doc
..


Patch Set 5: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/10786
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3df72adb25dcdcbc286934b048645f47d876b33d
Gerrit-Change-Number: 10786
Gerrit-PatchSet: 5
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 03 Jul 2018 19:16:47 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6883: [DOCS] Refactor impala authorization doc

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10786 )

Change subject: IMPALA-6883: [DOCS] Refactor impala_authorization doc
..


Patch Set 5:

Build started: https://jenkins.impala.io/job/gerrit-docs-submit/336/


--
To view, visit http://gerrit.cloudera.org:8080/10786
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3df72adb25dcdcbc286934b048645f47d876b33d
Gerrit-Change-Number: 10786
Gerrit-PatchSet: 5
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 03 Jul 2018 19:17:19 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6883: [DOCS] Refactor impala authorization doc

2018-07-03 Thread Alex Rodoni (Code Review)
Hello Fredy Wijaya, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10786

to look at the new patch set (#5).

Change subject: IMPALA-6883: [DOCS] Refactor impala_authorization doc
..

IMPALA-6883: [DOCS] Refactor impala_authorization doc

Change-Id: I3df72adb25dcdcbc286934b048645f47d876b33d
---
M docs/shared/impala_common.xml
M docs/topics/impala_authorization.xml
M docs/topics/impala_grant.xml
3 files changed, 488 insertions(+), 784 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/86/10786/5
--
To view, visit http://gerrit.cloudera.org:8080/10786
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I3df72adb25dcdcbc286934b048645f47d876b33d
Gerrit-Change-Number: 10786
Gerrit-PatchSet: 5
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-6034: Add CPU and scanned bytes limits per query

2018-07-03 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10415 )

Change subject: IMPALA-6034: Add CPU and scanned bytes limits per query
..


Patch Set 8:

(6 comments)

There's some more cleanup to do after my first pass...

http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/runtime/coordinator-backend-state.h
File be/src/runtime/coordinator-backend-state.h:

http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/runtime/coordinator-backend-state.h@198
PS8, Line 198: backend
For this fragment instance


http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/runtime/coordinator-backend-state.h@207
PS8, Line 207: /// and peak_mem_counter_ from profile_.
peak_mem_counter_ was removed...


http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/runtime/coordinator-backend-state.cc
File be/src/runtime/coordinator-backend-state.cc:

http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/runtime/coordinator-backend-state.cc@294
PS8, Line 294: 
resource_utilization_.Merge(entry.second->resource_utilization_);
I'm considering adjusting this to use an O(1) algorithm - Update() could return 
the delta in resource consumption and this could just be a Merge().


http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/runtime/coordinator.h
File be/src/runtime/coordinator.h:

http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/runtime/coordinator.h@177
PS8, Line 177: backend
This isn't accurate, could be fragment instances, backend or query.


http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/service/query-options.h
File be/src/service/query-options.h:

http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/service/query-options.h@135
PS8, Line 135:TQueryOptionLevel::ADVANCED)\
Need to fix tabs


http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/service/query-options.cc
File be/src/service/query-options.cc:

http://gerrit.cloudera.org:8080/#/c/10415/8/be/src/service/query-options.cc@682
PS8, Line 682: // Parse the scan bytes limit and validate it.
Useless comment.



--
To view, visit http://gerrit.cloudera.org:8080/10415
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4c6015e21da684bb9f33e236d71309dd4c178a20
Gerrit-Change-Number: 10415
Gerrit-PatchSet: 8
Gerrit-Owner: Mostafa Mokhtar 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Mostafa Mokhtar 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 19:12:08 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6034: Add CPU and scanned bytes limits per query

2018-07-03 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10415 )

Change subject: IMPALA-6034: Add CPU and scanned bytes limits per query
..


Patch Set 7:

(5 comments)

Addressed the previous comments and did various cleanup work.

http://gerrit.cloudera.org:8080/#/c/10415/6//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/10415/6//COMMIT_MSG@18
PS6, Line 18: Added tests for various permutations for CPU_LIMIT_S and
> Something I noticed after playing around: How about we rename this to have
Done


http://gerrit.cloudera.org:8080/#/c/10415/6/be/src/service/impala-server.cc
File be/src/service/impala-server.cc:

http://gerrit.cloudera.org:8080/#/c/10415/6/be/src/service/impala-server.cc@1940
PS6, Line 1940:   if (session_timeout <= 0) return;
Fixed the null coordinator dereference and added a cancellation test that 
exercises the new code path.


http://gerrit.cloudera.org:8080/#/c/10415/6/be/src/service/impala-server.cc@1970
PS6, Line 1970: is();
> This comment applies to my previous work here, but "expired" is actually ki
Done


http://gerrit.cloudera.org:8080/#/c/10415/6/be/src/service/impala-server.cc@1978
PS6, Line 1978: if (session_state.second->ref_count > 0) continue;
> We still need to erase the event - now we have two copies of it so we've en
Done


http://gerrit.cloudera.org:8080/#/c/10415/6/be/src/service/impala-server.cc@1979
PS6, Line 1979: // A session closed by other means is in the process of 
being removed, and it's
> nit: tabs
Done



--
To view, visit http://gerrit.cloudera.org:8080/10415
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4c6015e21da684bb9f33e236d71309dd4c178a20
Gerrit-Change-Number: 10415
Gerrit-PatchSet: 7
Gerrit-Owner: Mostafa Mokhtar 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Mostafa Mokhtar 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 19:07:03 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6034: Add CPU and scanned bytes limits per query

2018-07-03 Thread Tim Armstrong (Code Review)
Tim Armstrong has uploaded a new patch set (#7) to the change originally 
created by Mostafa Mokhtar. ( http://gerrit.cloudera.org:8080/10415 )

Change subject: IMPALA-6034: Add CPU and scanned bytes limits per query
..

IMPALA-6034: Add CPU and scanned bytes limits per query

To protect Impala from runaway queries add per query limits for CPU and
scanned bytes, limits are enforced via query options and applied to the entire
query, not per backend like mem_limit.
If a query exceeds any of the limits it will get cancelled.

Query profile is updated to include query wide and per backend metrics for Cpu
and scanned bytes.

Testing:
Added tests for various permutations for CPU_LIMIT_S and
SCAN_BYTES_LIMIT

Change-Id: I4c6015e21da684bb9f33e236d71309dd4c178a20
---
M be/src/runtime/coordinator-backend-state.cc
M be/src/runtime/coordinator-backend-state.h
M be/src/runtime/coordinator.cc
M be/src/runtime/coordinator.h
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
M be/src/service/query-options-test.cc
M be/src/service/query-options.cc
M be/src/service/query-options.h
M common/thrift/ImpalaInternalService.thrift
M common/thrift/ImpalaService.thrift
A 
testdata/workloads/functional-query/queries/QueryTest/query-resource-limits.test
M tests/query_test/test_cancellation.py
M tests/query_test/test_resource_limits.py
14 files changed, 338 insertions(+), 39 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/15/10415/7
--
To view, visit http://gerrit.cloudera.org:8080/10415
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I4c6015e21da684bb9f33e236d71309dd4c178a20
Gerrit-Change-Number: 10415
Gerrit-PatchSet: 7
Gerrit-Owner: Mostafa Mokhtar 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Mostafa Mokhtar 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-7236: Fix the parsing of ALLOW ERASURE CODED FILES

2018-07-03 Thread Taras Bobrovytsky (Code Review)
Taras Bobrovytsky has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10857 )

Change subject: IMPALA-7236: Fix the parsing of ALLOW_ERASURE_CODED_FILES
..


Patch Set 1: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/10857
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ife1e791541e3f4fed6bec00945390c7d7681e824
Gerrit-Change-Number: 10857
Gerrit-PatchSet: 1
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Taras Bobrovytsky 
Gerrit-Comment-Date: Tue, 03 Jul 2018 18:49:37 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6625: Skip computing parquet conjuncts for non-Parquet scans

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10704 )

Change subject: IMPALA-6625: Skip computing parquet conjuncts for non-Parquet 
scans
..


Patch Set 14:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/2772/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/10704
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9d6c26d42db090c8a15c602f6419ad6399c329e7
Gerrit-Change-Number: 10704
Gerrit-PatchSet: 14
Gerrit-Owner: Pooja Nilangekar 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Pooja Nilangekar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 18:45:58 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6625: Skip computing parquet conjuncts for non-Parquet scans

2018-07-03 Thread Pooja Nilangekar (Code Review)
Pooja Nilangekar has uploaded a new patch set (#14). ( 
http://gerrit.cloudera.org:8080/10704 )

Change subject: IMPALA-6625: Skip computing parquet conjuncts for non-Parquet 
scans
..

IMPALA-6625: Skip computing parquet conjuncts for non-Parquet scans

This change ensures that the planner computes parquet conjuncts
only when for scans containing parquet files. Additionally, it
also handles PARQUET_DICTIONARY_FILTERING and
PARQUET_READ_STATISTICS query options in the planner.

Testing was carried out independently on parquet and non-parquet
scans:
  1. Parquet scans were tested via the existing parquet-filtering
 planner test. Additionally, a new test
 [parquet-filtering-disabled] was added to ensure that the
 explain plan generated skips parquet predicates based on the
 query options.
  2. Non-parquet scans were tested manually to ensure that the
 functions to compute parquet conjucts were not invoked.
 Additional test cases were added to the parquet-filtering
 planner test to scan non parquet tables and ensure that the
 plans do not contain conjuncts based on parquet statistics.
  3. A parquet partition was added to the alltypesmixedformat
 table in the functional database. Planner tests were added
 to ensure that Parquet conjuncts are constructed only when
 the Parquet partition is included in the query.

Change-Id: I9d6c26d42db090c8a15c602f6419ad6399c329e7
---
M fe/src/main/java/org/apache/impala/catalog/FeCatalogUtils.java
M fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
M fe/src/test/java/org/apache/impala/planner/PlannerTest.java
M testdata/bin/create-load-data.sh
M testdata/bin/load-dependent-tables.sql
M 
testdata/workloads/functional-planner/queries/PlannerTest/constant-folding.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/fk-pk-join-detection.test
A 
testdata/workloads/functional-planner/queries/PlannerTest/parquet-filtering-disabled.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/parquet-filtering.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/resource-requirements.test
M testdata/workloads/functional-planner/queries/PlannerTest/tablesample.test
M testdata/workloads/functional-query/queries/QueryTest/mixed-format.test
M testdata/workloads/functional-query/queries/QueryTest/show-stats.test
M tests/query_test/test_rows_availability.py
14 files changed, 483 insertions(+), 34 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/04/10704/14
--
To view, visit http://gerrit.cloudera.org:8080/10704
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I9d6c26d42db090c8a15c602f6419ad6399c329e7
Gerrit-Change-Number: 10704
Gerrit-PatchSet: 14
Gerrit-Owner: Pooja Nilangekar 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Pooja Nilangekar 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] [DOCS] Clarification on admission control and DDL statements

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10829 )

Change subject: [DOCS] Clarification on admission control and DDL statements
..


Patch Set 3: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10829
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2e3e82bd34e88e7a13de1864aeb97f01023bc715
Gerrit-Change-Number: 10829
Gerrit-PatchSet: 3
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Balazs Jeszenszky 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 18:41:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] [DOCS] Clarification on admission control and DDL statements

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10829 )

Change subject: [DOCS] Clarification on admission control and DDL statements
..

[DOCS] Clarification on admission control and DDL statements

Removed the confusing example and paragraphs.

Change-Id: I2e3e82bd34e88e7a13de1864aeb97f01023bc715
Reviewed-on: http://gerrit.cloudera.org:8080/10829
Reviewed-by: Tim Armstrong 
Tested-by: Impala Public Jenkins 
---
M docs/topics/impala_admission.xml
1 file changed, 61 insertions(+), 85 deletions(-)

Approvals:
  Tim Armstrong: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10829
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I2e3e82bd34e88e7a13de1864aeb97f01023bc715
Gerrit-Change-Number: 10829
Gerrit-PatchSet: 4
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Balazs Jeszenszky 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-7236: Fix the parsing of ALLOW ERASURE CODED FILES

2018-07-03 Thread Tianyi Wang (Code Review)
Tianyi Wang has restored this change. ( http://gerrit.cloudera.org:8080/10857 )

Change subject: IMPALA-7236: Fix the parsing of ALLOW_ERASURE_CODED_FILES
..


Restored
--
To view, visit http://gerrit.cloudera.org:8080/10857
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: restore
Gerrit-Change-Id: Ife1e791541e3f4fed6bec00945390c7d7681e824
Gerrit-Change-Number: 10857
Gerrit-PatchSet: 1
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Taras Bobrovytsky 


[Impala-ASF-CR] IMPALA-7236: Fix the parsing of ALLOW ERASURE CODED FILES

2018-07-03 Thread Tianyi Wang (Code Review)
Tianyi Wang has abandoned this change. ( http://gerrit.cloudera.org:8080/10857 )

Change subject: IMPALA-7236: Fix the parsing of ALLOW_ERASURE_CODED_FILES
..


Abandoned
--
To view, visit http://gerrit.cloudera.org:8080/10857
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: abandon
Gerrit-Change-Id: Ife1e791541e3f4fed6bec00945390c7d7681e824
Gerrit-Change-Number: 10857
Gerrit-PatchSet: 1
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Taras Bobrovytsky 


[Impala-ASF-CR] IMPALA-7236: Fix the parsing of ALLOW ERASURE CODED FILES

2018-07-03 Thread Tianyi Wang (Code Review)
Tianyi Wang has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/10857


Change subject: IMPALA-7236: Fix the parsing of ALLOW_ERASURE_CODED_FILES
..

IMPALA-7236: Fix the parsing of ALLOW_ERASURE_CODED_FILES

This patch adds a missing "break" statement in a switch statement
changed by IMPALA-7102.
Also fixes an non-deterministic test case.

Change-Id: Ife1e791541e3f4fed6bec00945390c7d7681e824
---
M be/src/service/query-options.cc
M testdata/workloads/functional-query/queries/QueryTest/hdfs-erasure-coding.test
2 files changed, 2 insertions(+), 1 deletion(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/57/10857/1
--
To view, visit http://gerrit.cloudera.org:8080/10857
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ife1e791541e3f4fed6bec00945390c7d7681e824
Gerrit-Change-Number: 10857
Gerrit-PatchSet: 1
Gerrit-Owner: Tianyi Wang 


[Impala-ASF-CR] IMPALA-7212: Deprecate --use krpc flag and remove old DataStream services

2018-07-03 Thread Sailesh Mukil (Code Review)
Sailesh Mukil has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10835 )

Change subject: IMPALA-7212: Deprecate --use_krpc flag and remove old 
DataStream services
..


Patch Set 1:

(5 comments)

LGTM overall. Just some comments around the test code.

http://gerrit.cloudera.org:8080/#/c/10835/1/be/src/runtime/data-stream-test.cc
File be/src/runtime/data-stream-test.cc:

http://gerrit.cloudera.org:8080/#/c/10835/1/be/src/runtime/data-stream-test.cc@23
PS1, Line 23: #include "common/status.h"
I think I've found all the dead code in this patch, but if you want a 
reference, you can have a look at this patch to see if you need to remove 
anything else:

https://github.com/apache/impala/commit/ff86feaa67ff8bf703896e33d9a358e42739ae30#diff-0b7021d2a8bfaf517b243f11a53bcde8

I added that when I was making this test KRPC compatible.


http://gerrit.cloudera.org:8080/#/c/10835/1/be/src/runtime/data-stream-test.cc@128
PS1, Line 128: template  class DataStreamTestBase : public T {
 :  protected:
 :   virtual void SetUp() {}
 :   virtual void TearDown() {}
 : };
No longer needed.


http://gerrit.cloudera.org:8080/#/c/10835/1/be/src/runtime/data-stream-test.cc@134
PS1, Line 134: enum KrpcSwitch {
 :   USE_THRIFT,
 :   USE_KRPC
 : };
No longer needed.


http://gerrit.cloudera.org:8080/#/c/10835/1/be/src/runtime/data-stream-test.cc@139
PS1, Line 139: public DataStreamTestBase>
You can replace this with:
public testing::Test


http://gerrit.cloudera.org:8080/#/c/10835/1/tests/custom_cluster/test_rpc_exception.py
File tests/custom_cluster/test_rpc_exception.py:

http://gerrit.cloudera.org:8080/#/c/10835/1/tests/custom_cluster/test_rpc_exception.py@a73
PS1, Line 73:
> We should probably look into removing certain injected exception if it's no
Can't this happen without TransmitData() today?  Don't we also call it on 
ReportExecStatus() and CancelQueryFInstances() ?

https://github.com/apache/impala/blob/fb8ea6a9ccaedc41a71f6f6dcb367fc1facd73b6/be/src/runtime/backend-client.h#L58-L76



--
To view, visit http://gerrit.cloudera.org:8080/10835
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Icfed200751508478a3d728a917448f2dabfc67c3
Gerrit-Change-Number: 10835
Gerrit-PatchSet: 1
Gerrit-Owner: Michael Ho 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Comment-Date: Tue, 03 Jul 2018 18:36:28 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] [DOCS] Clarification on admission control and DDL statements

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10829 )

Change subject: [DOCS] Clarification on admission control and DDL statements
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-docs-submit/335/


--
To view, visit http://gerrit.cloudera.org:8080/10829
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2e3e82bd34e88e7a13de1864aeb97f01023bc715
Gerrit-Change-Number: 10829
Gerrit-PatchSet: 3
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Balazs Jeszenszky 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 18:32:45 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7237: handle hex digits in ParseSmaps()

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10853 )

Change subject: IMPALA-7237: handle hex digits in ParseSmaps()
..


Patch Set 2: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10853
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3dad846dafb25b414bee1858eb63f3eda31d59ac
Gerrit-Change-Number: 10853
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Comment-Date: Tue, 03 Jul 2018 18:29:30 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7237: handle hex digits in ParseSmaps()

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10853 )

Change subject: IMPALA-7237: handle hex digits in ParseSmaps()
..

IMPALA-7237: handle hex digits in ParseSmaps()

Testing:
Manual. Added some temporary logging to print out which branch it took
with each line and confirmed it took the right branch for a line
starting with 'f'.

Change-Id: I3dad846dafb25b414bee1858eb63f3eda31d59ac
Reviewed-on: http://gerrit.cloudera.org:8080/10853
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M be/src/util/mem-info.cc
1 file changed, 2 insertions(+), 1 deletion(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/10853
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I3dad846dafb25b414bee1858eb63f3eda31d59ac
Gerrit-Change-Number: 10853
Gerrit-PatchSet: 3
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 


[Impala-ASF-CR] IMPALA-7095: clean up scan node profiles

2018-07-03 Thread Michael Ho (Code Review)
Michael Ho has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10810 )

Change subject: IMPALA-7095: clean up scan node profiles
..


Patch Set 7:

(14 comments)

http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/data-source-scan-node.cc
File be/src/exec/data-source-scan-node.cc:

http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/data-source-scan-node.cc@359
PS7, Line 359: COUNTER_ADD(rows_read_counter_, rows_read);
Any idea why rows_returned_counter_ is updated at line 355 but 
rows_read_counter_ is updated inside this if-stmt ? Should they be updated at 
the same location for consistency ?


http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/hbase-scan-node.h
File be/src/exec/hbase-scan-node.h:

http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/hbase-scan-node.h@119
PS7, Line 119: time
wall clock time


http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/hdfs-scan-node-base.h
File be/src/exec/hdfs-scan-node-base.h:

http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/hdfs-scan-node-base.h@126
PS7, Line 126: throughput
nit: adding (bytes/sec) like below  in the definition of the field seems 
helpful when reading this comment.


http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/hdfs-scan-node-base.h@150
PS7, Line 150: WARN_UNUSED_RESULT;
nit: long line.


http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/hdfs-scan-node-base.h@521
PS7, Line 521: //
nit: ///


http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/kudu-scan-node-base.h
File be/src/exec/kudu-scan-node-base.h:

http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/kudu-scan-node-base.h@41
PS7, Line 41:
:   virtual Status Prepare(RuntimeState* state) override;
:   virtual Status Open(RuntimeState* state) override;
:   virtual Status GetNext(RuntimeState* state, RowBatch* 
row_batch, bool* eos)
:   override = 0;
:  protected:
:   virtual void DebugString(int indentation_level, 
std::stringstream* out) const override;
WARN_UNUSED_RESULT;


http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/kudu-scan-node.h
File be/src/exec/kudu-scan-node.h:

http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/kudu-scan-node.h@44
PS7, Line 44:   virtual Status Prepare(RuntimeState* state) override;
WARN_UNUSED_RESULT; Same below .


http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/scan-node.h
File be/src/exec/scan-node.h:

http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/scan-node.h@87
PS7, Line 87: the fragment execution thread
scanner threads


http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/scan-node.h@202
PS7, Line 202: used by
only used by


http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/scan-node.h@209
PS7, Line 209: class ScannerThreadState {
I suppose this class is mostly thread-safe as most fields are backed by 
atomics, right ? The states can both be updated from both fragment execution 
thread and scanner threads.


http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/scan-node.h@211
PS7, Line 211: state
Mind elaborating ? This is mostly for initializing the counters, right ?


http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/scan-node.h@214
PS7, Line 214: state
Open() creates the row batch queue, right ?


http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/scan-node.h@255
PS7, Line 255:private:
nit: blank line before private makes it more legible.


http://gerrit.cloudera.org:8080/#/c/10810/7/be/src/exec/scan-node.h@271
PS7, Line 271: boost::scoped_ptr<
std::unique_ptr<> Aren't we moving away from boost in the long run ?



--
To view, visit http://gerrit.cloudera.org:8080/10810
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I77286282d42e7764bfdf94c7ec47cec9d743f787
Gerrit-Change-Number: 10810
Gerrit-PatchSet: 7
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 03 Jul 2018 18:00:43 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-1760: Implement shutdown command

2018-07-03 Thread Michael Ho (Code Review)
Michael Ho has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10744 )

Change subject: IMPALA-1760: Implement shutdown command
..


Patch Set 8:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/10744/8/common/thrift/ImpalaInternalService.thrift
File common/thrift/ImpalaInternalService.thrift:

http://gerrit.cloudera.org:8080/#/c/10744/8/common/thrift/ImpalaInternalService.thrift@931
PS8, Line 931:   // Called to initiate shutdown of this backend.
 :   TRemoteShutdownResult RemoteShutdown(1:TRemoteShutdownParams 
params);
> Is it possible for me to port this to KRPC now? I guess when I wrote this p
Yes, I think if you cherry-pick that patch mentioned above, you should be able 
to add a new RPC in control_service.proto and define the handler similar to 
what we did in ReportExecStatus() RPC. Again, didn't mean to hold you back so 
if this patch goes in first, we can do the porting as a separate patch. Looking 
at this patch, this should hopefully be straightforward albeit may be some 
question about timeout which I may need to take a closer look.



--
To view, visit http://gerrit.cloudera.org:8080/10744
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4d5606ccfec84db4482c1e7f0f198103aad141a0
Gerrit-Change-Number: 10744
Gerrit-PatchSet: 8
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 17:23:55 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-1760: Implement shutdown command

2018-07-03 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10744 )

Change subject: IMPALA-1760: Implement shutdown command
..


Patch Set 8:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/10744/8/common/thrift/ImpalaInternalService.thrift
File common/thrift/ImpalaInternalService.thrift:

http://gerrit.cloudera.org:8080/#/c/10744/8/common/thrift/ImpalaInternalService.thrift@931
PS8, Line 931:   // Called to initiate shutdown of this backend.
 :   TRemoteShutdownResult RemoteShutdown(1:TRemoteShutdownParams 
params);
> It's unfortunate timing but we are moving away from Thrift to KRPC (https:/
Is it possible for me to port this to KRPC now? I guess when I wrote this patch 
we could run without KRPC but that's being removed now.



--
To view, visit http://gerrit.cloudera.org:8080/10744
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4d5606ccfec84db4482c1e7f0f198103aad141a0
Gerrit-Change-Number: 10744
Gerrit-PatchSet: 8
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 17:19:25 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7190: Remove unsupported format writer support

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10823 )

Change subject: IMPALA-7190: Remove unsupported format writer support
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/10823
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I821dc7495a901f1658daa500daf3791b386c7185
Gerrit-Change-Number: 10823
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 17:16:47 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7190: Remove unsupported format writer support

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10823 )

Change subject: IMPALA-7190: Remove unsupported format writer support
..


Patch Set 4:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/2771/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/10823
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I821dc7495a901f1658daa500daf3791b386c7185
Gerrit-Change-Number: 10823
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 17:16:48 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-1760: Implement shutdown command

2018-07-03 Thread Michael Ho (Code Review)
Michael Ho has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10744 )

Change subject: IMPALA-1760: Implement shutdown command
..


Patch Set 8:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/10744/8/common/thrift/ImpalaInternalService.thrift
File common/thrift/ImpalaInternalService.thrift:

http://gerrit.cloudera.org:8080/#/c/10744/8/common/thrift/ImpalaInternalService.thrift@931
PS8, Line 931:   // Called to initiate shutdown of this backend.
 :   TRemoteShutdownResult RemoteShutdown(1:TRemoteShutdownParams 
params);
It's unfortunate timing but we are moving away from Thrift to KRPC 
(https://gerrit.cloudera.org/#/c/10855/). I suppose we just need to port this 
over to KRPC later once this patch is merged unless the aforementioned patch 
happens to go in first.



--
To view, visit http://gerrit.cloudera.org:8080/10744
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4d5606ccfec84db4482c1e7f0f198103aad141a0
Gerrit-Change-Number: 10744
Gerrit-PatchSet: 8
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 17:06:33 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7163: Implement a state machine for the QueryState class

2018-07-03 Thread Sailesh Mukil (Code Review)
Sailesh Mukil has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10813 )

Change subject: IMPALA-7163: Implement a state machine for the QueryState class
..


Patch Set 4:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/10813/2/be/src/runtime/query-state.cc
File be/src/runtime/query-state.cc:

http://gerrit.cloudera.org:8080/#/c/10813/2/be/src/runtime/query-state.cc@437
PS2, Line 437:
 :   }
 :   VLOG_QUERY << "descriptor table for query=" << 
PrintId(query_id())
 :  << "\n" << desc_tbl_->DebugString();
 : 
 :   Status thread_create_status;
 :   DCHECK_GT(rpc_params_.fragment_ctxs.size(), 0);
 :   TPlanFragmentCtx* fragment_ctx = _params_.fragment_ctxs[0];
 :   int fragment_ctx_idx = 0;
 :   fragment_events_start_time_ = MonotonicStopWatch::Now();
 :   for (const TPlanFragmentInstanceCtx& instance_ctx: 
rpc_params_.fragment_instance_ctxs) {
 : // determine corresponding TPlanFragmentCtx
 : if (fragment_ctx->fragment.idx != instance_ctx.fragment_idx) 
{
 :   ++fragment_ctx_idx;
 :
> My point was shouldn't the counting barrier take care of that for us? I.e.
CountingBarrier::Notify() does not set the promise unless it's the one to 
reduce the count to 0 (or in other words only the last fragment instance to 
complete preparing would get to set the promise).

So if a first fragment instance hits an error in Prepare() while the others are 
still running Prepare(), then it won't be able to set the error unless it has 
some way of communicating its error to the other fragment instances, or it 
waits until the rest of them are done and then sets the error.

So, we'd need some other mechanism of allowing us to both:
1) Save the first error, and
2) Wait for all the FInstances to complete Prepare()

Unless I'm missing something?


http://gerrit.cloudera.org:8080/#/c/10813/2/be/src/runtime/query-state.cc@586
PS2, Line 586: 
 :
 :
 :
 :
 :
> Right, the references are held by FIS for resources shared across the query
As we spoke in person, I will roll this part out as a separate patch to make 
reviewing this a bit easier.



--
To view, visit http://gerrit.cloudera.org:8080/10813
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iec5670a7db83ecae4656d7bb2ea372d3767ba7fe
Gerrit-Change-Number: 10813
Gerrit-PatchSet: 4
Gerrit-Owner: Sailesh Mukil 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Comment-Date: Tue, 03 Jul 2018 17:00:35 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-5981: [DOCS] Documented SET=""

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10816 )

Change subject: IMPALA-5981: [DOCS] Documented SET=""
..

IMPALA-5981: [DOCS] Documented SET=""

Also, refactored the Impala SET doc and moved the command SET to
the Impala Shell Commands doc.

Change-Id: I7211405d5cc0a548c05ea5218798591873c14417
Reviewed-on: http://gerrit.cloudera.org:8080/10816
Reviewed-by: Tim Armstrong 
Tested-by: Impala Public Jenkins 
---
M docs/topics/impala_set.xml
M docs/topics/impala_shell_commands.xml
2 files changed, 105 insertions(+), 225 deletions(-)

Approvals:
  Tim Armstrong: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10816
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I7211405d5cc0a548c05ea5218798591873c14417
Gerrit-Change-Number: 10816
Gerrit-PatchSet: 4
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-5981: [DOCS] Documented SET=""

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10816 )

Change subject: IMPALA-5981: [DOCS] Documented SET=""
..


Patch Set 3: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10816
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7211405d5cc0a548c05ea5218798591873c14417
Gerrit-Change-Number: 10816
Gerrit-PatchSet: 3
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 16:52:41 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7163: Implement a state machine for the QueryState class

2018-07-03 Thread Sailesh Mukil (Code Review)
Sailesh Mukil has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10813 )

Change subject: IMPALA-7163: Implement a state machine for the QueryState class
..


Patch Set 4:

(19 comments)

http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h
File be/src/runtime/query-state.h:

http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h@65
PS3, Line 65: state denot
> I think we should avoid using the term "query state" given that this class
Done


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h@65
PS3, Line 65: dExecState. We
> Backend
Done


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h@68
PS3, Line 68: to the ERROR or CANCELLED
> are there multiple error states? corresponding implies that. Or are you inc
I reworded it to be clearer now. It can only either be ERROR or CANCELLED.


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h@72
PS3, Line 72:  This is to simplify the
: /// query lifecycle so that Prepare() is always completed befor
> Let's not state in terms of past code. How about:
Yes, technically Cancel() or PublishFilter(). So I've re-worded it to that.


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h@149
PS3, Line 149: s belonging to this query. Each instance receives its own 
execution
> put another way, I guess this blocks until threads have exited INITIALIZED?
Yes, I've added that wording to make it clearer.

The reason I kept MonitorFInstances() separately is because the status 
reporting will be driven from there after IMPALA-2990, making it more readable 
than keeping it in StartFInstances().


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h@155
PS3, Line 155:   /// Monitors all the fragment instances and updates the 
BackendExecState according
> it it required that this is called after StartFInstances()?
Yes, I've added that to the comments now.


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h@210
PS3, Line 210: return strings::Substitute("Status: $0 (fra
> Is that needed in C++?  You'd do it for C but I think C++ effectively inclu
You're right. Removed it.


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h@213
PS3, Line 213:
> nit: here and below you could delete the word "Function" since that's obvio
Done


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h@266
PS3, Line 266: dExecS
> Isn't Cancel() called in case of error as well? If so, does this mean we go
Yes, that's true. I've reworded it to specify only CancelQueryFInstances() RPC 
and a coordinator directed CANCEL from a response to ReportExecStatus() RPC.


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h@271
PS3, Line 271: repare(). Implies
> how about backend_exec_state_ for consistency?
Done


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h@284
PS3, Line 284: u
> here and elsewhere: given BackendExecState is a scalar (enum) value, should
Done


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h@286
PS3, Line 286: or E
> from
Done


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h@305
PS3, Line 305: ar* Ba
> overall status
Done


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.h@306
PS3, Line 306:
> otherwise it's the status of the first non-OK FIS?
Yes, added that to the comment.

Also, I changed this to a FInstanceStatus. I wanted to do that initially, but I 
forgot to.


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.cc
File be/src/runtime/query-state.cc:

http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.cc@201
PS3, Line 201: const char* QueryState::BackendExecStateToString(const 
BackendExecState& state) {
 :   static const unordered_map 
exec_state_to_str{
 :   {BackendExecState::INITIALIZED, "INITIALIZED"},
 :   {BackendExecState::PREPARED, "PREPARED"}, 
{BackendExecState::FINISHED, "FINISHED"},
 :   {BackendExecState::CANCELLED, "CANCELLED"}, 
{BackendExecState::ERROR, "ERROR"}};
 :   retu
> given that the function is inlined this is probably unnecessary. dead code
Good point. Done.


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.cc@226
PS3, Line 226:  << BackendExecStateToString(old_state) << ")";
> why doesn't that include FINISHED? This check doesn't seem very intuitive t
After removing the 2 states I had before, this seems pretty unnecessary now. I 
removed it.


http://gerrit.cloudera.org:8080/#/c/10813/3/be/src/runtime/query-state.cc@266
PS3, Line 266: ested cancellati
> how about UpdateBackendExecState (since QueryState is the name of the class
Done



[Impala-ASF-CR] IMPALA-7163: Implement a state machine for the QueryState class

2018-07-03 Thread Sailesh Mukil (Code Review)
Hello Michael Ho, Joe McDonnell, Dan Hecht,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/10813

to look at the new patch set (#4).

Change subject: IMPALA-7163: Implement a state machine for the QueryState class
..

IMPALA-7163: Implement a state machine for the QueryState class

This patch adds a state machine for the QueryState class. The motivation
behind this patch is to make the query lifecycle from the point of
view of an executor much easier to reason about and this patch is key
for a follow on patch for IMPALA-2990 where the status reporting will
be per-query rather than per-fragment-instance. Currently, the state
machine provides no other purpose, and it will mostly be used for
IMPALA-2990.

We introduce 7 possible states for the QueryState which include 3
terminal states (FINISHED, CANCELLED and ERROR) and 4 non-terminal
states (INITIALIZED, PREPARED). The transition from one state to the
next is always handled by a single thread which is also the QueryState
thread. This thread will additionally bear the purpose of sending
periodic updates after IMPALA-2990, which is the primary reason behind
having only this thread modify the state of the query.

2 counting barriers are introduced which are used to indicate how many
fragment instances have finished Preparing and Executing. We
use CountingBarriers to allow fragment instance threads to communicate
their completion of their respective states or errors with the QueryState
thread. Due to this design, it's possible for the fragment instances to
have collectively moved on to future states while the query state thread
lags behind in realizing that. However, that's not a concern for us
since we only care about showing a unified view of what's happening on
this executor to the coordinator (post IMPALA-2990).

The status reporting protocol has not been changed and remains exactly
as it was.

Testing: Ran 'core' and 'exhaustive' tests. TODO: Run stress tests.

Future related work:
1) IMPALA-2990: Make status reporting per-query.
2) Try to logically align the FIS states with the QueryState states.
3) Consider mirroring the QueryState state machine to CoordinatorBackendState

Change-Id: Iec5670a7db83ecae4656d7bb2ea372d3767ba7fe
---
M be/src/runtime/coordinator.cc
M be/src/runtime/fragment-instance-state.cc
M be/src/runtime/query-exec-mgr.cc
M be/src/runtime/query-state.cc
M be/src/runtime/query-state.h
5 files changed, 328 insertions(+), 38 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/13/10813/4
--
To view, visit http://gerrit.cloudera.org:8080/10813
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Iec5670a7db83ecae4656d7bb2ea372d3767ba7fe
Gerrit-Change-Number: 10813
Gerrit-PatchSet: 4
Gerrit-Owner: Sailesh Mukil 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Sailesh Mukil 


[Impala-ASF-CR] IMPALA-5981: [DOCS] Documented SET=""

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10816 )

Change subject: IMPALA-5981: [DOCS] Documented SET=""
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-docs-submit/334/


--
To view, visit http://gerrit.cloudera.org:8080/10816
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7211405d5cc0a548c05ea5218798591873c14417
Gerrit-Change-Number: 10816
Gerrit-PatchSet: 3
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 16:43:35 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7224. Improve performance of UpdateCatalogMetrics

2018-07-03 Thread Tim Armstrong (Code Review)
Tim Armstrong has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10846 )

Change subject: IMPALA-7224. Improve performance of UpdateCatalogMetrics
..

IMPALA-7224. Improve performance of UpdateCatalogMetrics

This function is called after every DDL query, and was implemented by
fetching the entire list of table names, even though only the length
of that list was needed. In workloads with millions of tables, this
could add several seconds of overhead following even simple requests
like 'USE' or 'DESCRIBE'.

I tested a backported version of this patch against one such workload.
It reduced the time taken for a simple DESCRIBE query from 12-14sec
down to about 40ms. I also tested locally that the metrics on impalad
were still updated by DDL operations.

Change-Id: Ic5467adbce1e760ff93996925db5611748efafc0
Reviewed-on: http://gerrit.cloudera.org:8080/10846
Reviewed-by: Vuk Ercegovac 
Reviewed-by: Tim Armstrong 
Tested-by: Impala Public Jenkins 
---
M be/src/service/frontend.cc
M be/src/service/frontend.h
M be/src/service/impala-server.cc
M common/thrift/Frontend.thrift
M fe/src/main/java/org/apache/impala/service/Frontend.java
M fe/src/main/java/org/apache/impala/service/JniFrontend.java
6 files changed, 40 insertions(+), 11 deletions(-)

Approvals:
  Vuk Ercegovac: Looks good to me, approved
  Tim Armstrong: Looks good to me, but someone else must approve
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10846
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Ic5467adbce1e760ff93996925db5611748efafc0
Gerrit-Change-Number: 10846
Gerrit-PatchSet: 2
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 


[Impala-ASF-CR] IMPALA-7237: handle hex digits in ParseSmaps()

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10853 )

Change subject: IMPALA-7237: handle hex digits in ParseSmaps()
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/10853
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3dad846dafb25b414bee1858eb63f3eda31d59ac
Gerrit-Change-Number: 10853
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Comment-Date: Tue, 03 Jul 2018 15:05:32 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7237: handle hex digits in ParseSmaps()

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10853 )

Change subject: IMPALA-7237: handle hex digits in ParseSmaps()
..


Patch Set 2:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/2770/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/10853
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3dad846dafb25b414bee1858eb63f3eda31d59ac
Gerrit-Change-Number: 10853
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Comment-Date: Tue, 03 Jul 2018 15:05:33 +
Gerrit-HasComments: No


[Impala-ASF-CR] [DOCS] Clarification on admission control and DDL statements

2018-07-03 Thread Balazs Jeszenszky (Code Review)
Balazs Jeszenszky has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10829 )

Change subject: [DOCS] Clarification on admission control and DDL statements
..


Patch Set 3:

Thanks Tim for clarifying the session-thing. I was thinking in impala-shell 
context only.


--
To view, visit http://gerrit.cloudera.org:8080/10829
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2e3e82bd34e88e7a13de1864aeb97f01023bc715
Gerrit-Change-Number: 10829
Gerrit-PatchSet: 3
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Balazs Jeszenszky 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 10:55:44 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7095: clean up scan node profiles

2018-07-03 Thread Zoltan Borok-Nagy (Code Review)
Zoltan Borok-Nagy has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10810 )

Change subject: IMPALA-7095: clean up scan node profiles
..


Patch Set 7: Code-Review+1


--
To view, visit http://gerrit.cloudera.org:8080/10810
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I77286282d42e7764bfdf94c7ec47cec9d743f787
Gerrit-Change-Number: 10810
Gerrit-PatchSet: 7
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 03 Jul 2018 09:37:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4784: Remove InProcessStatestore

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/10843 )

Change subject: IMPALA-4784: Remove InProcessStatestore
..

IMPALA-4784: Remove InProcessStatestore

InProcessStatestore was only used by statestore-test, expr-test and
session-expiry-test. With a slight refactor of the Statestore class,
InProcessStatestore becomes obsolete.

This patch moves the ownership of the ThriftServer into the Statestore
class and Statestore::Init() now takes a 'port' parameter instead of
using the FLAGS_state_store_port directly.

We also remove the InProcessStatestore completely. A follow on patch will
get rid of the InProcessImpalaServer too (IMPALA-6013)

Testing: Ran 'core' tests.

Change-Id: I2621873e593b36c9612a6402ac6c5d8e3b49cde9
Reviewed-on: http://gerrit.cloudera.org:8080/10843
Reviewed-by: Sailesh Mukil 
Tested-by: Impala Public Jenkins 
---
M be/src/exprs/expr-test.cc
M be/src/service/session-expiry-test.cc
M be/src/statestore/statestore-test.cc
M be/src/statestore/statestore.cc
M be/src/statestore/statestore.h
M be/src/statestore/statestored-main.cc
M be/src/testutil/in-process-servers.cc
M be/src/testutil/in-process-servers.h
8 files changed, 95 insertions(+), 125 deletions(-)

Approvals:
  Sailesh Mukil: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/10843
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I2621873e593b36c9612a6402ac6c5d8e3b49cde9
Gerrit-Change-Number: 10843
Gerrit-PatchSet: 7
Gerrit-Owner: Sailesh Mukil 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-4784: Remove InProcessStatestore

2018-07-03 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10843 )

Change subject: IMPALA-4784: Remove InProcessStatestore
..


Patch Set 6: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/10843
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2621873e593b36c9612a6402ac6c5d8e3b49cde9
Gerrit-Change-Number: 10843
Gerrit-PatchSet: 6
Gerrit-Owner: Sailesh Mukil 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Jul 2018 08:21:47 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-3040: Remove cache directive before dropping a table

2018-07-03 Thread Bharath Vissapragada (Code Review)
Bharath Vissapragada has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10792 )

Change subject: IMPALA-3040: Remove cache directive before dropping a table
..


Patch Set 2:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/10792/2/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/10792/2/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@1317
PS2, Line 1317:   if (db != null) {
I don't think this is the right approach. Firstly this is still prone to races. 
Secondly, if the dropDatabase() in L1324 fails for some reason, we'd have 
unnecessarily uncached the tables eagerly.

How about fixing the HdfsPartition.dropPartition() to also cleanup the 
partition directive? (refer to my example in the CR comment).



--
To view, visit http://gerrit.cloudera.org:8080/10792
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id7701a499405e961456adea63f3592b43bd69170
Gerrit-Change-Number: 10792
Gerrit-PatchSet: 2
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Comment-Date: Tue, 03 Jul 2018 07:47:12 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-3040: Remove cache directive before dropping a table

2018-07-03 Thread Bharath Vissapragada (Code Review)
Bharath Vissapragada has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/10792 )

Change subject: IMPALA-3040: Remove cache directive before dropping a table
..


Patch Set 2:

Thanks for the explanation. I guess I understand the issue now.

The basic problem here seems to be that  HdfsTable.dropPartition() does not 
clean up cache directives of the partitions. We do it inside  
CatalogOpExecutor#alterTableDropPartition()

  private Table alterTableDropPartition(Table tbl,.)
  ..
  if (part.isMarkedCached()) {
HdfsCachingUtil.removePartitionCacheDirective(part);
  }
  
  }

but since these partitions are dropped already using Hive, Impala clears the 
state with dropPartition() and uncacheTable() later does not help anymore.  
Here is a simple repro of this issue.

// Create a partitioned cached table and add some partitions
impala> create table cached_tbl_part (i int) partitioned by (j int) cached in 
'testPool'
impala> insert into cached_tbl_part (i,j) select 1, 2;

// Make sure cache directives are created.
$ hdfs cacheadmin -listDirectives | grep cached_tbl_part
277 testPool1 never   /test-warehouse/cached_tbl_part
278 testPool1 never   /test-warehouse/cached_tbl_part/j=2  <

// Drop the partition from hive
hive> alter table cached_tbl_part drop partition (j=2);

// Refresh the table from Impala
impala> refresh cached_tbl_part;

// Cache directives still exist
hdfs cacheadmin -listDirectives | grep cached_tbl_part

277 testPool1 never   /test-warehouse/cached_tbl_part
278 testPool1 never   /test-warehouse/cached_tbl_part/j=2   <--- should 
have been dropped

// Drop the table from Impala

impala> drop table cached_tbl_part;

$ hdfs cacheadmin -listDirectives | grep cached_tbl_part
278 testPool1 never   /test-warehouse/cached_tbl_part/j=2   <--- 
Table's directive is dropped but the partition still remains.


--
To view, visit http://gerrit.cloudera.org:8080/10792
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id7701a499405e961456adea63f3592b43bd69170
Gerrit-Change-Number: 10792
Gerrit-PatchSet: 2
Gerrit-Owner: Tianyi Wang 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Comment-Date: Tue, 03 Jul 2018 07:43:37 +
Gerrit-HasComments: No