[Impala-ASF-CR] IMPALA-8181: Abbreviate row counts in EXPLAIN

2019-02-19 Thread Bharath Vissapragada (Code Review)
Bharath Vissapragada has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12438 )

Change subject: IMPALA-8181: Abbreviate row counts in EXPLAIN
..


Patch Set 6: Code-Review+2

(2 comments)

Feel free to carry +2 after the test run finishes.

http://gerrit.cloudera.org:8080/#/c/12438/5/fe/src/main/java/org/apache/impala/common/PrintUtils.java
File fe/src/main/java/org/apache/impala/common/PrintUtils.java:

http://gerrit.cloudera.org:8080/#/c/12438/5/fe/src/main/java/org/apache/impala/common/PrintUtils.java@66
PS5, Line 66: if (value >= GIGA) return new 
DecimalFormat(".00G").format(result / GIGA);
> Created an "exact cardinality" method to format the values as suggested. Th
Thanks


http://gerrit.cloudera.org:8080/#/c/12438/6/fe/src/main/java/org/apache/impala/common/PrintUtils.java
File fe/src/main/java/org/apache/impala/common/PrintUtils.java:

http://gerrit.cloudera.org:8080/#/c/12438/6/fe/src/main/java/org/apache/impala/common/PrintUtils.java@156
PS6, Line 156: ,
this is cool, didn't know about this grouping separator.



--
To view, visit http://gerrit.cloudera.org:8080/12438
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I08faaa9ad7b5ed42dcd7a15a333e8734bb45f10c
Gerrit-Change-Number: 12438
Gerrit-PatchSet: 6
Gerrit-Owner: Paul Rogers 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Comment-Date: Wed, 20 Feb 2019 07:12:18 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7917 (Part 1): Decouple Sentry from Impala

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12020 )

Change subject: IMPALA-7917 (Part 1): Decouple Sentry from Impala
..


Patch Set 13: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/12020
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If1fd1df0b38ddd7cfa41299e95f5827f8a9e9c1f
Gerrit-Change-Number: 12020
Gerrit-PatchSet: 13
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Anonymous Coward (402)
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Comment-Date: Wed, 20 Feb 2019 06:43:35 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7917 (Part 1): Decouple Sentry from Impala

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12020 )

Change subject: IMPALA-7917 (Part 1): Decouple Sentry from Impala
..


Patch Set 13:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3801/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/12020
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If1fd1df0b38ddd7cfa41299e95f5827f8a9e9c1f
Gerrit-Change-Number: 12020
Gerrit-PatchSet: 13
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Anonymous Coward (402)
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Comment-Date: Wed, 20 Feb 2019 06:43:36 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8214: Fix bad plan in load nested.py

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12519 )

Change subject: IMPALA-8214: Fix bad plan in load_nested.py
..


Patch Set 4: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/12519
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I55e0ca09590a90ba530efe4e8f8bf587dde3eeeb
Gerrit-Change-Number: 12519
Gerrit-PatchSet: 4
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Wed, 20 Feb 2019 05:46:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8214: Fix bad plan in load nested.py

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/12519 )

Change subject: IMPALA-8214: Fix bad plan in load_nested.py
..

IMPALA-8214: Fix bad plan in load_nested.py

The previous plan had the larger input on the build side of the join and
did a broadcast join, which is very suboptimal.

This speeds up data loading on my minicluster - 18s vs 31s and has a
more significant impact on a real cluster, where queries execute
much faster, the memory requirement is significantly reduced and
the data loading can potentially be broken up into fewer chunks.

I also considered computing stats on the table to let Impala generate
the same plan, but this achieves the same goal more efficiently.

Testing:
Run core tests. Resource estimates in planner tests changed slightly
because of the different distribution of data.

Change-Id: I55e0ca09590a90ba530efe4e8f8bf587dde3eeeb
Reviewed-on: http://gerrit.cloudera.org:8080/12519
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M testdata/bin/load_nested.py
M 
testdata/workloads/functional-planner/queries/PlannerTest/mt-dop-validation.test
M testdata/workloads/functional-planner/queries/PlannerTest/tpch-nested.test
3 files changed, 12 insertions(+), 12 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/12519
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I55e0ca09590a90ba530efe4e8f8bf587dde3eeeb
Gerrit-Change-Number: 12519
Gerrit-PatchSet: 5
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 7: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/3797/


--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 7
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 20 Feb 2019 05:24:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8158: Retrieve thrift profiles through Impyla

2019-02-19 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12530 )

Change subject: IMPALA-8158: Retrieve thrift profiles through Impyla
..


Patch Set 2:

(1 comment)

We need to bump impyla_version in infra/python/deps/compiled-requirements.txt, 
right? Otherwise looks good.

http://gerrit.cloudera.org:8080/#/c/12530/2/tests/query_test/test_cancellation.py
File tests/query_test/test_cancellation.py:

http://gerrit.cloudera.org:8080/#/c/12530/2/tests/query_test/test_cancellation.py@223
PS2, Line 223: assert any(client.get_state(handle) == 'RUNNING_STATE' or 
sleep(1)
This is cool



--
To view, visit http://gerrit.cloudera.org:8080/12530
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I769a99f0843297dd2b20f2f5b1a9046c97bb131e
Gerrit-Change-Number: 12530
Gerrit-PatchSet: 2
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 20 Feb 2019 05:02:41 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6503: Support reading complex types from ORC

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12168 )

Change subject: IMPALA-6503: Support reading complex types from ORC
..


Patch Set 16:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3799/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/12168
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I244dc9d2b3e425393f90e45632cb8cdbea6cf790
Gerrit-Change-Number: 12168
Gerrit-PatchSet: 16
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Wed, 20 Feb 2019 04:26:09 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8191: Wait for additional breakpad processes during test

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12501 )

Change subject: IMPALA-8191: Wait for additional breakpad processes during test
..


Patch Set 3: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/12501
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia4dcc5fecb9b5f38ae1504aae40f099837cf1bca
Gerrit-Change-Number: 12501
Gerrit-PatchSet: 3
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Wed, 20 Feb 2019 04:06:10 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8191: Wait for additional breakpad processes during test

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/12501 )

Change subject: IMPALA-8191: Wait for additional breakpad processes during test
..

IMPALA-8191: Wait for additional breakpad processes during test

The Breakpad signal handler forks off a process to write a minidump.
During the breakpad tests we send signals to the Impala daemons and then
wait for all processes to go away. Prior to this change we did this by
waiting on the PID returned by process.get_pid(). It is determined by
iterating over psutil.get_pid_list() which is an ordered list of PIDs
running on the system. We return the first process in the list with a
matching command line. In cases where the PID space rolled over, this
could have been the forked off breakpad process and we'd wait on that
one. During the subsequent check that all processes are indeed gone, we
could then pick up the original Impala daemon that had forked off to
write the minidump and was still in the process of shutting down.

To fix this, we wait for every process twice. Processes are identified
by their command and iterating through them twice makes sure we catch
both the original daemon and it's breakpad child.

This change also contains improvements to the logging of processes in
our tests. This should make it easier to identify similar issues in the
future.

Testing: I ran the breakpad tests in exhaustive mode. I didn't try to
exercise it around a PID roll-over, but we shouldn't see the issue in
IMPALA-8191 again.

Change-Id: Ia4dcc5fecb9b5f38ae1504aae40f099837cf1bca
Reviewed-on: http://gerrit.cloudera.org:8080/12501
Reviewed-by: Lars Volker 
Tested-by: Impala Public Jenkins 
---
M tests/common/impala_cluster.py
M tests/custom_cluster/test_breakpad.py
2 files changed, 55 insertions(+), 22 deletions(-)

Approvals:
  Lars Volker: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/12501
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Ia4dcc5fecb9b5f38ae1504aae40f099837cf1bca
Gerrit-Change-Number: 12501
Gerrit-PatchSet: 4
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Philip Zeyliger 


[Impala-ASF-CR] IMPALA-8158: Retrieve thrift profiles through Impyla

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12530 )

Change subject: IMPALA-8158: Retrieve thrift profiles through Impyla
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2168/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/12530
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I769a99f0843297dd2b20f2f5b1a9046c97bb131e
Gerrit-Change-Number: 12530
Gerrit-PatchSet: 2
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 20 Feb 2019 02:46:28 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7450. Set thread name during refresh/load operations

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11228 )

Change subject: IMPALA-7450. Set thread name during refresh/load operations
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2167/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11228
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic7c850d6bb2eedc375ee567c19eb17add335f60c
Gerrit-Change-Number: 11228
Gerrit-PatchSet: 3
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Comment-Date: Wed, 20 Feb 2019 02:29:59 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8158: Retrieve thrift profiles through Impyla

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12530 )

Change subject: IMPALA-8158: Retrieve thrift profiles through Impyla
..


Patch Set 2:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/12530/2/tests/common/impala_connection.py
File tests/common/impala_connection.py:

http://gerrit.cloudera.org:8080/#/c/12530/2/tests/common/impala_connection.py@352
PS2, Line 352:
flake8: E251 unexpected spaces around keyword / parameter equals


http://gerrit.cloudera.org:8080/#/c/12530/2/tests/common/impala_connection.py@352
PS2, Line 352:
flake8: E251 unexpected spaces around keyword / parameter equals


http://gerrit.cloudera.org:8080/#/c/12530/2/tests/query_test/test_cancellation.py
File tests/query_test/test_cancellation.py:

http://gerrit.cloudera.org:8080/#/c/12530/2/tests/query_test/test_cancellation.py@226
PS2, Line 226: r
flake8: F841 local variable 'resp' is assigned to but never used



--
To view, visit http://gerrit.cloudera.org:8080/12530
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I769a99f0843297dd2b20f2f5b1a9046c97bb131e
Gerrit-Change-Number: 12530
Gerrit-PatchSet: 2
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 20 Feb 2019 02:18:45 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8158: Retrieve thrift profiles through Impyla

2019-02-19 Thread Lars Volker (Code Review)
Lars Volker has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/12530


Change subject: IMPALA-8158: Retrieve thrift profiles through Impyla
..

IMPALA-8158: Retrieve thrift profiles through Impyla

This change uses an updated version of Impyla to retrieve thrift
profiles through the HS2 api. The companion change in Impyla is PR #333:
https://github.com/cloudera/impyla/pull/333

Unfortunately, some of the current usages of get_thrift_profile rely on
the Beeswax query states and the ImpylaHS2Connection does not have the
required functionality yet. We will have to update these in a future
change, once we unified the query states.

This change also adds a self-contained test for IMPALA-2063

Change-Id: I769a99f0843297dd2b20f2f5b1a9046c97bb131e
---
M tests/common/impala_connection.py
M tests/common/impala_service.py
M tests/common/impala_test_suite.py
M tests/query_test/test_cancellation.py
M tests/query_test/test_observability.py
5 files changed, 72 insertions(+), 71 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/30/12530/2
--
To view, visit http://gerrit.cloudera.org:8080/12530
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I769a99f0843297dd2b20f2f5b1a9046c97bb131e
Gerrit-Change-Number: 12530
Gerrit-PatchSet: 2
Gerrit-Owner: Lars Volker 


[Impala-ASF-CR] IMPALA-7450. Set thread name during refresh/load operations

2019-02-19 Thread Paul Rogers (Code Review)
Paul Rogers has uploaded a new patch set (#3) to the change originally created 
by Todd Lipcon. ( http://gerrit.cloudera.org:8080/11228 )

Change subject: IMPALA-7450. Set thread name during refresh/load operations
..

IMPALA-7450. Set thread name during refresh/load operations

This adds a small utility class for annotating the current thread's name
during potentially long-running operations such as refresh/load. With
this change, jstack output now includes useful thread names like:

During startup:
  "main [invalidating metadata - 128/428 dbs complete]"

While loading a fresh table:
  "pool-4-thread-12 [Loading metadata for: foo_db.foo_table] [Loading
   metadata for all partition(s) of foo_db.foo_table]"

Pool refreshing metadata for a particular path:
  "pool-23-thread-5 [Refreshing file metadata for path:
   hdfs://nameservice1/path/to/partdir..."

Tests: Verified the patch manually by jstacking a catalogd while performing
some workload. Also added a simple unit test to verify the thread renaming
behavior.

Change-Id: Ic7c850d6bb2eedc375ee567c19eb17add335f60c
---
M fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java
M fe/src/main/java/org/apache/impala/catalog/HdfsTable.java
M fe/src/main/java/org/apache/impala/catalog/TableLoader.java
A fe/src/main/java/org/apache/impala/util/ThreadNameAnnotator.java
A fe/src/test/java/org/apache/impala/util/ThreadNameAnnotatorTest.java
5 files changed, 291 insertions(+), 43 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/28/11228/3
--
To view, visit http://gerrit.cloudera.org:8080/11228
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ic7c850d6bb2eedc375ee567c19eb17add335f60c
Gerrit-Change-Number: 11228
Gerrit-PatchSet: 3
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Todd Lipcon 


[Impala-ASF-CR] IMPALA-7450. Set thread name during refresh/load operations

2019-02-19 Thread Paul Rogers (Code Review)
Paul Rogers has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11228 )

Change subject: IMPALA-7450. Set thread name during refresh/load operations
..


Patch Set 2:

(6 comments)

Addressed review comments and rebased.

http://gerrit.cloudera.org:8080/#/c/11228/2//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/11228/2//COMMIT_MSG@24
PS2, Line 24: This patch is tricky to automate tests for, but I verified it 
manually
: by jstacking a catalogd while performing some workload. Also 
added a
: simple unit test to verify the thread renaming behavior
> Can be removed I guess.
Done


http://gerrit.cloudera.org:8080/#/c/11228/2/fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java
File fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java:

http://gerrit.cloudera.org:8080/#/c/11228/2/fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java@301
PS2, Line 301: long end;
> Annotate here? This is one of those common interesting entry points for tab
Done


http://gerrit.cloudera.org:8080/#/c/11228/2/fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java@2185
PS2, Line 2185:   private TGetPartialCatalogObjectResponse 
doGetPartialCatalogObject(
> This is another interesting entry point RPC for catalog v2 stuff. Add some
Done


http://gerrit.cloudera.org:8080/#/c/11228/2/fe/src/test/java/org/apache/impala/util/ThreadNameAnnotatorTest.java
File fe/src/test/java/org/apache/impala/util/ThreadNameAnnotatorTest.java:

http://gerrit.cloudera.org:8080/#/c/11228/2/fe/src/test/java/org/apache/impala/util/ThreadNameAnnotatorTest.java@24
PS2, Line 24: public class ThreadNameAnnotatorTest {
> great test :-)
Thanks. Handy little item from the bag-o-tricks...


http://gerrit.cloudera.org:8080/#/c/11228/2/fe/src/test/java/org/apache/impala/util/ThreadNameAnnotatorTest.java@44
PS2, Line 44: wait();
> nit: I hope nothing hangs here forever due to a faulty test? Add a largish
Done


http://gerrit.cloudera.org:8080/#/c/11228/2/fe/src/test/java/org/apache/impala/util/ThreadNameAnnotatorTest.java@107
PS2, Line 107:   public void testExternalRename() throws InterruptedException {
> nit: Add a doc of what it does? Probably difficult to understand without an
Done



--
To view, visit http://gerrit.cloudera.org:8080/11228
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic7c850d6bb2eedc375ee567c19eb17add335f60c
Gerrit-Change-Number: 11228
Gerrit-PatchSet: 2
Gerrit-Owner: Todd Lipcon 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Comment-Date: Wed, 20 Feb 2019 02:12:17 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2166/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 6
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 20 Feb 2019 01:49:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8214: Fix bad plan in load nested.py

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12519 )

Change subject: IMPALA-8214: Fix bad plan in load_nested.py
..


Patch Set 4:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3798/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/12519
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I55e0ca09590a90ba530efe4e8f8bf587dde3eeeb
Gerrit-Change-Number: 12519
Gerrit-PatchSet: 4
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Wed, 20 Feb 2019 01:43:22 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8214: Fix bad plan in load nested.py

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12519 )

Change subject: IMPALA-8214: Fix bad plan in load_nested.py
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/12519
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I55e0ca09590a90ba530efe4e8f8bf587dde3eeeb
Gerrit-Change-Number: 12519
Gerrit-PatchSet: 4
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Wed, 20 Feb 2019 01:43:21 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6662: Make stress test resilient to hangs due to client crashes

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12521 )

Change subject: IMPALA-6662: Make stress test resilient to hangs due to client 
crashes
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2165/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/12521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I525bf13e0f3dd660c0d9f5c2bf6eb292e7ebb8af
Gerrit-Change-Number: 12521
Gerrit-PatchSet: 6
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 20 Feb 2019 01:28:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 7:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3797/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 7
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 20 Feb 2019 01:27:39 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Lars Volker (Code Review)
Lars Volker has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 6: Code-Review+2

Made a cosmetic improvement, carrying Tim's +2


--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 6
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 20 Feb 2019 01:27:30 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 7: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 7
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 20 Feb 2019 01:27:38 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Lars Volker (Code Review)
Hello Tim Armstrong, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12524

to look at the new patch set (#6).

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..

IMPALA-8209: Include fragment instance ID in memz/ breakdown

The change for IMPALA-7694 had accidentally removed the fragment
instance ID from the memz/ breakdown. This change puts it back and adds
a test to make sure it's there.

This change also pads query IDs with zeros when printing them in the
backend.

Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
---
M be/src/runtime/runtime-state.cc
M be/src/util/debug-util.cc
M tests/webserver/test_web_pages.py
3 files changed, 18 insertions(+), 2 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/24/12524/6
--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 6
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-8214: Fix bad plan in load nested.py

2019-02-19 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12519 )

Change subject: IMPALA-8214: Fix bad plan in load_nested.py
..


Patch Set 3: Code-Review+2

Thanks for taking this on. Looks good.


--
To view, visit http://gerrit.cloudera.org:8080/12519
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I55e0ca09590a90ba530efe4e8f8bf587dde3eeeb
Gerrit-Change-Number: 12519
Gerrit-PatchSet: 3
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Wed, 20 Feb 2019 01:22:47 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 4:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2164/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 4
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 20 Feb 2019 01:22:21 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8181: Abbreviate row counts in EXPLAIN

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12438 )

Change subject: IMPALA-8181: Abbreviate row counts in EXPLAIN
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2163/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/12438
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I08faaa9ad7b5ed42dcd7a15a333e8734bb45f10c
Gerrit-Change-Number: 12438
Gerrit-PatchSet: 6
Gerrit-Owner: Paul Rogers 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Comment-Date: Wed, 20 Feb 2019 01:05:45 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6662: Make stress test resilient to hangs due to client crashes

2019-02-19 Thread Tim Armstrong (Code Review)
Hello Thomas Marshall, David Knupp, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12521

to look at the new patch set (#6).

Change subject: IMPALA-6662: Make stress test resilient to hangs due to client 
crashes
..

IMPALA-6662: Make stress test resilient to hangs due to client crashes

Thanks to Sailesh Mukil for the initial version of this patch.

The concurrent_select.py process starts multiple sub processes
(called query runners), to run the queries. It also starts 2 threads
called the query producer thread and the query consumer thread. The
query producer thread adds queries to a query queue and the query
consumer thread pulls off the queue and feeds the queries to the
query runners.

The query runner, once it gets queries, does the following:
...
  with _submit_query_lock:
increment(num_queries_started)
  run_query()# One runner crashes here.
  increment(num_queries_finished)
...

One of the runners crash inside run_query(), thereby never incrementing
num_queries_finished.

Another thread that's supposed to check for memory leaks
(but actually doesn't), periodically acquires '_submit_query_lock' and
waits for the number of running queries to reach 0 before releasing the
lock.

However, in the above case, the number of running queries will never
reach 0 because one of the query runners hasn't incremented
'num_queries_finished' and exited. Therefore, the poll_mem_usage()
function will hold the lock indefinitely, causing no new queries to be
submitted, nor the stress test to complete running.

This patch fixes the problem by changing the global trackers of
num_queries_started and num_queries_finished, etc. to a per
QueryRunner basis. Anytime we want to find the total number of queries
started/finished/cancelled, etc., we aggregate the values from all the
runners. We synchronize access by adding a new lock called the
_query_runners_lock.

In _wait_for_test_to_finish(), we periodically check if a QueryRunner has
died, and if it has, we make sure to update the num_queries_finished to
num_queries_started, since it may have died before updating the 'finished'
value, and we also count the error.

Other changes:
* Boilerplate code is reduced by storing all metrics in a dictionary
  keyed by the metric name, instead of stamping out the code for
  10+ variables.
* Added more comments and debug strings
* Reformatted some code.

Testing:
Ran the stress test with the new patch locally and against a cluster.

Change-Id: I525bf13e0f3dd660c0d9f5c2bf6eb292e7ebb8af
---
M tests/stress/concurrent_select.py
1 file changed, 222 insertions(+), 115 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/21/12521/6
--
To view, visit http://gerrit.cloudera.org:8080/12521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I525bf13e0f3dd660c0d9f5c2bf6eb292e7ebb8af
Gerrit-Change-Number: 12521
Gerrit-PatchSet: 6
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2162/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 3
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 20 Feb 2019 00:53:17 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Lars Volker (Code Review)
Lars Volker has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 3: Code-Review+2

(1 comment)

Addressed Tim's last comment, carrying his +2.

http://gerrit.cloudera.org:8080/#/c/12524/3/tests/webserver/test_web_pages.py
File tests/webserver/test_web_pages.py:

http://gerrit.cloudera.org:8080/#/c/12524/3/tests/webserver/test_web_pages.py@124
PS3, Line 124:   memz_breakdown = 
self.get_debug_page(self.MEMZ_URL)['detailed']
> I think everything except close_query needs to be inside the try block - we
You're right. Fixed it. m(



--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 3
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 20 Feb 2019 00:42:03 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Lars Volker (Code Review)
Hello Tim Armstrong, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12524

to look at the new patch set (#4).

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..

IMPALA-8209: Include fragment instance ID in memz/ breakdown

The change for IMPALA-7694 had accidentally removed the fragment
instance ID from the memz/ breakdown. This change puts it back and adds
a test to make sure it's there.

This change also pads query IDs with zeros when printing them in the
backend.

Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
---
M be/src/runtime/runtime-state.cc
M be/src/util/debug-util.cc
M tests/webserver/test_web_pages.py
3 files changed, 19 insertions(+), 2 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/24/12524/4
--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 4
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 5:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3796/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 5
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 20 Feb 2019 00:42:11 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8181: Abbreviate row counts in EXPLAIN

2019-02-19 Thread Paul Rogers (Code Review)
Paul Rogers has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12438 )

Change subject: IMPALA-8181: Abbreviate row counts in EXPLAIN
..


Patch Set 5:

(2 comments)

Addressed comments. Found one more cardinality field to abbreviate. Because the 
EXPLAIN output changed again, need to rerun all the pre-review tests. Will post 
an update when that completes.

http://gerrit.cloudera.org:8080/#/c/12438/5/fe/src/main/java/org/apache/impala/common/PrintUtils.java
File fe/src/main/java/org/apache/impala/common/PrintUtils.java:

http://gerrit.cloudera.org:8080/#/c/12438/5/fe/src/main/java/org/apache/impala/common/PrintUtils.java@64
PS5, Line 64: if (value == -1) return "unavailable";
> Shouldn't we instead use printCardinality() or some other helper instead of
Done


http://gerrit.cloudera.org:8080/#/c/12438/5/fe/src/main/java/org/apache/impala/common/PrintUtils.java@66
PS5, Line 66: if (value >= TERA) return new 
DecimalFormat(".00T").format(result / TERA);
> Curious if we should be consistent with the backend metrics which print wit
Created an "exact cardinality" method to format the values as suggested. Then, 
I went in search of which methods should use the "estimated" cardinality format 
vs. the "exact" cardinality format. I found that, in the planner, all 
cardinalities are estimates.

This rounding shows at least three digits of precision, so if a number is, say, 
1, 12 or 123, we'll see the exact number. But, if the number is large, we'll 
see an abbreviation, which is about as accurate as the underlying estimate (and 
easier to read.)

Can add the extra field later if we discover a place where we have exactly 
accurate values.



--
To view, visit http://gerrit.cloudera.org:8080/12438
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I08faaa9ad7b5ed42dcd7a15a333e8734bb45f10c
Gerrit-Change-Number: 12438
Gerrit-PatchSet: 5
Gerrit-Owner: Paul Rogers 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Comment-Date: Wed, 20 Feb 2019 00:28:43 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8181: Abbreviate row counts in EXPLAIN

2019-02-19 Thread Paul Rogers (Code Review)
Hello Bharath Vissapragada, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12438

to look at the new patch set (#6).

Change subject: IMPALA-8181: Abbreviate row counts in EXPLAIN
..

IMPALA-8181: Abbreviate row counts in EXPLAIN

A recent fix added node cardinality to the standard EXPLAIN output,
displaying a large number like 123456780 as 123.46M. This patch applies
the same fix to the remaining row count numbers: metadata, extrapolated
rows, etc.

Tests:
* Rebased PlannerTest .test files as needed for the new row count
  format.
* Reran all tests to check for dependencies on the old format.

Change-Id: I08faaa9ad7b5ed42dcd7a15a333e8734bb45f10c
---
M fe/src/main/java/org/apache/impala/common/PrintUtils.java
M fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
M fe/src/main/java/org/apache/impala/planner/PlanNode.java
M fe/src/main/java/org/apache/impala/planner/ScanNode.java
M 
testdata/workloads/functional-planner/queries/PlannerTest/constant-folding.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/fk-pk-join-detection.test
M testdata/workloads/functional-planner/queries/PlannerTest/max-row-size.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/mt-dop-validation.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/parquet-filtering-disabled.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/parquet-filtering.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/resource-requirements.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/sort-expr-materialization.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/spillable-buffer-sizing.test
M testdata/workloads/functional-planner/queries/PlannerTest/tablesample.test
M testdata/workloads/functional-planner/queries/PlannerTest/union.test
M testdata/workloads/functional-query/queries/QueryTest/explain-level2.test
M testdata/workloads/functional-query/queries/QueryTest/explain-level3.test
M testdata/workloads/functional-query/queries/QueryTest/stats-extrapolation.test
M tests/custom_cluster/test_stats_extrapolation.py
M tests/metadata/test_stats_extrapolation.py
20 files changed, 546 insertions(+), 527 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/38/12438/6
--
To view, visit http://gerrit.cloudera.org:8080/12438
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I08faaa9ad7b5ed42dcd7a15a333e8734bb45f10c
Gerrit-Change-Number: 12438
Gerrit-PatchSet: 6
Gerrit-Owner: Paul Rogers 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 3: Code-Review+2

(1 comment)

Thanks for fixing the padding, that should make things more consistent

http://gerrit.cloudera.org:8080/#/c/12524/3/tests/webserver/test_web_pages.py
File tests/webserver/test_web_pages.py:

http://gerrit.cloudera.org:8080/#/c/12524/3/tests/webserver/test_web_pages.py@124
PS3, Line 124:   memz_breakdown = 
self.get_debug_page(self.MEMZ_URL)['detailed']
I think everything except close_query needs to be inside the try block - we 
don't want it to run if wait_for_state throws an exception, right?



--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 3
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 20 Feb 2019 00:32:32 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8191: Wait for additional breakpad processes during test

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12501 )

Change subject: IMPALA-8191: Wait for additional breakpad processes during test
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2161/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/12501
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia4dcc5fecb9b5f38ae1504aae40f099837cf1bca
Gerrit-Change-Number: 12501
Gerrit-PatchSet: 2
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Wed, 20 Feb 2019 00:22:12 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Lars Volker (Code Review)
Hello Tim Armstrong, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12524

to look at the new patch set (#3).

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..

IMPALA-8209: Include fragment instance ID in memz/ breakdown

The change for IMPALA-7694 had accidentally removed the fragment
instance ID from the memz/ breakdown. This change puts it back and adds
a test to make sure it's there.

This change also pads query IDs with zeros when printing them in the
backend.

Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
---
M be/src/runtime/runtime-state.cc
M be/src/util/debug-util.cc
M tests/webserver/test_web_pages.py
3 files changed, 18 insertions(+), 2 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/24/12524/3
--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 3
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-8191: Wait for additional breakpad processes during test

2019-02-19 Thread Philip Zeyliger (Code Review)
Philip Zeyliger has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12501 )

Change subject: IMPALA-8191: Wait for additional breakpad processes during test
..


Patch Set 2: Code-Review+2

I think get_pids() works better here. Thanks!


--
To view, visit http://gerrit.cloudera.org:8080/12501
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia4dcc5fecb9b5f38ae1504aae40f099837cf1bca
Gerrit-Change-Number: 12501
Gerrit-PatchSet: 2
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Wed, 20 Feb 2019 00:03:08 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8191: Wait for additional breakpad processes during test

2019-02-19 Thread Lars Volker (Code Review)
Lars Volker has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12501 )

Change subject: IMPALA-8191: Wait for additional breakpad processes during test
..


Patch Set 3: Code-Review+2

Rebased, carrying Phil's +2


--
To view, visit http://gerrit.cloudera.org:8080/12501
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia4dcc5fecb9b5f38ae1504aae40f099837cf1bca
Gerrit-Change-Number: 12501
Gerrit-PatchSet: 3
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Wed, 20 Feb 2019 00:08:08 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8191: Wait for additional breakpad processes during test

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12501 )

Change subject: IMPALA-8191: Wait for additional breakpad processes during test
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3795/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/12501
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia4dcc5fecb9b5f38ae1504aae40f099837cf1bca
Gerrit-Change-Number: 12501
Gerrit-PatchSet: 3
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Wed, 20 Feb 2019 00:08:29 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7917 (Part 1): Decouple Sentry from Impala

2019-02-19 Thread Paul Rogers (Code Review)
Paul Rogers has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12020 )

Change subject: IMPALA-7917 (Part 1): Decouple Sentry from Impala
..


Patch Set 12: Code-Review+2

(1 comment)

Looks great!

http://gerrit.cloudera.org:8080/#/c/12020/12/fe/src/main/java/org/apache/impala/analysis/Analyzer.java
File fe/src/main/java/org/apache/impala/analysis/Analyzer.java:

http://gerrit.cloudera.org:8080/#/c/12020/12/fe/src/main/java/org/apache/impala/analysis/Analyzer.java@2637
PS12, Line 2637:   
globalState_.maskedPrivilegeReqs.add(Pair.create(privReq, authErrorMsg_));
Nice!



--
To view, visit http://gerrit.cloudera.org:8080/12020
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If1fd1df0b38ddd7cfa41299e95f5827f8a9e9c1f
Gerrit-Change-Number: 12020
Gerrit-PatchSet: 12
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Anonymous Coward (402)
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Comment-Date: Wed, 20 Feb 2019 00:06:12 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7107: [DOCS] Updated the doc for storage formats impala cannot insert into

2019-02-19 Thread Alex Rodoni (Code Review)
Alex Rodoni has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12529 )

Change subject: IMPALA-7107: [DOCS] Updated the doc for storage formats impala 
cannot insert into
..


Patch Set 1:

Balazs,
A generated google doc was shared with you for review. Thanks!


--
To view, visit http://gerrit.cloudera.org:8080/12529
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If154e24ade83e2cde2b08878e6f4f88e0048d657
Gerrit-Change-Number: 12529
Gerrit-PatchSet: 1
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 20 Feb 2019 00:03:16 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7107: [DOCS] Updated the doc for storage formats impala cannot insert into

2019-02-19 Thread Alex Rodoni (Code Review)
Alex Rodoni has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/12529


Change subject: IMPALA-7107: [DOCS] Updated the doc for storage formats impala 
cannot insert into
..

IMPALA-7107: [DOCS] Updated the doc for storage formats impala cannot insert 
into

Change-Id: If154e24ade83e2cde2b08878e6f4f88e0048d657
---
M docs/topics/impala_file_formats.xml
1 file changed, 176 insertions(+), 99 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/29/12529/1
--
To view, visit http://gerrit.cloudera.org:8080/12529
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: If154e24ade83e2cde2b08878e6f4f88e0048d657
Gerrit-Change-Number: 12529
Gerrit-PatchSet: 1
Gerrit-Owner: Alex Rodoni 


[Impala-ASF-CR] IMPALA-7107: [DOCS] Updated the doc for storage formats impala cannot insert into

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12529 )

Change subject: IMPALA-7107: [DOCS] Updated the doc for storage formats impala 
cannot insert into
..


Patch Set 1:

Build Started https://jenkins.impala.io/job/gerrit-docs-auto-test/223/

Testing docs change - this change appears to modify docs/ and no code. This is 
experimental - please report any issues to tarmstr...@cloudera.com or on this 
JIRA: IMPALA-7317


--
To view, visit http://gerrit.cloudera.org:8080/12529
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If154e24ade83e2cde2b08878e6f4f88e0048d657
Gerrit-Change-Number: 12529
Gerrit-PatchSet: 1
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 20 Feb 2019 00:02:37 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8191: Wait for additional breakpad processes during test

2019-02-19 Thread Lars Volker (Code Review)
Hello Philip Zeyliger, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12501

to look at the new patch set (#2).

Change subject: IMPALA-8191: Wait for additional breakpad processes during test
..

IMPALA-8191: Wait for additional breakpad processes during test

The Breakpad signal handler forks off a process to write a minidump.
During the breakpad tests we send signals to the Impala daemons and then
wait for all processes to go away. Prior to this change we did this by
waiting on the PID returned by process.get_pid(). It is determined by
iterating over psutil.get_pid_list() which is an ordered list of PIDs
running on the system. We return the first process in the list with a
matching command line. In cases where the PID space rolled over, this
could have been the forked off breakpad process and we'd wait on that
one. During the subsequent check that all processes are indeed gone, we
could then pick up the original Impala daemon that had forked off to
write the minidump and was still in the process of shutting down.

To fix this, we wait for every process twice. Processes are identified
by their command and iterating through them twice makes sure we catch
both the original daemon and it's breakpad child.

This change also contains improvements to the logging of processes in
our tests. This should make it easier to identify similar issues in the
future.

Testing: I ran the breakpad tests in exhaustive mode. I didn't try to
exercise it around a PID roll-over, but we shouldn't see the issue in
IMPALA-8191 again.

Change-Id: Ia4dcc5fecb9b5f38ae1504aae40f099837cf1bca
---
M tests/common/impala_cluster.py
M tests/custom_cluster/test_breakpad.py
2 files changed, 55 insertions(+), 22 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/01/12501/2
--
To view, visit http://gerrit.cloudera.org:8080/12501
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ia4dcc5fecb9b5f38ae1504aae40f099837cf1bca
Gerrit-Change-Number: 12501
Gerrit-PatchSet: 2
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Philip Zeyliger 


[Impala-ASF-CR] IMPALA-8207: Fix query loading for perf and stress tests

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/12503 )

Change subject: IMPALA-8207: Fix query loading for perf and stress tests
..

IMPALA-8207: Fix query loading for perf and stress tests

Problems with perf queries (run-workload.py):
- TPCH picks up stress test specific queries (TPCH-AGG1/2/3)
- TPCDS picks up queries that were intended just to validate that data
  was loaded properly but that aren't interesting from a perf
  perspective (TPCDS-COUNT-)
- TPCDS picks up both decimal_v1 and decimal_v2 queries. This is
  mostly harmless as for queries with matching names only one gets run
  but it causes some queries with mismatched names to be run twice
  (TPCDS-Q39-1/2 vs. TPCDS-Q39.1/2)

Problems with stress queries (concurrent_select.py):
- TPCDS fails to pick up Q22A as it does not use the decimal_v2
  queries, even though decimal_v2 is the default now.

This problem is exacerbated by the fact that the two scripts have
different code paths for selecting the queries, so in the past changes
that were made to one path were not always made to the other.

This patch merges the two paths to reduce code duplication and prevent
these sorts of issues in the future, and fixes the above issues.

One complication is that historically the stress test has used query
names in the form 'q1' whereas the perf test has used query names in
the form 'TPCH-Q1'. This patch standardizes on using 'TPCH-Q1'.

Testing:
- Added a test that checks that the perf tests pick up the expected
  number of queries.
- Manually ran the scripts and verified that the correct queries are
  selected.

Change-Id: Id1966d6ca8babdda07d47e089b75ba06d0318c0d
Reviewed-on: http://gerrit.cloudera.org:8080/12503
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M testdata/workloads/tpcds/queries/tpcds-decimal_v2-q39-1.test
M testdata/workloads/tpcds/queries/tpcds-decimal_v2-q39-2.test
A tests/infra/test_perf_infra.py
M tests/infra/test_stress_infra.py
M tests/performance/workload.py
M tests/stress/concurrent_select.py
M tests/util/parse_util.py
M tests/util/test_file_parser.py
8 files changed, 107 insertions(+), 63 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/12503
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Id1966d6ca8babdda07d47e089b75ba06d0318c0d
Gerrit-Change-Number: 12503
Gerrit-PatchSet: 4
Gerrit-Owner: Thomas Marshall 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-8207: Fix query loading for perf and stress tests

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12503 )

Change subject: IMPALA-8207: Fix query loading for perf and stress tests
..


Patch Set 3: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/12503
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id1966d6ca8babdda07d47e089b75ba06d0318c0d
Gerrit-Change-Number: 12503
Gerrit-PatchSet: 3
Gerrit-Owner: Thomas Marshall 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 22:31:16 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8214: Fix bad plan in load nested.py

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12519 )

Change subject: IMPALA-8214: Fix bad plan in load_nested.py
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2160/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/12519
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I55e0ca09590a90ba530efe4e8f8bf587dde3eeeb
Gerrit-Change-Number: 12519
Gerrit-PatchSet: 3
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Tue, 19 Feb 2019 22:29:39 +
Gerrit-HasComments: No


[Impala-ASF-CR] Fix misspelling in stress test code

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12520 )

Change subject: Fix misspelling in stress test code
..


Patch Set 1: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/12520
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I410c7ad55693f2fdc6bf7edcc4ae3aeb7a408c43
Gerrit-Change-Number: 12520
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Comment-Date: Tue, 19 Feb 2019 22:09:04 +
Gerrit-HasComments: No


[Impala-ASF-CR] Fix misspelling in stress test code

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/12520 )

Change subject: Fix misspelling in stress test code
..

Fix misspelling in stress test code

Change-Id: I410c7ad55693f2fdc6bf7edcc4ae3aeb7a408c43
Reviewed-on: http://gerrit.cloudera.org:8080/12520
Reviewed-by: David Knupp 
Tested-by: Impala Public Jenkins 
---
M tests/stress/concurrent_select.py
1 file changed, 7 insertions(+), 7 deletions(-)

Approvals:
  David Knupp: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/12520
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I410c7ad55693f2fdc6bf7edcc4ae3aeb7a408c43
Gerrit-Change-Number: 12520
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 


[Impala-ASF-CR] IMPALA-8214: Fix bad plan in load nested.py

2019-02-19 Thread Tim Armstrong (Code Review)
Tim Armstrong has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/12519


Change subject: IMPALA-8214: Fix bad plan in load_nested.py
..

IMPALA-8214: Fix bad plan in load_nested.py

The previous plan had the larger input on the build side of the join and
did a broadcast join, which is very suboptimal.

This speeds up data loading on my minicluster - 18s vs 31s and has a
more significant impact on a real cluster, where queries execute
much faster, the memory requirement is significantly reduced and
the data loading can potentially be broken up into fewer chunks.

I also considered computing stats on the table to let Impala generate
the same plan, but this achieves the same goal more efficiently.

Testing:
Run core tests. Resource estimates in planner tests changed slightly
because of the different distribution of data.

Change-Id: I55e0ca09590a90ba530efe4e8f8bf587dde3eeeb
---
M testdata/bin/load_nested.py
M 
testdata/workloads/functional-planner/queries/PlannerTest/mt-dop-validation.test
M testdata/workloads/functional-planner/queries/PlannerTest/tpch-nested.test
3 files changed, 12 insertions(+), 12 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/19/12519/3
--
To view, visit http://gerrit.cloudera.org:8080/12519
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I55e0ca09590a90ba530efe4e8f8bf587dde3eeeb
Gerrit-Change-Number: 12519
Gerrit-PatchSet: 3
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-5031: oversized signed shifts are undefined

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12514 )

Change subject: IMPALA-5031: oversized signed shifts are undefined
..


Patch Set 2: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/12514
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7debbd1ca5bd3ae640701ce0e95a12c5059abfd7
Gerrit-Change-Number: 12514
Gerrit-PatchSet: 2
Gerrit-Owner: Jim Apple 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 21:39:01 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5031: oversized signed shifts are undefined

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/12514 )

Change subject: IMPALA-5031: oversized signed shifts are undefined
..

IMPALA-5031: oversized signed shifts are undefined

Standard section [expr.shift] says that E1 << E2 is undefined if E1 is
of signed type and the result cannot be represented in the
corresponding unsigned type. We can't simply change 1 << bit_width to
1u << bit_width, though, becuase it is the second argument of the
modulo operator, and following [expr.mul], "If the second operand of /
or % is zero the behavior is undefined."

This expression is tripped in RleTest.ValueSkippingFuzzy, with the
following backtrace:

util/rle-test.cc:304:29: runtime error: shift exponent 32 is too large
  for 32-bit type 'int'
#0 RleTest::MakeRandomSequence(unsigned int, int, int, int)::
 {lambda(int)#1}::operator()(int) const util/rle-test.cc:304:29
#1 RleTest::MakeRandomSequence(unsigned int, int, int, int)
 util/rle-test.cc:315:15
#2 RleTest_ValueSkippingFuzzy_Test::TestBody()
 util/rle-test.cc:392:25

Change-Id: I7debbd1ca5bd3ae640701ce0e95a12c5059abfd7
Reviewed-on: http://gerrit.cloudera.org:8080/12514
Reviewed-by: Tim Armstrong 
Tested-by: Impala Public Jenkins 
---
M be/src/util/rle-test.cc
1 file changed, 1 insertion(+), 0 deletions(-)

Approvals:
  Tim Armstrong: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/12514
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I7debbd1ca5bd3ae640701ce0e95a12c5059abfd7
Gerrit-Change-Number: 12514
Gerrit-PatchSet: 3
Gerrit-Owner: Jim Apple 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] [DOCS] A typo was fixed in impala dedicated coordinator.xml

2019-02-19 Thread Alex Rodoni (Code Review)
Alex Rodoni has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/12525 )

Change subject: [DOCS] A typo was fixed in impala_dedicated_coordinator.xml
..

[DOCS] A typo was fixed in impala_dedicated_coordinator.xml

Change-Id: Ic71fb69e9b59ce82f204ed569089d5ec8dfee66e
Reviewed-on: http://gerrit.cloudera.org:8080/12525
Reviewed-by: Alex Rodoni 
Tested-by: Impala Public Jenkins 
---
M docs/topics/impala_dedicated_coordinator.xml
1 file changed, 1 insertion(+), 1 deletion(-)

Approvals:
  Alex Rodoni: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/12525
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Ic71fb69e9b59ce82f204ed569089d5ec8dfee66e
Gerrit-Change-Number: 12525
Gerrit-PatchSet: 2
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] [DOCS] A typo was fixed in impala dedicated coordinator.xml

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12525 )

Change subject: [DOCS] A typo was fixed in impala_dedicated_coordinator.xml
..


Patch Set 1: Verified+1

Build Successful

https://jenkins.impala.io/job/gerrit-docs-auto-test/222/ : Doc tests passed.


--
To view, visit http://gerrit.cloudera.org:8080/12525
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic71fb69e9b59ce82f204ed569089d5ec8dfee66e
Gerrit-Change-Number: 12525
Gerrit-PatchSet: 1
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 19 Feb 2019 20:53:36 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7917 (Part 1): Decouple Sentry from Impala

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12020 )

Change subject: IMPALA-7917 (Part 1): Decouple Sentry from Impala
..


Patch Set 12:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2158/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/12020
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If1fd1df0b38ddd7cfa41299e95f5827f8a9e9c1f
Gerrit-Change-Number: 12020
Gerrit-PatchSet: 12
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Anonymous Coward (402)
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Comment-Date: Tue, 19 Feb 2019 20:51:32 +
Gerrit-HasComments: No


[Impala-ASF-CR] [DOCS] A typo was fixed in impala dedicated coordinator.xml

2019-02-19 Thread Alex Rodoni (Code Review)
Alex Rodoni has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12525 )

Change subject: [DOCS] A typo was fixed in impala_dedicated_coordinator.xml
..


Patch Set 1: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/12525
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic71fb69e9b59ce82f204ed569089d5ec8dfee66e
Gerrit-Change-Number: 12525
Gerrit-PatchSet: 1
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 19 Feb 2019 20:51:14 +
Gerrit-HasComments: No


[Impala-ASF-CR] [DOCS] A typo was fixed in impala dedicated coordinator.xml

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12525 )

Change subject: [DOCS] A typo was fixed in impala_dedicated_coordinator.xml
..


Patch Set 1:

Build Started https://jenkins.impala.io/job/gerrit-docs-auto-test/222/

Testing docs change - this change appears to modify docs/ and no code. This is 
experimental - please report any issues to tarmstr...@cloudera.com or on this 
JIRA: IMPALA-7317


--
To view, visit http://gerrit.cloudera.org:8080/12525
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic71fb69e9b59ce82f204ed569089d5ec8dfee66e
Gerrit-Change-Number: 12525
Gerrit-PatchSet: 1
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 19 Feb 2019 20:51:05 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6662: Make stress test resilient to hangs due to client crashes

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12521 )

Change subject: IMPALA-6662: Make stress test resilient to hangs due to client 
crashes
..


Patch Set 5:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2159/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/12521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I525bf13e0f3dd660c0d9f5c2bf6eb292e7ebb8af
Gerrit-Change-Number: 12521
Gerrit-PatchSet: 5
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 20:50:57 +
Gerrit-HasComments: No


[Impala-ASF-CR] [DOCS] A typo was fixed in impala dedicated coordinator.xml

2019-02-19 Thread Alex Rodoni (Code Review)
Alex Rodoni has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/12525


Change subject: [DOCS] A typo was fixed in impala_dedicated_coordinator.xml
..

[DOCS] A typo was fixed in impala_dedicated_coordinator.xml

Change-Id: Ic71fb69e9b59ce82f204ed569089d5ec8dfee66e
---
M docs/topics/impala_dedicated_coordinator.xml
1 file changed, 1 insertion(+), 1 deletion(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/25/12525/1
--
To view, visit http://gerrit.cloudera.org:8080/12525
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ic71fb69e9b59ce82f204ed569089d5ec8dfee66e
Gerrit-Change-Number: 12525
Gerrit-PatchSet: 1
Gerrit-Owner: Alex Rodoni 


[Impala-ASF-CR] IMPALA-7917 (Part 1): Decouple Sentry from Impala

2019-02-19 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has uploaded a new patch set (#12). ( 
http://gerrit.cloudera.org:8080/12020 )

Change subject: IMPALA-7917 (Part 1): Decouple Sentry from Impala
..

IMPALA-7917 (Part 1): Decouple Sentry from Impala

The first part of this patch is to provide an initial work to decouple
Sentry from Impala by creating a generic authorization provider
interface that Sentry implements. The idea is to allow more
authorization providers in the future. The patch updates the following:
- Renamed Authorizeable to Authorizable to fix typographical error.
- Moved any clases that uses Sentry specific code to
  org.apache.impala.authorization.sentry package and created interfaces
  when necessary.
- Moved all generic authorization related classes to
  org.apache.impala.authorization package.
- Minor clean up on authorization related code.

In this patch, switching the authorization provider implementation
still requires updating the code in many different places. A follow up
patch will make it easy to switch an authorization provider
implementation.

This patch has no functionality change.

Testing:
- Ran all FE tests
- Ran all E2E authorization tests

Change-Id: If1fd1df0b38ddd7cfa41299e95f5827f8a9e9c1f
---
M fe/src/main/java/org/apache/impala/analysis/AdminFnStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableSetTblProperties.java
M fe/src/main/java/org/apache/impala/analysis/AnalysisContext.java
M fe/src/main/java/org/apache/impala/analysis/Analyzer.java
M fe/src/main/java/org/apache/impala/analysis/AuthorizationStmt.java
M fe/src/main/java/org/apache/impala/analysis/CollectionTableRef.java
M fe/src/main/java/org/apache/impala/analysis/CopyTestCaseStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateFunctionStmtBase.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableStmt.java
M fe/src/main/java/org/apache/impala/analysis/DescribeTableStmt.java
M fe/src/main/java/org/apache/impala/analysis/DropFunctionStmt.java
M fe/src/main/java/org/apache/impala/analysis/HdfsUri.java
M fe/src/main/java/org/apache/impala/analysis/InsertStmt.java
M fe/src/main/java/org/apache/impala/analysis/ResetMetadataStmt.java
R fe/src/main/java/org/apache/impala/authorization/Authorizable.java
M fe/src/main/java/org/apache/impala/authorization/AuthorizationChecker.java
M fe/src/main/java/org/apache/impala/authorization/AuthorizationConfig.java
R fe/src/main/java/org/apache/impala/authorization/AuthorizationException.java
R fe/src/main/java/org/apache/impala/authorization/AuthorizationPolicy.java
C fe/src/main/java/org/apache/impala/authorization/AuthorizationProvider.java
D fe/src/main/java/org/apache/impala/authorization/AuthorizeableColumn.java
D fe/src/main/java/org/apache/impala/authorization/AuthorizeableUri.java
M fe/src/main/java/org/apache/impala/authorization/Privilege.java
M fe/src/main/java/org/apache/impala/authorization/PrivilegeRequest.java
M fe/src/main/java/org/apache/impala/authorization/PrivilegeRequestBuilder.java
A fe/src/main/java/org/apache/impala/authorization/sentry/ImpalaAction.java
R 
fe/src/main/java/org/apache/impala/authorization/sentry/ImpalaActionFactory.java
R 
fe/src/main/java/org/apache/impala/authorization/sentry/ImpalaPrivilegeModel.java
R 
fe/src/main/java/org/apache/impala/authorization/sentry/SentryAuthProvider.java
A 
fe/src/main/java/org/apache/impala/authorization/sentry/SentryAuthorizable.java
A 
fe/src/main/java/org/apache/impala/authorization/sentry/SentryAuthorizableColumn.java
R 
fe/src/main/java/org/apache/impala/authorization/sentry/SentryAuthorizableDb.java
R 
fe/src/main/java/org/apache/impala/authorization/sentry/SentryAuthorizableFn.java
R 
fe/src/main/java/org/apache/impala/authorization/sentry/SentryAuthorizableServer.java
R 
fe/src/main/java/org/apache/impala/authorization/sentry/SentryAuthorizableTable.java
A 
fe/src/main/java/org/apache/impala/authorization/sentry/SentryAuthorizableUri.java
A 
fe/src/main/java/org/apache/impala/authorization/sentry/SentryAuthorizationChecker.java
A 
fe/src/main/java/org/apache/impala/authorization/sentry/SentryAuthorizationConfig.java
A 
fe/src/main/java/org/apache/impala/authorization/sentry/SentryAuthorizationPolicy.java
R fe/src/main/java/org/apache/impala/authorization/sentry/SentryConfig.java
R 
fe/src/main/java/org/apache/impala/authorization/sentry/SentryPolicyReaderException.java
R 
fe/src/main/java/org/apache/impala/authorization/sentry/SentryPolicyService.java
R fe/src/main/java/org/apache/impala/authorization/sentry/SentryProxy.java
R 
fe/src/main/java/org/apache/impala/authorization/sentry/SentryUnavailableException.java
R fe/src/main/java/org/apache/impala/authorization/sentry/SentryUtil.java
M fe/src/main/java/org/apache/impala/catalog/Catalog.java
M fe/src/main/java/org/apache/impala/catalog/CatalogServiceCatalog.java
M fe/src/main/java/org/apache/impala/catalog/FeCatalog.java
M fe/src/main/java/org/apache/impala/catalog/ImpaladCatalog.java

[Impala-ASF-CR] IMPALA-6662: Make stress test resilient to hangs due to client crashes

2019-02-19 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12521 )

Change subject: IMPALA-6662: Make stress test resilient to hangs due to client 
crashes
..


Patch Set 5:

This is a rebased version of https://gerrit.cloudera.org/#/c/9635/ with some 
additional cleanup and a bugfix for the timeout logic.


--
To view, visit http://gerrit.cloudera.org:8080/12521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I525bf13e0f3dd660c0d9f5c2bf6eb292e7ebb8af
Gerrit-Change-Number: 12521
Gerrit-PatchSet: 5
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 20:08:19 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6662: Make stress test resilient to hangs due to client crashes

2019-02-19 Thread Tim Armstrong (Code Review)
Tim Armstrong has uploaded a new patch set (#5). ( 
http://gerrit.cloudera.org:8080/12521 )

Change subject: IMPALA-6662: Make stress test resilient to hangs due to client 
crashes
..

IMPALA-6662: Make stress test resilient to hangs due to client crashes

Thanks to Sailesh Mukil for the initial version of this patch.

The concurrent_select.py process starts multiple sub processes
(called query runners), to run the queries. It also starts 2 threads
called the query producer thread and the query consumer thread. The
query producer thread adds queries to a query queue and the query
consumer thread pulls off the queue and feeds the queries to the
query runners.

The query runner, once it gets queries, does the following:
...
  with _submit_query_lock:
increment(num_queries_started)
  run_query()# One runner crashes here.
  increment(num_queries_finished)
...

One of the runners crash inside run_query(), thereby never incrementing
num_queries_finished.

Another thread that's supposed to check for memory leaks
(but actually doesn't), periodically acquires '_submit_query_lock' and
waits for the number of running queries to reach 0 before releasing the
lock.

However, in the above case, the number of running queries will never
reach 0 because one of the query runners hasn't incremented
'num_queries_finished' and exited. Therefore, the poll_mem_usage()
function will hold the lock indefinitely, causing no new queries to be
submitted, nor the stress test to complete running.

This patch fixes the problem by changing the global trackers of
num_queries_started and num_queries_finished, etc. to a per
QueryRunner basis. Anytime we want to find the total number of queries
started/finished/cancelled, etc., we aggregate the values from all the
runners. We synchronize access by adding a new lock called the
_query_runners_lock.

In _wait_for_test_to_finish(), we periodically check if a QueryRunner has
died, and if it has, we make sure to update the num_queries_finished to
num_queries_started, since it may have died before updating the 'finished'
value, and we also count the error.

Other changes:
* Boilerplate code is reduced by storing all metrics in a dictionary
  keyed by the metric name, instead of stamping out the code for
  10+ variables.
* Added more comments and debug strings
* Reformatted some code.

Testing:
Ran the stress test with the new patch locally and against a cluster.

Change-Id: I525bf13e0f3dd660c0d9f5c2bf6eb292e7ebb8af
---
M tests/stress/concurrent_select.py
1 file changed, 222 insertions(+), 115 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/21/12521/5
--
To view, visit http://gerrit.cloudera.org:8080/12521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I525bf13e0f3dd660c0d9f5c2bf6eb292e7ebb8af
Gerrit-Change-Number: 12521
Gerrit-PatchSet: 5
Gerrit-Owner: Tim Armstrong 


[Impala-ASF-CR] IMPALA-7917 (Part 1): Decouple Sentry from Impala

2019-02-19 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12020 )

Change subject: IMPALA-7917 (Part 1): Decouple Sentry from Impala
..


Patch Set 12:

Rebased to fix merge conflicts. Paul/Bharath, can you take a look at this again?


--
To view, visit http://gerrit.cloudera.org:8080/12020
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If1fd1df0b38ddd7cfa41299e95f5827f8a9e9c1f
Gerrit-Change-Number: 12020
Gerrit-PatchSet: 12
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Anonymous Coward (402)
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Todd Lipcon 
Gerrit-Comment-Date: Tue, 19 Feb 2019 20:07:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 2:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/12524/2/tests/webserver/test_web_pages.py
File tests/webserver/test_web_pages.py:

http://gerrit.cloudera.org:8080/#/c/12524/2/tests/webserver/test_web_pages.py@129
PS2, Line 129: self.client.cancel(query_handle)
Could maybe use finally to run this on both code paths. Should this be 
close_query() instead of cancel() as well to unregister the query?



--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 2
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 19:42:32 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2157/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 2
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 19:15:03 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Lars Volker (Code Review)
Lars Volker has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/12524/1/tests/webserver/test_web_pages.py
File tests/webserver/test_web_pages.py:

http://gerrit.cloudera.org:8080/#/c/12524/1/tests/webserver/test_web_pages.py@124
PS1, Line 124:   self.client.cancel(query_handle)
> Shouldn't we cancel the query after getting the memz output?
Good point, fixed. Looks like the cancelation was slow enough to work in my 
testing. :(



--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 1
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 18:58:13 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 2:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/12524/2/tests/webserver/test_web_pages.py
File tests/webserver/test_web_pages.py:

http://gerrit.cloudera.org:8080/#/c/12524/2/tests/webserver/test_web_pages.py@123
PS2, Line 123: e
flake8: E722 do not use bare except'



--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 2
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 18:58:27 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2156/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 1
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 18:57:47 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Lars Volker (Code Review)
Hello Tim Armstrong, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12524

to look at the new patch set (#2).

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..

IMPALA-8209: Include fragment instance ID in memz/ breakdown

The change for IMPALA-7694 had accidentally removed the fragment
instance ID from the memz/ breakdown. This change puts it back and adds
a test to make sure it's there.

Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
---
M be/src/runtime/runtime-state.cc
M tests/webserver/test_web_pages.py
2 files changed, 19 insertions(+), 1 deletion(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/24/12524/2
--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 2
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-8181: Abbreviate row counts in EXPLAIN

2019-02-19 Thread Bharath Vissapragada (Code Review)
Bharath Vissapragada has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12438 )

Change subject: IMPALA-8181: Abbreviate row counts in EXPLAIN
..


Patch Set 5:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/12438/5/fe/src/main/java/org/apache/impala/common/PrintUtils.java
File fe/src/main/java/org/apache/impala/common/PrintUtils.java:

http://gerrit.cloudera.org:8080/#/c/12438/5/fe/src/main/java/org/apache/impala/common/PrintUtils.java@64
PS5, Line 64: if (value == -1) return "unavailable";
Shouldn't we instead use printCardinality() or some other helper instead of 
doing it here?


http://gerrit.cloudera.org:8080/#/c/12438/5/fe/src/main/java/org/apache/impala/common/PrintUtils.java@66
PS5, Line 66: if (value >= TERA) return new 
DecimalFormat(".00T").format(result / TERA);
Curious if we should be consistent with the backend metrics which print with 
the following format -  pretty-printed-val (actual val)

Ex:

 PeakMemoryUsage: 149.50 KB (153088)
   - PeakReservation: 8.00 KB (8192)
   - PeakUsedReservation: 0
   - PerHostPeakMemUsage: 169.50 KB (173568)
   - RowsProduced: 1 (1)

I think we probably want exact values in some cases, ex: off-by-one cases in 
cardinality etc.



--
To view, visit http://gerrit.cloudera.org:8080/12438
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I08faaa9ad7b5ed42dcd7a15a333e8734bb45f10c
Gerrit-Change-Number: 12438
Gerrit-PatchSet: 5
Gerrit-Owner: Paul Rogers 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Comment-Date: Tue, 19 Feb 2019 18:35:32 +
Gerrit-HasComments: Yes


[native-toolchain-CR] Add support for sles12

2019-02-19 Thread hector.aco...@cloudera.com (Code Review)
Hello Laszlo Gaal,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12422

to look at the new patch set (#2).

Change subject: Add support for sles12
..

Add support for sles12

We add support for building native-toolchain sles images. Since these
images require access to a mirror, they're not built unless the
SLES_MIRROR environment variable is set.

Change-Id: If8a60e3f4ee94f9a4cd76128b8658802870daca9
---
M Makefile
M docker/all/assert-dependencies-present.py
M docker/buildall.sh
A docker/sles12.df
M in-docker.py
5 files changed, 74 insertions(+), 1 deletion(-)


  git pull ssh://gerrit.cloudera.org:29418/native-toolchain 
refs/changes/22/12422/2
--
To view, visit http://gerrit.cloudera.org:8080/12422
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: native-toolchain
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: If8a60e3f4ee94f9a4cd76128b8658802870daca9
Gerrit-Change-Number: 12422
Gerrit-PatchSet: 2
Gerrit-Owner: hector.aco...@cloudera.com 
Gerrit-Reviewer: Laszlo Gaal 


[native-toolchain-CR] Enable publishing on the docker-based build

2019-02-19 Thread hector.aco...@cloudera.com (Code Review)
hector.aco...@cloudera.com has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12423 )

Change subject: Enable publishing on the docker-based build
..


Patch Set 3:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/12423/2/in-docker.py
File in-docker.py:

http://gerrit.cloudera.org:8080/#/c/12423/2/in-docker.py@57
PS2, Line 57: Maps docker images to BUILD_TARGET_LABELs which is ultimately 
included
: # in the path for each built package
> It may be worth mentioning here that Impala/bin/bootstrap_toolchain.py cont
Done


http://gerrit.cloudera.org:8080/#/c/12423/2/in-docker.py@130
PS2, Line 130: PUBLISH_D
> Maybe add the AWS credential variables to the passthrough list, in case the
Done



--
To view, visit http://gerrit.cloudera.org:8080/12423
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: native-toolchain
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I32966bf5b5ad32a3caaf2aee0430ea3a67ecb5d6
Gerrit-Change-Number: 12423
Gerrit-PatchSet: 3
Gerrit-Owner: hector.aco...@cloudera.com 
Gerrit-Reviewer: Laszlo Gaal 
Gerrit-Reviewer: hector.aco...@cloudera.com 
Gerrit-Comment-Date: Tue, 19 Feb 2019 18:27:20 +
Gerrit-HasComments: Yes


[native-toolchain-CR] Add support for sles12

2019-02-19 Thread hector.aco...@cloudera.com (Code Review)
hector.aco...@cloudera.com has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12422 )

Change subject: Add support for sles12
..


Patch Set 2:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/12422/1/docker/buildall.sh
File docker/buildall.sh:

http://gerrit.cloudera.org:8080/#/c/12422/1/docker/buildall.sh@25
PS1, Line 25: if [[ -n "${SLES_MIRROR:-""}" ]]; then
:   BUILD_ARGS+=(--build-arg="SLES_MIRROR=$SLES_MIRROR")
> Could you mention this dependency in the commit message?
Done



--
To view, visit http://gerrit.cloudera.org:8080/12422
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: native-toolchain
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If8a60e3f4ee94f9a4cd76128b8658802870daca9
Gerrit-Change-Number: 12422
Gerrit-PatchSet: 2
Gerrit-Owner: hector.aco...@cloudera.com 
Gerrit-Reviewer: Laszlo Gaal 
Gerrit-Reviewer: hector.aco...@cloudera.com 
Gerrit-Comment-Date: Tue, 19 Feb 2019 18:27:15 +
Gerrit-HasComments: Yes


[native-toolchain-CR] Initial support for building the toolchain in docker

2019-02-19 Thread hector.aco...@cloudera.com (Code Review)
hector.aco...@cloudera.com has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12285 )

Change subject: Initial support for building the toolchain in docker
..


Patch Set 6:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/12285/5/docker/redhat6.df
File docker/redhat6.df:

http://gerrit.cloudera.org:8080/#/c/12285/5/docker/redhat6.df@4
PS5, Line 4: # We need to pin our yum updates to keep old library versions.
> Repo pinning seems to be missing from here. Is that intentional?
Done


http://gerrit.cloudera.org:8080/#/c/12285/5/docker/redhat6.df@6
PS5, Line 6: 's
> nit: seems to be redundant with "-y" also present in the docker/redhat/yum-
Done


http://gerrit.cloudera.org:8080/#/c/12285/5/in-docker.py
File in-docker.py:

http://gerrit.cloudera.org:8080/#/c/12285/5/in-docker.py@112
PS5, Line 112: passthrough_env
> Maybe add the env vars needed for publishing to the S3 bucket:
Done



--
To view, visit http://gerrit.cloudera.org:8080/12285
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: native-toolchain
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If42c9bc06a3d303642eb37dea784b61e2a1f5cc6
Gerrit-Change-Number: 12285
Gerrit-PatchSet: 6
Gerrit-Owner: hector.aco...@cloudera.com 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Laszlo Gaal 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: hector.aco...@cloudera.com 
Gerrit-Comment-Date: Tue, 19 Feb 2019 18:27:09 +
Gerrit-HasComments: Yes


[native-toolchain-CR] Add support for Ubuntu 18.04

2019-02-19 Thread hector.aco...@cloudera.com (Code Review)
hector.aco...@cloudera.com has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12286 )

Change subject: Add support for Ubuntu 18.04
..


Patch Set 7:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/12286/2//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/12286/2//COMMIT_MSG@13
PS2, Line 13:  ./in-docker.py impala-toolchain-ubuntu18 ./buildall.sh
> Ignore this, I saw the other patchset.
Done


http://gerrit.cloudera.org:8080/#/c/12286/2/docker/ubuntu18.df
File docker/ubuntu18.df:

http://gerrit.cloudera.org:8080/#/c/12286/2/docker/ubuntu18.df@5
PS2, Line 5:
> libtool seems to be missing here; the Kudu build broke for me with:
Done



--
To view, visit http://gerrit.cloudera.org:8080/12286
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: native-toolchain
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie1e63fd41ad293008dec2423a5a361cabadcdb8a
Gerrit-Change-Number: 12286
Gerrit-PatchSet: 7
Gerrit-Owner: hector.aco...@cloudera.com 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Laszlo Gaal 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: hector.aco...@cloudera.com 
Gerrit-Comment-Date: Tue, 19 Feb 2019 18:27:02 +
Gerrit-HasComments: Yes


[native-toolchain-CR] Initial support for building the toolchain in docker

2019-02-19 Thread hector.aco...@cloudera.com (Code Review)
Hello Lars Volker, Laszlo Gaal, Philip Zeyliger, Tim Armstrong,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12285

to look at the new patch set (#6).

Change subject: Initial support for building the toolchain in docker
..

Initial support for building the toolchain in docker

Adds Dockerfiles and supporting scripts to build the toolchain in a
docker container. It should now be possible to run any build*.sh script
in a supported platform by running:

./in-docker.py impala-toolchain-debian8 "./build.sh gflags 2.2.0-p2"
 - To build gflags version 2.2.0-p2 in debian8

It is also possible to build the entire toolchain for all platforms by
running:

make -j $P

Change-Id: If42c9bc06a3d303642eb37dea784b61e2a1f5cc6
---
A Makefile
A docker/all/assert-dependencies-present.py
A docker/all/postinstall.sh
A docker/buildall.sh
A docker/debian7.df
A docker/debian8.df
A docker/redhat/CentOS-6.6-Base.repo
A docker/redhat/CentOS-7.2.1511-Base.repo
A docker/redhat/yum-install
A docker/redhat6.df
A docker/redhat7.df
A docker/ubuntu1204.df
A docker/ubuntu1404.df
A docker/ubuntu1604.df
M functions.sh
A in-docker.py
16 files changed, 728 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/native-toolchain 
refs/changes/85/12285/6
--
To view, visit http://gerrit.cloudera.org:8080/12285
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: native-toolchain
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: If42c9bc06a3d303642eb37dea784b61e2a1f5cc6
Gerrit-Change-Number: 12285
Gerrit-PatchSet: 6
Gerrit-Owner: hector.aco...@cloudera.com 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Laszlo Gaal 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: hector.aco...@cloudera.com 


[native-toolchain-CR] Enable publishing on the docker-based build

2019-02-19 Thread hector.aco...@cloudera.com (Code Review)
Hello Laszlo Gaal,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12423

to look at the new patch set (#3).

Change subject: Enable publishing on the docker-based build
..

Enable publishing on the docker-based build

In order to enable publishing from each container this commit:
 * Adds maven and aws commands
 * Modifies the upload function to conditionally upload to
   artifactory/s3
 * Generate TOOLCHAIN_ID (if not set) and share it with all the
   containers. This is done in the Makefile so that all containers
   get the same TOOLCHAIN_ID.
 * Generate the BUILD_TARGET_LABEL based on the docker image name.

Change-Id: I32966bf5b5ad32a3caaf2aee0430ea3a67ecb5d6
---
M Makefile
M docker/all/assert-dependencies-present.py
M docker/debian7.df
M docker/debian8.df
M docker/redhat6.df
M docker/redhat7.df
M docker/sles12.df
M docker/ubuntu1204.df
M docker/ubuntu1404.df
M docker/ubuntu1604.df
M docker/ubuntu1804.df
M functions.sh
M in-docker.py
M init.sh
14 files changed, 185 insertions(+), 23 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/native-toolchain 
refs/changes/23/12423/3
--
To view, visit http://gerrit.cloudera.org:8080/12423
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: native-toolchain
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I32966bf5b5ad32a3caaf2aee0430ea3a67ecb5d6
Gerrit-Change-Number: 12423
Gerrit-PatchSet: 3
Gerrit-Owner: hector.aco...@cloudera.com 
Gerrit-Reviewer: Laszlo Gaal 


[Impala-ASF-CR] IMPALA-8207: Fix query loading for perf and stress tests

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12503 )

Change subject: IMPALA-8207: Fix query loading for perf and stress tests
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3793/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/12503
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id1966d6ca8babdda07d47e089b75ba06d0318c0d
Gerrit-Change-Number: 12503
Gerrit-PatchSet: 3
Gerrit-Owner: Thomas Marshall 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 18:24:21 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8207: Fix query loading for perf and stress tests

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12503 )

Change subject: IMPALA-8207: Fix query loading for perf and stress tests
..


Patch Set 3: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/12503
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id1966d6ca8babdda07d47e089b75ba06d0318c0d
Gerrit-Change-Number: 12503
Gerrit-PatchSet: 3
Gerrit-Owner: Thomas Marshall 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 18:24:20 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6897: Catalog server web-ui should expose top-n tables with most number of file. The commit includes: Add functions in CatalogUsageMonitor to monitor and report the catalog

2019-02-19 Thread Bharath Vissapragada (Code Review)
Bharath Vissapragada has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12483 )

Change subject: IMPALA-6897: Catalog server web-ui should expose top-n tables 
with most number of file. The commit includes:   Add functions in 
CatalogUsageMonitor to monitor and report the catalog usage of the tables 
have the most number of files   Add methods to ex
..


Patch Set 3:

(10 comments)

Can you paste a screenshot of how this looks like?

http://gerrit.cloudera.org:8080/#/c/12483/3//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/12483/3//COMMIT_MSG@7
PS3, Line 7: IMPALA-6897: Catalog server web-ui should expose top-n tables with 
most
   : number of file.
nit: rephrase to limit it to 72 chars.


http://gerrit.cloudera.org:8080/#/c/12483/3//COMMIT_MSG@10
PS3, Line 10:   Add functions in CatalogUsageMonitor to monitor and report the 
catalog
nit: Indent these better. Also, try to avoid implementation detail and 
summarize the functionality that was added?


http://gerrit.cloudera.org:8080/#/c/12483/3/be/src/catalog/catalog-server.h
File be/src/catalog/catalog-server.h:

http://gerrit.cloudera.org:8080/#/c/12483/3/be/src/catalog/catalog-server.h@187
PS3, Line 187: .
Update?


http://gerrit.cloudera.org:8080/#/c/12483/3/common/thrift/JniCatalog.thrift
File common/thrift/JniCatalog.thrift:

http://gerrit.cloudera.org:8080/#/c/12483/3/common/thrift/JniCatalog.thrift@710
PS3, Line 710: that table has
nit:...in this table.


http://gerrit.cloudera.org:8080/#/c/12483/3/common/thrift/JniCatalog.thrift@711
PS3, Line 711:   4: optional i64 num_files
nit: Clarify that for partitioned tables, this includes file counts across all  
the partitions?


http://gerrit.cloudera.org:8080/#/c/12483/3/common/thrift/JniCatalog.thrift@724
PS3, Line 724: most_num_files_tables
rename to something like high_file_count_tables or something? (or any other 
better name you can think of)


http://gerrit.cloudera.org:8080/#/c/12483/3/fe/src/main/java/org/apache/impala/catalog/Table.java
File fe/src/main/java/org/apache/impala/catalog/Table.java:

http://gerrit.cloudera.org:8080/#/c/12483/3/fe/src/main/java/org/apache/impala/catalog/Table.java@152
PS3, Line 152: getNumOfFiles
getNumFiles()?


http://gerrit.cloudera.org:8080/#/c/12483/3/fe/src/main/java/org/apache/impala/catalog/Table.java@168
PS3, Line 168:  public void setNumOfFiles(long numOfFiles) {
 : numOfFiles_.set(numOfFiles);
 : if (!isStoredInImpaladCatalogCache()) {
 :   CatalogUsageMonitor.INSTANCE.updateMostFilesTables(this);
 : }
 :   }
This does not make sense for non-HDFS tables. Instead do this in HdfsTable 
class?


http://gerrit.cloudera.org:8080/#/c/12483/3/tests/webserver/test_web_pages.py
File tests/webserver/test_web_pages.py:

http://gerrit.cloudera.org:8080/#/c/12483/3/tests/webserver/test_web_pages.py@271
PS3, Line 271: es")
Could you also validate the number of files?


http://gerrit.cloudera.org:8080/#/c/12483/3/tests/webserver/test_web_pages.py@272
PS3, Line 272:   def __test_catalog_object(self, db_name, tbl_name):
newline



--
To view, visit http://gerrit.cloudera.org:8080/12483
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I04df5756641bb10dbb86d813b8001c4a04d7dc9b
Gerrit-Change-Number: 12483
Gerrit-PatchSet: 3
Gerrit-Owner: Yongzhi Chen 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 19 Feb 2019 18:22:19 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12524 )

Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/12524/1/tests/webserver/test_web_pages.py
File tests/webserver/test_web_pages.py:

http://gerrit.cloudera.org:8080/#/c/12524/1/tests/webserver/test_web_pages.py@124
PS1, Line 124:   self.client.cancel(query_handle)
Shouldn't we cancel the query after getting the memz output?



--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 1
Gerrit-Owner: Lars Volker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 18:17:32 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8209: Include fragment instance ID in memz/ breakdown

2019-02-19 Thread Lars Volker (Code Review)
Lars Volker has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/12524


Change subject: IMPALA-8209: Include fragment instance ID in memz/ breakdown
..

IMPALA-8209: Include fragment instance ID in memz/ breakdown

The change for IMPALA-7694 had accidentally removed the fragment
instance ID from the memz/ breakdown. This change puts it back and adds
a test to make sure it's there.

Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
---
M be/src/runtime/runtime-state.cc
M tests/webserver/test_web_pages.py
2 files changed, 17 insertions(+), 1 deletion(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/24/12524/1
--
To view, visit http://gerrit.cloudera.org:8080/12524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I73bf06bf95c88186b16fd03243de9bac946c5cc8
Gerrit-Change-Number: 12524
Gerrit-PatchSet: 1
Gerrit-Owner: Lars Volker 


[Impala-ASF-CR] Fix misspelling in stress test code

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12520 )

Change subject: Fix misspelling in stress test code
..


Patch Set 1:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3792/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/12520
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I410c7ad55693f2fdc6bf7edcc4ae3aeb7a408c43
Gerrit-Change-Number: 12520
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Comment-Date: Tue, 19 Feb 2019 18:09:53 +
Gerrit-HasComments: No


[Impala-ASF-CR] Fix misspelling in stress test code

2019-02-19 Thread David Knupp (Code Review)
David Knupp has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12520 )

Change subject: Fix misspelling in stress test code
..


Patch Set 1: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/12520
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I410c7ad55693f2fdc6bf7edcc4ae3aeb7a408c43
Gerrit-Change-Number: 12520
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Comment-Date: Tue, 19 Feb 2019 17:45:32 +
Gerrit-HasComments: No


[Impala-ASF-CR] Fix misspelling in stress test code

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12520 )

Change subject: Fix misspelling in stress test code
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2155/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/12520
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I410c7ad55693f2fdc6bf7edcc4ae3aeb7a408c43
Gerrit-Change-Number: 12520
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Comment-Date: Tue, 19 Feb 2019 17:32:32 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8207: Fix query loading for perf and stress tests

2019-02-19 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12503 )

Change subject: IMPALA-8207: Fix query loading for perf and stress tests
..


Patch Set 2: Code-Review+2

I tested this out a bit in my stress runs and it helps a lot.


--
To view, visit http://gerrit.cloudera.org:8080/12503
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id1966d6ca8babdda07d47e089b75ba06d0318c0d
Gerrit-Change-Number: 12503
Gerrit-PatchSet: 2
Gerrit-Owner: Thomas Marshall 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 17:27:02 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5031: oversized signed shifts are undefined

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12514 )

Change subject: IMPALA-5031: oversized signed shifts are undefined
..


Patch Set 2:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3791/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/12514
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7debbd1ca5bd3ae640701ce0e95a12c5059abfd7
Gerrit-Change-Number: 12514
Gerrit-PatchSet: 2
Gerrit-Owner: Jim Apple 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 17:24:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5031: oversized signed shifts are undefined

2019-02-19 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12514 )

Change subject: IMPALA-5031: oversized signed shifts are undefined
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/12514
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7debbd1ca5bd3ae640701ce0e95a12c5059abfd7
Gerrit-Change-Number: 12514
Gerrit-PatchSet: 2
Gerrit-Owner: Jim Apple 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 17:16:41 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6662: Make stress test resilient to hangs due to client crashes

2019-02-19 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/9635 )

Change subject: IMPALA-6662: Make stress test resilient to hangs due to client 
crashes
..


Patch Set 6:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/9635/6/tests/stress/concurrent_select.py
File tests/stress/concurrent_select.py:

http://gerrit.cloudera.org:8080/#/c/9635/6/tests/stress/concurrent_select.py@698
PS6, Line 698:   timeout = solo_runtime * max(10, 
query_runner._num_queries_started.value - \
I'm pretty sure this change is the cause of the increased timeouts that Sailesh 
saw. The problem is that it's using the query-runner-local counts to try to 
calculate the number of active queries globally. But each runner has only one 
query at a time, so the second argument to max() is either 0 or 1.

The 10x factor here is also kind-of broken anyway since there's no reason 
queries can't run more than 10x slower under load.



--
To view, visit http://gerrit.cloudera.org:8080/9635
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I10c5dc9b8c2fffc471bac2279e348bc1d1fec3b7
Gerrit-Change-Number: 9635
Gerrit-PatchSet: 6
Gerrit-Owner: Sailesh Mukil 
Gerrit-Reviewer: Dan Hecht 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Lars Volker 
Gerrit-Reviewer: Michael Brown 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Nithya Janarthanan 
Gerrit-Reviewer: Sailesh Mukil 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 19 Feb 2019 16:57:16 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] Fix misspelling in stress test code

2019-02-19 Thread Tim Armstrong (Code Review)
Tim Armstrong has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/12520


Change subject: Fix misspelling in stress test code
..

Fix misspelling in stress test code

Change-Id: I410c7ad55693f2fdc6bf7edcc4ae3aeb7a408c43
---
M tests/stress/concurrent_select.py
1 file changed, 7 insertions(+), 7 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/20/12520/1
--
To view, visit http://gerrit.cloudera.org:8080/12520
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I410c7ad55693f2fdc6bf7edcc4ae3aeb7a408c43
Gerrit-Change-Number: 12520
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 


[Impala-ASF-CR] IMPALA-6503: Support reading complex types from ORC

2019-02-19 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/12168 )

Change subject: IMPALA-6503: Support reading complex types from ORC
..


Patch Set 16:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/2154/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/12168
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I244dc9d2b3e425393f90e45632cb8cdbea6cf790
Gerrit-Change-Number: 12168
Gerrit-PatchSet: 16
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 19 Feb 2019 16:08:56 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6503: Support reading complex types from ORC

2019-02-19 Thread Quanlong Huang (Code Review)
Hello Zoltan Borok-Nagy, Tim Armstrong, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/12168

to look at the new patch set (#16).

Change subject: IMPALA-6503: Support reading complex types from ORC
..

IMPALA-6503: Support reading complex types from ORC

We've supported reading primitive types from ORC files (IMPALA-5717).
In this patch we add support for complex types (struct/array/map).

In IMPALA-5717, we leverage the ORC lib to parse ORC binaries (data in
io buffer read from DiskIoMgr). The ORC lib can materialize ORC column
binaries into its representation (orc::ColumnVectorBatch). Then we
transform values in orc::ColumnVectorBatch into impala::Tuples in
hdfs-orc-scanner. We don't need to do anything about decoding/decompression
since they are handled by the ORC lib. Fortunately, the ORC lib already
supports complex types, we can still leverage it to support complex types.

What we need to add in IMPALA-6503 are two things:
1. Specify which nested columns we need in the form required by the ORC
  lib (Get list of ORC type ids from tuple descriptors)
2. Transform outputs of ORC lib (nested orc::ColumnVectorBatch) into
  Impala's representation (Slots/Tuples/RowBatches)

To format the materialization, we implement several ORC column readers
in hdfs-orc-scanner. Each kind of reader treats a column type and
transforms outputs of the ORC lib into tuple/slot values.

Tests:
* Enable existing tests for complex types (test_nested_types.py,
test_tpch_nested_queries.py) for ORC.

Change-Id: I244dc9d2b3e425393f90e45632cb8cdbea6cf790
---
M be/src/exec/CMakeLists.txt
M be/src/exec/hdfs-orc-scanner.cc
M be/src/exec/hdfs-orc-scanner.h
A be/src/exec/orc-column-readers.cc
A be/src/exec/orc-column-readers.h
A be/src/exec/orc-metadata-utils.cc
A be/src/exec/orc-metadata-utils.h
M be/src/runtime/descriptors.cc
M fe/src/main/java/org/apache/impala/catalog/HdfsFileFormat.java
A testdata/ComplexTypesTbl/README
A testdata/ComplexTypesTbl/nonnullable.orc
A testdata/ComplexTypesTbl/nullable.orc
M testdata/bin/create-load-data.sh
M testdata/bin/load_nested.py
M testdata/datasets/functional/functional_schema_template.sql
M testdata/datasets/functional/schema_constraints.csv
M 
testdata/workloads/functional-planner/queries/PlannerTest/complex-types-file-formats.test
M testdata/workloads/functional-query/queries/QueryTest/max-nesting-depth.test
M 
testdata/workloads/functional-query/queries/QueryTest/nested-types-subplan.test
M 
testdata/workloads/functional-query/queries/QueryTest/nested-types-tpch-limit.test
M 
testdata/workloads/functional-query/queries/QueryTest/nested-types-tpch-mem-limit-single-node.test
M 
testdata/workloads/functional-query/queries/QueryTest/nested-types-tpch-mem-limit.test
M testdata/workloads/functional-query/queries/QueryTest/nested-types-tpch.test
M testdata/workloads/tpch_nested/tpch_nested_core.csv
M testdata/workloads/tpch_nested/tpch_nested_dimensions.csv
M testdata/workloads/tpch_nested/tpch_nested_exhaustive.csv
M testdata/workloads/tpch_nested/tpch_nested_pairwise.csv
M tests/query_test/test_nested_types.py
M tests/query_test/test_tpch_nested_queries.py
29 files changed, 1,889 insertions(+), 462 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/68/12168/16
--
To view, visit http://gerrit.cloudera.org:8080/12168
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I244dc9d2b3e425393f90e45632cb8cdbea6cf790
Gerrit-Change-Number: 12168
Gerrit-PatchSet: 16
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Zoltan Borok-Nagy