[Impala-ASF-CR] IMPALA-7614: [DOCS] Document the New Invalidate Options

2018-10-30 Thread Vuk Ercegovac (Code Review)
Vuk Ercegovac has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11809 )

Change subject: IMPALA-7614: [DOCS] Document the New Invalidate Options
..


Patch Set 3:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/11809/3/docs/topics/impala_config_options.xml
File docs/topics/impala_config_options.xml:

http://gerrit.cloudera.org:8080/#/c/11809/3/docs/topics/impala_config_options.xml@355
PS3, Line 355: small
bounded


http://gerrit.cloudera.org:8080/#/c/11809/3/docs/topics/impala_config_options.xml@365
PS3, Line 365: Java garbage collection-based
simplify to: Memory-based


http://gerrit.cloudera.org:8080/#/c/11809/3/docs/topics/impala_config_options.xml@373
PS3, Line 373: but the feature could potentially
 : cause performance risks
do we have standardized phrasing around memory knobs? something like, "may 
require tuning".



--
To view, visit http://gerrit.cloudera.org:8080/11809
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I40c552eeaee81ee6528d9f725bd416b51d8ab837
Gerrit-Change-Number: 11809
Gerrit-PatchSet: 3
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Adrian Ng (389)
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Wed, 31 Oct 2018 05:57:49 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..


Patch Set 5: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/3384/


--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 5
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Wed, 31 Oct 2018 05:56:56 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7794: Rewrite flaky ownership authorization tests

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11837 )

Change subject: IMPALA-7794: Rewrite flaky ownership authorization tests
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1226/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11837
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic98f8dbec41360261fd0339d835f3ce6b504ee29
Gerrit-Change-Number: 11837
Gerrit-PatchSet: 3
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Adam Holley 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Wed, 31 Oct 2018 04:43:28 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7363: Fix buffer usage in ScannerContext::Stream::ReadVLong()

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11828 )

Change subject: IMPALA-7363: Fix buffer usage in 
ScannerContext::Stream::ReadVLong()
..


Patch Set 4:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3385/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/11828
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic2b2ffe4b1d67c63cfcea2baedbeff48e65ca417
Gerrit-Change-Number: 11828
Gerrit-PatchSet: 4
Gerrit-Owner: Pooja Nilangekar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Pooja Nilangekar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 31 Oct 2018 04:21:16 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7363: Fix buffer usage in ScannerContext::Stream::ReadVLong()

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11828 )

Change subject: IMPALA-7363: Fix buffer usage in 
ScannerContext::Stream::ReadVLong()
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/11828
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic2b2ffe4b1d67c63cfcea2baedbeff48e65ca417
Gerrit-Change-Number: 11828
Gerrit-PatchSet: 4
Gerrit-Owner: Pooja Nilangekar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Pooja Nilangekar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 31 Oct 2018 04:21:15 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7794: Rewrite flaky ownership authorization tests

2018-10-30 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/11837


Change subject: IMPALA-7794: Rewrite flaky ownership authorization tests
..

IMPALA-7794: Rewrite flaky ownership authorization tests

This patch rewrites the ownership authorization tests to not depend on
delay and timeout, which can help to avoid the flakiness. The patch also
refactors some tests to reuse Sentry and Impala instances without having
to restart them, which can speed up the tests. To keep the same test
coverage, no tests were removed.

Testing:
- Ran all authorization E2E tests multiple times.

Change-Id: Ic98f8dbec41360261fd0339d835f3ce6b504ee29
---
M tests/authorization/test_grant_revoke.py
M tests/authorization/test_owner_privileges.py
M tests/common/sentry_cache_test_suite.py
3 files changed, 298 insertions(+), 412 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/37/11837/3
--
To view, visit http://gerrit.cloudera.org:8080/11837
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ic98f8dbec41360261fd0339d835f3ce6b504ee29
Gerrit-Change-Number: 11837
Gerrit-PatchSet: 3
Gerrit-Owner: Fredy Wijaya 


[Impala-ASF-CR] IMPALA-7363: Fix buffer usage in ScannerContext::Stream::ReadVLong()

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11828 )

Change subject: IMPALA-7363: Fix buffer usage in 
ScannerContext::Stream::ReadVLong()
..


Patch Set 3: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/3382/


--
To view, visit http://gerrit.cloudera.org:8080/11828
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic2b2ffe4b1d67c63cfcea2baedbeff48e65ca417
Gerrit-Change-Number: 11828
Gerrit-PatchSet: 3
Gerrit-Owner: Pooja Nilangekar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Pooja Nilangekar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 31 Oct 2018 04:04:05 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7655: Rewrite if, isnull, coalesce to use CASE

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11760 )

Change subject: IMPALA-7655: Rewrite if, isnull, coalesce to use CASE
..


Patch Set 10:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1225/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11760
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I526654d8546e17b2545c42cc59dab66d9fe1b163
Gerrit-Change-Number: 11760
Gerrit-PatchSet: 10
Gerrit-Owner: Paul Rogers 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Wed, 31 Oct 2018 02:53:28 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7655: Rewrite if, isnull, coalesce to use CASE

2018-10-30 Thread Paul Rogers (Code Review)
Paul Rogers has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11760 )

Change subject: IMPALA-7655: Rewrite if, isnull, coalesce to use CASE
..


Patch Set 7:

Turns out that there is a BE bug that means CASE is not equivalent to 
coalesce(). Disabled all coalesce() rewrites and tests, restoring original 
behavior from master. Coalesce() can be revisited when IMPALA-7793 is fixed.


--
To view, visit http://gerrit.cloudera.org:8080/11760
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I526654d8546e17b2545c42cc59dab66d9fe1b163
Gerrit-Change-Number: 11760
Gerrit-PatchSet: 7
Gerrit-Owner: Paul Rogers 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Wed, 31 Oct 2018 02:24:15 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7655: Rewrite if, isnull, coalesce to use CASE

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11760 )

Change subject: IMPALA-7655: Rewrite if, isnull, coalesce to use CASE
..


Patch Set 10:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/11760/10/fe/src/main/java/org/apache/impala/rewrite/SimplifyConditionalsRule.java
File fe/src/main/java/org/apache/impala/rewrite/SimplifyConditionalsRule.java:

http://gerrit.cloudera.org:8080/#/c/11760/10/fe/src/main/java/org/apache/impala/rewrite/SimplifyConditionalsRule.java@213
PS10, Line 213:* Simplify COALESCE by skipping leading nulls and applying 
the following transformations:
line too long (92 > 90)


http://gerrit.cloudera.org:8080/#/c/11760/10/fe/src/main/java/org/apache/impala/rewrite/SimplifyConditionalsRule.java@234
PS10, Line 234: List newChildren = 
Lists.newArrayList(expr.getChildren().subList(i, numChildren));
line too long (96 > 90)



--
To view, visit http://gerrit.cloudera.org:8080/11760
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I526654d8546e17b2545c42cc59dab66d9fe1b163
Gerrit-Change-Number: 11760
Gerrit-PatchSet: 10
Gerrit-Owner: Paul Rogers 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Wed, 31 Oct 2018 02:22:52 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7655: Rewrite if, isnull, coalesce to use CASE

2018-10-30 Thread Paul Rogers (Code Review)
Hello Bharath Vissapragada, Philip Zeyliger, Tim Armstrong, Csaba Ringhofer, 
Impala Public Jenkins, Vuk Ercegovac,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/11760

to look at the new patch set (#10).

Change subject: IMPALA-7655: Rewrite if, isnull, coalesce to use CASE
..

IMPALA-7655: Rewrite if, isnull, coalesce to use CASE

See IMPALA-7655 for backgound. Tim found that the current interpreted
forms of if, isnull and coalesce are slow compared to the
code-generated CASE statement.

This patch rewrites the above functions into the equivalent CASE
structure.

The rewrite engine has many bugs that are beyond the scope of
this change to fix. This change codes around those bugs. The
result is that the conditional rewrite happens some of the time,
and sometimes produces less-than perfect optimizations.

Conditionals in the top-level ORDER BY clause are not rewritten
(IMPALA-7753), but those one or more levels down are. Some expressions
involving NULL are not simplified (IMPALA-7769). Several hacks were
used to work around the fact that the rewrite engine ignores unanalyzed
expressions, yet the rewrite engine does not, in general, re-analyze
the expressions it produces, causing simplifications to be skipped
(IMPALA-7754). And so on.

As a result, the BE retains the original interpreted forms that
are still used in two cases: 1) top-level conditions in the
ORDER BY clause, and 2) if the user disables rewrites.

Further, code generation does not occur for CASE statements in the
SELECT clause when it is in the root fragment (the most common case
in simple tests.) This is another known bug (IMPALA-4356).

One possible performance regression is that the new form of the code
evaluates some expressions twice, where the original interpreted
code evaluated the argument once. E.g. coalesce(id, 10) is rewritten
to CASE WHEN id IS NULL THEN id ELSE 10 END. Here, id is evaluated
twice. If the "id" were replaced by a complex sub-expression, the
gain from compilation could be offset by doing work twice.
(IMPALA-7737)

Still, the fix provides most of what the JIRA ticket requested
within the limitations of the existing code.

Conditional function rewrites are moved into a new class,
RewriteConditionalsRule in order to keep things simple.

Most functions use the simplest possible rewrite, relying on the
existing rewrite rules for further simplification.  The one exception
is coalesce(): the existing code relies on the semantics of the
function and so was retained and slightly improved. The code was
extended to produce a CASE statement directly, retaining existing
simplifications.

Tests for conditional functions were in one large function along with
other rewrite tests. Moved them into a new file, then broke up the
tests by function to allow much easier debugging of each function
one-by-one.  This required moving the common test mechanims into a
new common base class.

Existing tests focus on one or two rules at a time. The conditional
function rewrite, however, relies on the entire set of rules being
applied repeatedly. So, added a new FullRewriteTest case to verify this
behavior. This class contains several commented-out tests that cannot
pass due to existing rewrite bugs noted above.

Changing the rewrite cause the PlannerTest to produce different plans
than previously. Changed the expected results file to match the new
rewrite rules.

Change-Id: I526654d8546e17b2545c42cc59dab66d9fe1b163
---
M be/src/exprs/conditional-functions.h
M fe/src/main/java/org/apache/impala/analysis/Analyzer.java
M fe/src/main/java/org/apache/impala/analysis/SelectStmt.java
A fe/src/main/java/org/apache/impala/rewrite/RewriteConditionalFnsRule.java
M fe/src/main/java/org/apache/impala/rewrite/SimplifyConditionalsRule.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeExprsTest.java
A fe/src/test/java/org/apache/impala/analysis/BaseRewriteRulesTest.java
M fe/src/test/java/org/apache/impala/analysis/ExprRewriteRulesTest.java
A fe/src/test/java/org/apache/impala/analysis/FullRewriteTest.java
A fe/src/test/java/org/apache/impala/analysis/RewriteConditionalFnsRuleTest.java
M testdata/workloads/functional-planner/queries/PlannerTest/aggregation.test
M testdata/workloads/functional-planner/queries/PlannerTest/analytic-fns.test
M testdata/workloads/functional-planner/queries/PlannerTest/implicit-joins.test
M testdata/workloads/functional-planner/queries/PlannerTest/joins.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/nested-collections.test
M testdata/workloads/functional-planner/queries/PlannerTest/outer-joins.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/predicate-propagation.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/runtime-filter-propagation.test
18 files changed, 1,014 insertions(+), 282 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF 

[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..


Patch Set 5: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 5
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Wed, 31 Oct 2018 01:54:52 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..


Patch Set 5:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3384/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 5
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Wed, 31 Oct 2018 01:54:53 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..


Patch Set 4:

I'm not sure what happened here:
00:55:59 ] [ERROR] Failed to execute goal 
org.apache.maven.plugins:maven-surefire-plugin:2.18:test (default-test) on 
project impala-frontend: Execution default-test of goal 
org.apache.maven.plugins:maven-surefire-plugin:2.18:test failed: The forked VM 
terminated without properly saying goodbye. VM crash or System.exit called?
00:55:59 ] [ERROR] Command was /bin/sh -c cd /home/ubuntu/Impala/fe && 
/usr/lib/jvm/java-8-openjdk-amd64/jre/bin/java 
-Djava.library.path=/usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64/server:/usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64:/usr/lib/jvm/java-8-openjdk-amd64/jre/../lib/amd64:/usr/lib/x86_64-linux-gnu/:/usr/lib/x86_64-linux-gnu/:/usr/lib/x86_64-linux-gnu/:/usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64:/usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64:/usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64/server:/home/ubuntu/Impala/toolchain/cdh_components-663310/hadoop-3.0.0-cdh6.x-SNAPSHOT//lib/native:/home/ubuntu/Impala/toolchain/snappy-1.1.4/lib:/home/ubuntu/Impala/../Impala-lzo/build:/home/ubuntu/Impala/toolchain/gcc-4.9.2/lib64:/usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64:/usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64:/usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64/server:/home/ubuntu/Impala/toolchain/cdh_components-663310/hadoop-3.0.0-cdh6.x-SNAPSHOT//lib/native:/home/ubuntu/Impala/toolchain/snappy-1.1.4/lib:/home/ubuntu/Impala/../Impala-lzo/build:/home/ubuntu/Impala/toolchain/gcc-4.9.2/lib64:/usr/java/packages/lib/amd64:/usr/lib/x86_64-linux-gnu/jni:/lib/x86_64-linux-gnu:/usr/lib/x86_64-linux-gnu:/usr/lib/jni:/lib:/usr/lib:/home/ubuntu/Impala/be/build/debug/service:/home/ubuntu/Impala/be/build/release/service
 -jar /home/ubuntu/Impala/logs/surefire/surefirebooter574563636434827.jar 
/home/ubuntu/Impala/logs/surefire/surefire7967668628670205081tmp 
/home/ubuntu/Impala/logs/surefire/surefire_07807361286211485039tmp

Going to retry.


--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 4
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Wed, 31 Oct 2018 01:54:31 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7614: [DOCS] Document the New Invalidate Options

2018-10-30 Thread Adrian Ng (Code Review)
Adrian Ng has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11809 )

Change subject: IMPALA-7614: [DOCS] Document the New Invalidate Options
..


Patch Set 3:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/11809/3/docs/topics/impala_config_options.xml
File docs/topics/impala_config_options.xml:

http://gerrit.cloudera.org:8080/#/c/11809/3/docs/topics/impala_config_options.xml@363
PS3, Line 363: specified time period. T
in seconds.



--
To view, visit http://gerrit.cloudera.org:8080/11809
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I40c552eeaee81ee6528d9f725bd416b51d8ab837
Gerrit-Change-Number: 11809
Gerrit-PatchSet: 3
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Adrian Ng (389)
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Comment-Date: Wed, 31 Oct 2018 01:23:54 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7614: [DOCS] Document the New Invalidate Options

2018-10-30 Thread Adrian Ng (Code Review)
Adrian Ng has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11809 )

Change subject: IMPALA-7614: [DOCS] Document the New Invalidate Options
..


Patch Set 3:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/11809/2/docs/topics/impala_config_options.xml
File docs/topics/impala_config_options.xml:

http://gerrit.cloudera.org:8080/#/c/11809/2/docs/topics/impala_config_options.xml@367
PS2, Line 367:  is high
We can say "reaches 60% of JVM heap size" to be specific.


http://gerrit.cloudera.org:8080/#/c/11809/2/docs/topics/impala_config_options.xml@368
PS2, Line 368: certain
We can say invalidates 10% of the least recently used tables.



--
To view, visit http://gerrit.cloudera.org:8080/11809
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I40c552eeaee81ee6528d9f725bd416b51d8ab837
Gerrit-Change-Number: 11809
Gerrit-PatchSet: 3
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Adrian Ng (389)
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Comment-Date: Wed, 31 Oct 2018 01:14:59 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7655: Rewrite if, isnull, coalesce to use CASE

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11760 )

Change subject: IMPALA-7655: Rewrite if, isnull, coalesce to use CASE
..


Patch Set 9:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1224/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11760
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I526654d8546e17b2545c42cc59dab66d9fe1b163
Gerrit-Change-Number: 11760
Gerrit-PatchSet: 9
Gerrit-Owner: Paul Rogers 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Paul Rogers 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Wed, 31 Oct 2018 01:10:09 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7614: [DOCS] Document the New Invalidate Options

2018-10-30 Thread Alex Rodoni (Code Review)
Alex Rodoni has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/11809 )

Change subject: IMPALA-7614: [DOCS] Document the New Invalidate Options
..

IMPALA-7614: [DOCS] Document the New Invalidate Options

--invalidate_tables_timeout_s
--invalidate_tables_on_memory_pressure

Change-Id: I40c552eeaee81ee6528d9f725bd416b51d8ab837
Reviewed-on: http://gerrit.cloudera.org:8080/11809
Reviewed-by: Tianyi Wang 
Tested-by: Impala Public Jenkins 
---
M docs/topics/impala_config_options.xml
1 file changed, 35 insertions(+), 34 deletions(-)

Approvals:
  Tianyi Wang: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/11809
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I40c552eeaee81ee6528d9f725bd416b51d8ab837
Gerrit-Change-Number: 11809
Gerrit-PatchSet: 3
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Adrian Ng (389)
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 


[Impala-ASF-CR] IMPALA-7765: [DOCS] Document IMPALA MAX MEM ESTIMATE FOR ADMISSION option

2018-10-30 Thread Alex Rodoni (Code Review)
Alex Rodoni has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/11804 )

Change subject: IMPALA-7765: [DOCS] Document 
IMPALA_MAX_MEM_ESTIMATE_FOR_ADMISSION option
..

IMPALA-7765: [DOCS] Document IMPALA_MAX_MEM_ESTIMATE_FOR_ADMISSION option

Change-Id: Ibef89c98530c6974dc791666cc51c1ded52e7910
Reviewed-on: http://gerrit.cloudera.org:8080/11804
Tested-by: Impala Public Jenkins 
Reviewed-by: Tim Armstrong 
---
M docs/impala.ditamap
A docs/topics/impala_max_mem_estimate_for_admission.xml
2 files changed, 90 insertions(+), 0 deletions(-)

Approvals:
  Impala Public Jenkins: Verified
  Tim Armstrong: Looks good to me, approved

--
To view, visit http://gerrit.cloudera.org:8080/11804
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Ibef89c98530c6974dc791666cc51c1ded52e7910
Gerrit-Change-Number: 11804
Gerrit-PatchSet: 4
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-2566: Remove ability to access impala builtin cast functions directly

2018-10-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11830 )

Change subject: IMPALA-2566: Remove ability to access impala builtin cast 
functions directly
..


Patch Set 1:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/11830/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/11830/1//COMMIT_MSG@15
PS1, Line 15:
It's weird that we're still showing all these functions in _impala_builtins but 
you can't call any of them. I.e. it doesn't feel like the right way to fix it.

I get that's probably a more invasive change but it feels like trading one kind 
of wonkiness for another - maybe it's better just to fix the narrow 
casttochar() issue for now?

I guess maybe this is easier for now than adding tests for all of the callable 
casttochar* variants (which would be the right thing to do if we left them in 
there) but we should file a follow-on to fix the wonkiness at least.


http://gerrit.cloudera.org:8080/#/c/11830/1/fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java
File fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java:

http://gerrit.cloudera.org:8080/#/c/11830/1/fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java@500
PS1, Line 500: Impala builtin cast functions as reserved for internal use only
> Open to suggestions for a error message
Maybe something like:

"_the_function() is reserved for internal use only. Use 'cast(expr AS type)' 
instead."



--
To view, visit http://gerrit.cloudera.org:8080/11830
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6f562607aaaf728fc417066565811bbd269baaa8
Gerrit-Change-Number: 11830
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 31 Oct 2018 01:03:17 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7614: [DOCS] Document the New Invalidate Options

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11809 )

Change subject: IMPALA-7614: [DOCS] Document the New Invalidate Options
..


Patch Set 2: Verified+1

Build Successful

https://jenkins.impala.io/job/gerrit-docs-auto-test/126/ : Doc tests passed.


-- 
To view, visit http://gerrit.cloudera.org:8080/11809
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I40c552eeaee81ee6528d9f725bd416b51d8ab837
Gerrit-Change-Number: 11809
Gerrit-PatchSet: 2
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Adrian Ng (389)
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Comment-Date: Wed, 31 Oct 2018 00:59:19 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..


Patch Set 4: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/3380/


--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 4
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Wed, 31 Oct 2018 00:56:00 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7765: [DOCS] Document IMPALA MAX MEM ESTIMATE FOR ADMISSION option

2018-10-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11804 )

Change subject: IMPALA-7765: [DOCS] Document 
IMPALA_MAX_MEM_ESTIMATE_FOR_ADMISSION option
..


Patch Set 3: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/11804
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ibef89c98530c6974dc791666cc51c1ded52e7910
Gerrit-Change-Number: 11804
Gerrit-PatchSet: 3
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 31 Oct 2018 00:52:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7614: [DOCS] Document the New Invalidate Options

2018-10-30 Thread Tianyi Wang (Code Review)
Tianyi Wang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11809 )

Change subject: IMPALA-7614: [DOCS] Document the New Invalidate Options
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/11809
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I40c552eeaee81ee6528d9f725bd416b51d8ab837
Gerrit-Change-Number: 11809
Gerrit-PatchSet: 2
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Adrian Ng (389)
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Comment-Date: Wed, 31 Oct 2018 00:49:11 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7614: [DOCS] Document the New Invalidate Options

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11809 )

Change subject: IMPALA-7614: [DOCS] Document the New Invalidate Options
..


Patch Set 2:

Build Started https://jenkins.impala.io/job/gerrit-docs-auto-test/126/

Testing docs change - this change appears to modify docs/ and no code. This is 
experimental - please report any issues to tarmstr...@cloudera.com or on this 
JIRA: IMPALA-7317


--
To view, visit http://gerrit.cloudera.org:8080/11809
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I40c552eeaee81ee6528d9f725bd416b51d8ab837
Gerrit-Change-Number: 11809
Gerrit-PatchSet: 2
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Adrian Ng (389)
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Comment-Date: Wed, 31 Oct 2018 00:48:17 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7614: [DOCS] Document the New Invalidate Options

2018-10-30 Thread Alex Rodoni (Code Review)
Hello Tianyi Wang, Adrian Ng, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/11809

to look at the new patch set (#2).

Change subject: IMPALA-7614: [DOCS] Document the New Invalidate Options
..

IMPALA-7614: [DOCS] Document the New Invalidate Options

--invalidate_tables_timeout_s
--invalidate_tables_on_memory_pressure

Change-Id: I40c552eeaee81ee6528d9f725bd416b51d8ab837
---
M docs/topics/impala_config_options.xml
1 file changed, 35 insertions(+), 34 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/09/11809/2
--
To view, visit http://gerrit.cloudera.org:8080/11809
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I40c552eeaee81ee6528d9f725bd416b51d8ab837
Gerrit-Change-Number: 11809
Gerrit-PatchSet: 2
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Adrian Ng (389)
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 


[Impala-ASF-CR] IMPALA-7614: [DOCS] Document the New Invalidate Options

2018-10-30 Thread Alex Rodoni (Code Review)
Alex Rodoni has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11809 )

Change subject: IMPALA-7614: [DOCS] Document the New Invalidate Options
..


Patch Set 1:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/11809/1/docs/topics/impala_config_options.xml
File docs/topics/impala_config_options.xml:

http://gerrit.cloudera.org:8080/#/c/11809/1/docs/topics/impala_config_options.xml@356
PS1, Line 356: invalidate
> invalidates
Done


http://gerrit.cloudera.org:8080/#/c/11809/1/docs/topics/impala_config_options.xml@366
PS1, Line 366: When
 :   the size of the garbage collection of 
catalogd
 :   exceeds a certain threshold, Impala invalidates a 
certain fraction of
 :   the least recently used tables.
> "size of the garbage collection" isn't accurate, how about:
Done



--
To view, visit http://gerrit.cloudera.org:8080/11809
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I40c552eeaee81ee6528d9f725bd416b51d8ab837
Gerrit-Change-Number: 11809
Gerrit-PatchSet: 1
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Adrian Ng (389)
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Comment-Date: Wed, 31 Oct 2018 00:48:18 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7363: Fix buffer usage in ScannerContext::Stream::ReadVLong()

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11828 )

Change subject: IMPALA-7363: Fix buffer usage in 
ScannerContext::Stream::ReadVLong()
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1223/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11828
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic2b2ffe4b1d67c63cfcea2baedbeff48e65ca417
Gerrit-Change-Number: 11828
Gerrit-PatchSet: 2
Gerrit-Owner: Pooja Nilangekar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Pooja Nilangekar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 31 Oct 2018 00:43:57 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7614: [DOCS] Document the New Invalidate Options

2018-10-30 Thread Tianyi Wang (Code Review)
Tianyi Wang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11809 )

Change subject: IMPALA-7614: [DOCS] Document the New Invalidate Options
..


Patch Set 1:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/11809/1/docs/topics/impala_config_options.xml
File docs/topics/impala_config_options.xml:

http://gerrit.cloudera.org:8080/#/c/11809/1/docs/topics/impala_config_options.xml@356
PS1, Line 356: invalidate
invalidates


http://gerrit.cloudera.org:8080/#/c/11809/1/docs/topics/impala_config_options.xml@366
PS1, Line 366: When
 :   the size of the garbage collection of 
catalogd
 :   exceeds a certain threshold, Impala invalidates a 
certain fraction of
 :   the least recently used tables.
"size of the garbage collection" isn't accurate, how about:
"When the memory pressure is high after a Java garbage collection in catalogd, 
Impala invalidates ... "



--
To view, visit http://gerrit.cloudera.org:8080/11809
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I40c552eeaee81ee6528d9f725bd416b51d8ab837
Gerrit-Change-Number: 11809
Gerrit-PatchSet: 1
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Adrian Ng (389)
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Comment-Date: Wed, 31 Oct 2018 00:42:31 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7655: Rewrite if, isnull, coalesce to use CASE

2018-10-30 Thread Vuk Ercegovac (Code Review)
Vuk Ercegovac has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11760 )

Change subject: IMPALA-7655: Rewrite if, isnull, coalesce to use CASE
..


Patch Set 9:

(10 comments)

still reviewing the tests.

http://gerrit.cloudera.org:8080/#/c/11760/8//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/11760/8//COMMIT_MSG@43
PS8, Line 43: (IMPALA-7737)
are there examples of these fns in our benchmarks to quantify the regression? 
if so, would be useful to see the effect.


http://gerrit.cloudera.org:8080/#/c/11760/8//COMMIT_MSG@45
PS8, Line 45: Still, the fix provides most of what the JIRA ticket requested
I'd skip these next three paragraphs.


http://gerrit.cloudera.org:8080/#/c/11760/8//COMMIT_MSG@57
PS8, Line 57:
pls make a section for this called "Testing" so its easier to jump to. also, 
pls condense these so that they're easier to skim. for example:
- split up tests for conditional functions to make them easier to test
- added unit tests for end-to-end rewrite rule interactions
- updated existing planner tests due to rewrites


http://gerrit.cloudera.org:8080/#/c/11760/8/be/src/exprs/conditional-functions.h
File be/src/exprs/conditional-functions.h:

http://gerrit.cloudera.org:8080/#/c/11760/8/be/src/exprs/conditional-functions.h@76
PS8, Line 76: since
: /// various bugs mean that this implementation is still sometimes 
used. But
: /// the goal is to remove these classes at some point.
simpler: "until their use is eliminated by the frontend".


http://gerrit.cloudera.org:8080/#/c/11760/8/fe/src/main/java/org/apache/impala/rewrite/RewriteConditionalFnsRule.java
File fe/src/main/java/org/apache/impala/rewrite/RewriteConditionalFnsRule.java:

http://gerrit.cloudera.org:8080/#/c/11760/8/fe/src/main/java/org/apache/impala/rewrite/RewriteConditionalFnsRule.java@37
PS8, Line 37: vanish
is this accurate given the comments in the commit message about order by?


http://gerrit.cloudera.org:8080/#/c/11760/8/fe/src/main/java/org/apache/impala/rewrite/RewriteConditionalFnsRule.java@48
PS8, Line 48: planner runs the rule to simplify CASE
:  * after this rule. Where that other rule can perform 
simplifications,
:  * those simplifications are omitted here
simplify and use the specific rule name for concreteness.


http://gerrit.cloudera.org:8080/#/c/11760/8/fe/src/main/java/org/apache/impala/rewrite/RewriteConditionalFnsRule.java@106
PS8, Line 106:  return rewriteIfNullFn(expr)
clarify whether you think this happens after the rewrite or before. If its 
after, then I expect the example on L109,110 to be in terms of CASE. I'm also 
fine with omitting the example since its assumed that these rules compose.


http://gerrit.cloudera.org:8080/#/c/11760/8/fe/src/main/java/org/apache/impala/rewrite/RewriteConditionalFnsRule.java@130
PS8, Line 130:expr.get
isn't this all that's done here (the most general case) and we'll depend on 
other rewrites for further simplifications?


http://gerrit.cloudera.org:8080/#/c/11760/8/fe/src/main/java/org/apache/impala/rewrite/RewriteConditionalFnsRule.java@172
PS8, Line 172:
The simplest rewrite here would be to not look at the child exprs for the 
various scenarios listed above and instead simply translate naively to a case 
statement. From there, we'd get constant folding and case simplification which 
will find the first when clause that evals to true. preceding when clauses that 
remain unknown will be retained, but this transform will need to retain them as 
well. Aggregate handling will result in a brute-force roll-back of the rewrite 
in case simplification, which will result in falling back to the case rewrite 
here. Might want to handle that situation by retaining the coalesce for now. So 
besides that issue, what else do we miss by doing the simple thing and rely on 
case simplification?


http://gerrit.cloudera.org:8080/#/c/11760/8/testdata/workloads/functional-planner/queries/PlannerTest/predicate-propagation.test
File 
testdata/workloads/functional-planner/queries/PlannerTest/predicate-propagation.test:

http://gerrit.cloudera.org:8080/#/c/11760/8/testdata/workloads/functional-planner/queries/PlannerTest/predicate-propagation.test@1137
PS8, Line 1137: |  other predicates: functional.alltypestiny.tinyint_col + 
functional.alltypestiny.smallint_col + functional.alltypestiny.int_col > 10, 
CASE WHEN functional.alltypestiny.tinyint_col + 
functional.alltypestiny.bigint_col IS NULL THEN 1 ELSE 
functional.alltypestiny.tinyint_col + functional.alltypestiny.bigint_col END = 1
so this is the example of the performance regression (same work on multiple 
when clauses)?



--
To view, visit http://gerrit.cloudera.org:8080/11760
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I526654d8546e17b2545c42cc59dab66d9fe1b163

[Impala-ASF-CR] IMPALA-7655: Rewrite if, isnull, coalesce to use CASE

2018-10-30 Thread Paul Rogers (Code Review)
Hello Bharath Vissapragada, Philip Zeyliger, Tim Armstrong, Csaba Ringhofer, 
Impala Public Jenkins, Vuk Ercegovac,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/11760

to look at the new patch set (#9).

Change subject: IMPALA-7655: Rewrite if, isnull, coalesce to use CASE
..

IMPALA-7655: Rewrite if, isnull, coalesce to use CASE

See IMPALA-7655 for backgound. Tim found that the current interpreted
forms of if, isnull and coalesce are slow compared to the
code-generated CASE statement.

This patch rewrites the above functions into the equivalent CASE
structure.

The rewrite engine has many bugs that are beyond the scope of
this change to fix. This change codes around those bugs. The
result is that the conditional rewrite happens some of the time,
and sometimes produces less-than perfect optimizations.

Conditionals in the top-level ORDER BY clause are not rewritten
(IMPALA-7753), but those one or more levels down are. Some expressions
involving NULL are not simplified (IMPALA-7769). Several hacks were
used to work around the fact that the rewrite engine ignores unanalyzed
expressions, yet the rewrite engine does not, in general, re-analyze
the expressions it produces, causing simplifications to be skipped
(IMPALA-7754). And so on.

As a result, the BE retains the original interpreted forms that
are still used in two cases: 1) top-level conditions in the
ORDER BY clause, and 2) if the user disables rewrites.

Further, code generation does not occur for CASE statements in the
SELECT clause when it is in the root fragment (the most common case
in simple tests.) This is another known bug (IMPALA-4356).

One possible performance regression is that the new form of the code
evaluates some expressions twice, where the original interpreted
code evaluated the argument once. E.g. coalesce(id, 10) is rewritten
to CASE WHEN id IS NULL THEN id ELSE 10 END. Here, id is evaluated
twice. If the "id" were replaced by a complex sub-expression, the
gain from compilation could be offset by doing work twice.
(IMPALA-7737)

Still, the fix provides most of what the JIRA ticket requested
within the limitations of the existing code.

Conditional function rewrites are moved into a new class,
RewriteConditionalsRule in order to keep things simple.

Most functions use the simplest possible rewrite, relying on the
existing rewrite rules for further simplification.  The one exception
is coalesce(): the existing code relies on the semantics of the
function and so was retained and slightly improved. The code was
extended to produce a CASE statement directly, retaining existing
simplifications.

Tests for conditional functions were in one large function along with
other rewrite tests. Moved them into a new file, then broke up the
tests by function to allow much easier debugging of each function
one-by-one.  This required moving the common test mechanims into a
new common base class.

Existing tests focus on one or two rules at a time. The conditional
function rewrite, however, relies on the entire set of rules being
applied repeatedly. So, added a new FullRewriteTest case to verify this
behavior. This class contains several commented-out tests that cannot
pass due to existing rewrite bugs noted above.

Changing the rewrite cause the PlannerTest to produce different plans
than previously. Changed the expected results file to match the new
rewrite rules.

Change-Id: I526654d8546e17b2545c42cc59dab66d9fe1b163
---
M be/src/exprs/conditional-functions.h
M fe/src/main/java/org/apache/impala/analysis/Analyzer.java
M fe/src/main/java/org/apache/impala/analysis/SelectStmt.java
A fe/src/main/java/org/apache/impala/rewrite/RewriteConditionalFnsRule.java
M fe/src/main/java/org/apache/impala/rewrite/SimplifyConditionalsRule.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeExprsTest.java
A fe/src/test/java/org/apache/impala/analysis/BaseRewriteRulesTest.java
M fe/src/test/java/org/apache/impala/analysis/ExprRewriteRulesTest.java
A fe/src/test/java/org/apache/impala/analysis/FullRewriteTest.java
A fe/src/test/java/org/apache/impala/analysis/RewriteConditionalFnsRuleTest.java
M testdata/workloads/functional-planner/queries/PlannerTest/aggregation.test
M testdata/workloads/functional-planner/queries/PlannerTest/analytic-fns.test
M testdata/workloads/functional-planner/queries/PlannerTest/implicit-joins.test
M testdata/workloads/functional-planner/queries/PlannerTest/joins.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/nested-collections.test
M testdata/workloads/functional-planner/queries/PlannerTest/outer-joins.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/predicate-propagation.test
M 
testdata/workloads/functional-planner/queries/PlannerTest/runtime-filter-propagation.test
18 files changed, 946 insertions(+), 313 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF 

[Impala-ASF-CR] IMPALA-7743: [DOCS] A new option to load incremental statistics from catalog

2018-10-30 Thread Alex Rodoni (Code Review)
Alex Rodoni has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/11790 )

Change subject: IMPALA-7743: [DOCS] A new option to load incremental statistics 
from catalog
..

IMPALA-7743: [DOCS] A new option to load incremental statistics from catalog

--pull_incremental_statistics described in the Incremental Stats section.

Change-Id: I8fd9b88138350406065df2f39a48043178759949
Reviewed-on: http://gerrit.cloudera.org:8080/11790
Reviewed-by: Greg Rahn 
Tested-by: Impala Public Jenkins 
---
M docs/shared/impala_common.xml
M docs/topics/impala_perf_stats.xml
2 files changed, 74 insertions(+), 46 deletions(-)

Approvals:
  Greg Rahn: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/11790
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I8fd9b88138350406065df2f39a48043178759949
Gerrit-Change-Number: 11790
Gerrit-PatchSet: 4
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Adrian Ng (389)
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Greg Rahn 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Vuk Ercegovac 


[Impala-ASF-CR] IMPALA-7363: Fix buffer usage in ScannerContext::Stream::ReadVLong()

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11828 )

Change subject: IMPALA-7363: Fix buffer usage in 
ScannerContext::Stream::ReadVLong()
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3382/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/11828
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic2b2ffe4b1d67c63cfcea2baedbeff48e65ca417
Gerrit-Change-Number: 11828
Gerrit-PatchSet: 3
Gerrit-Owner: Pooja Nilangekar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Pooja Nilangekar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 31 Oct 2018 00:10:35 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7363: Fix buffer usage in ScannerContext::Stream::ReadVLong()

2018-10-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11828 )

Change subject: IMPALA-7363: Fix buffer usage in 
ScannerContext::Stream::ReadVLong()
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/11828
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic2b2ffe4b1d67c63cfcea2baedbeff48e65ca417
Gerrit-Change-Number: 11828
Gerrit-PatchSet: 2
Gerrit-Owner: Pooja Nilangekar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Pooja Nilangekar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 31 Oct 2018 00:10:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7363: Fix buffer usage in ScannerContext::Stream::ReadVLong()

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11828 )

Change subject: IMPALA-7363: Fix buffer usage in 
ScannerContext::Stream::ReadVLong()
..


Patch Set 3: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/11828
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic2b2ffe4b1d67c63cfcea2baedbeff48e65ca417
Gerrit-Change-Number: 11828
Gerrit-PatchSet: 3
Gerrit-Owner: Pooja Nilangekar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Pooja Nilangekar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 31 Oct 2018 00:10:34 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7687: [DOCS] Support for multiple DISTINCT in a query

2018-10-30 Thread Alex Rodoni (Code Review)
Alex Rodoni has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/11823 )

Change subject: IMPALA-7687: [DOCS] Support for multiple DISTINCT in a query
..

IMPALA-7687: [DOCS] Support for multiple DISTINCT in a query

- Removed notes about the single DISTINCT restriction.
- Rewrote the description for the APPX_COUNT_DISTINCT query option.

Change-Id: I3a6e664b016e9408a3ff809f1811253a91764481
Reviewed-on: http://gerrit.cloudera.org:8080/11823
Tested-by: Impala Public Jenkins 
Reviewed-by: Thomas Marshall 
---
M docs/shared/impala_common.xml
M docs/topics/impala_appx_count_distinct.xml
M docs/topics/impala_count.xml
M docs/topics/impala_distinct.xml
M docs/topics/impala_langref_unsupported.xml
M docs/topics/impala_select.xml
6 files changed, 33 insertions(+), 111 deletions(-)

Approvals:
  Impala Public Jenkins: Verified
  Thomas Marshall: Looks good to me, approved

--
To view, visit http://gerrit.cloudera.org:8080/11823
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I3a6e664b016e9408a3ff809f1811253a91764481
Gerrit-Change-Number: 11823
Gerrit-PatchSet: 2
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 


[Impala-ASF-CR] IMPALA-7363: Fix buffer usage in ScannerContext::Stream::ReadVLong()

2018-10-30 Thread Pooja Nilangekar (Code Review)
Pooja Nilangekar has uploaded a new patch set (#2). ( 
http://gerrit.cloudera.org:8080/11828 )

Change subject: IMPALA-7363: Fix buffer usage in 
ScannerContext::Stream::ReadVLong()
..

IMPALA-7363: Fix buffer usage in ScannerContext::Stream::ReadVLong()

Previously, the ReadVLong() function would read a byte and use it
after subsequent calls to ReadBytes() function. As a result, the
value was invalidated depending on the contents of the
boundary_buffer_. This change ensures that all the functions on
the buffer are invoked before subsequent calls to ReadBytes().

Testing:
Enabled test_tpch_scan_ranges() for sequence files and ran it
multiple times to ensure it always produces the desired result.
Manually verified the buffer usage of other calls to ReadBytes()
and GetBytes().

Change-Id: Ic2b2ffe4b1d67c63cfcea2baedbeff48e65ca417
---
M be/src/exec/scanner-context.h
M be/src/exec/scanner-context.inline.h
M tests/query_test/test_scanners.py
3 files changed, 11 insertions(+), 11 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/28/11828/2
--
To view, visit http://gerrit.cloudera.org:8080/11828
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ic2b2ffe4b1d67c63cfcea2baedbeff48e65ca417
Gerrit-Change-Number: 11828
Gerrit-PatchSet: 2
Gerrit-Owner: Pooja Nilangekar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Pooja Nilangekar 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-7363: Fix buffer usage in ScannerContext::Stream::ReadVLong()

2018-10-30 Thread Pooja Nilangekar (Code Review)
Pooja Nilangekar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11828 )

Change subject: IMPALA-7363: Fix buffer usage in 
ScannerContext::Stream::ReadVLong()
..


Patch Set 2:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/11828/1/be/src/exec/scanner-context.inline.h
File be/src/exec/scanner-context.inline.h:

http://gerrit.cloudera.org:8080/#/c/11828/1/be/src/exec/scanner-context.inline.h@169
PS1, Line 169:   if (is_negative) *value = *value ^ (static_cast(-1));
> nit: could put conditional on one line.
Done



--
To view, visit http://gerrit.cloudera.org:8080/11828
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic2b2ffe4b1d67c63cfcea2baedbeff48e65ca417
Gerrit-Change-Number: 11828
Gerrit-PatchSet: 2
Gerrit-Owner: Pooja Nilangekar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Pooja Nilangekar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 23:54:20 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7363: Fix buffer usage in ScannerContext::Stream::ReadVLong()

2018-10-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11828 )

Change subject: IMPALA-7363: Fix buffer usage in 
ScannerContext::Stream::ReadVLong()
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/11828/1/be/src/exec/scanner-context.inline.h
File be/src/exec/scanner-context.inline.h:

http://gerrit.cloudera.org:8080/#/c/11828/1/be/src/exec/scanner-context.inline.h@169
PS1, Line 169:   if (is_negative) {
nit: could put conditional on one line.



--
To view, visit http://gerrit.cloudera.org:8080/11828
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic2b2ffe4b1d67c63cfcea2baedbeff48e65ca417
Gerrit-Change-Number: 11828
Gerrit-PatchSet: 1
Gerrit-Owner: Pooja Nilangekar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Pooja Nilangekar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 23:47:59 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6658: improve Parquet RLE for low bit widths

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11582 )

Change subject: IMPALA-6658: improve Parquet RLE for low bit widths
..


Patch Set 7:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1222/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11582
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I191a581d3f699b6669e48ac9dc39c76ed77c4a76
Gerrit-Change-Number: 11582
Gerrit-PatchSet: 7
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Comment-Date: Tue, 30 Oct 2018 23:16:07 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6658: improve Parquet RLE for low bit widths

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11582 )

Change subject: IMPALA-6658: improve Parquet RLE for low bit widths
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1221/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11582
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I191a581d3f699b6669e48ac9dc39c76ed77c4a76
Gerrit-Change-Number: 11582
Gerrit-PatchSet: 6
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Comment-Date: Tue, 30 Oct 2018 23:11:24 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6658: improve Parquet RLE for low bit widths

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11582 )

Change subject: IMPALA-6658: improve Parquet RLE for low bit widths
..


Patch Set 5:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1220/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11582
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I191a581d3f699b6669e48ac9dc39c76ed77c4a76
Gerrit-Change-Number: 11582
Gerrit-PatchSet: 5
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Comment-Date: Tue, 30 Oct 2018 23:05:48 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-2566: Remove ability to access impala builtin cast functions directly

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11830 )

Change subject: IMPALA-2566: Remove ability to access impala builtin cast 
functions directly
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1219/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11830
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6f562607aaaf728fc417066565811bbd269baaa8
Gerrit-Change-Number: 11830
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tianyi Wang 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 22:58:04 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7687: [DOCS] Support for multiple DISTINCT in a query

2018-10-30 Thread Thomas Marshall (Code Review)
Thomas Marshall has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11823 )

Change subject: IMPALA-7687: [DOCS] Support for multiple DISTINCT in a query
..


Patch Set 1: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/11823
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3a6e664b016e9408a3ff809f1811253a91764481
Gerrit-Change-Number: 11823
Gerrit-PatchSet: 1
Gerrit-Owner: Alex Rodoni 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Vuk Ercegovac 
Gerrit-Comment-Date: Tue, 30 Oct 2018 22:57:35 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7363: Fix buffer usage in ScannerContext::Stream::ReadVLong()

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11828 )

Change subject: IMPALA-7363: Fix buffer usage in 
ScannerContext::Stream::ReadVLong()
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1218/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11828
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic2b2ffe4b1d67c63cfcea2baedbeff48e65ca417
Gerrit-Change-Number: 11828
Gerrit-PatchSet: 1
Gerrit-Owner: Pooja Nilangekar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Pooja Nilangekar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 22:51:33 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6658: improve Parquet RLE for low bit widths

2018-10-30 Thread Andrew Sherman (Code Review)
Andrew Sherman has uploaded a new patch set (#7). ( 
http://gerrit.cloudera.org:8080/11582 )

Change subject: IMPALA-6658: improve Parquet RLE for low bit widths
..

IMPALA-6658: improve Parquet RLE for low bit widths

RleEncoder buffers values in its own cache to detect run lengths that
can be efficiently encoded. When a run is detected it is written with an
indicator byte which encodes the length of the run. So an encoded
run always has an overhead of at least one byte. This means that for
single bit values, encoding 8 values as a run is inefficient.

Change RleEncoder to have the ability to use run lengths other than 8.
A new parameter to the constructor (min_run_length) allows test callers
(only) to set the minimum run length.

By default RleEncoder will now use run length encoding for runs of
length 16 for single bit values. All other bit widths will use the
existing length 8 runs.

Internally RleEncoder must buffer more values so that the longer runs
can be detected. The internal buffer “buffered_values_” is larger
and is now a circular buffer so that the first 8 bytes of the buffer can
be separately flushed to BitWriter.

Testing:

All end-to-end and unit tests pass

The unit test rle-test is enhanced to run all tests against RleEncoders
using all possible values of min_run_length. In Addition, rle-test is
refactored so that the Rle tests are in a class that inherits from
::testing::Test so that a SetUp() method can be used.
The Overflow test is enhanced to be more exhaustive (while still
completing in a second or two).

Change-Id: I191a581d3f699b6669e48ac9dc39c76ed77c4a76
---
M be/src/util/rle-encoding.h
M be/src/util/rle-test.cc
2 files changed, 499 insertions(+), 255 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/82/11582/7
--
To view, visit http://gerrit.cloudera.org:8080/11582
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I191a581d3f699b6669e48ac9dc39c76ed77c4a76
Gerrit-Change-Number: 11582
Gerrit-PatchSet: 7
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 


[Impala-ASF-CR] IMPALA-6658: improve Parquet RLE for low bit widths

2018-10-30 Thread Andrew Sherman (Code Review)
Andrew Sherman has uploaded a new patch set (#6). ( 
http://gerrit.cloudera.org:8080/11582 )

Change subject: IMPALA-6658: improve Parquet RLE for low bit widths
..

IMPALA-6658: improve Parquet RLE for low bit widths

RleEncoder buffers values in its own cache to detect run lengths that
can be efficiently encoded. When a run is detected it is written with an
indicator byte which encodes the length of the run. So an encoded
run always has an overhead of at least one byte. This means that for
single bit values, encoding 8 values as a run is inefficient.

Change RleEncoder to have the ability to use run lengths other than 8.
A new parameter to the constructor (min_run_length) allows test callers
(only) to set the minimum run length.

By default RleEncoder will now use run length encoding for runs of
length 16 for single bit values. All other bit widths will use the
existing length 8 runs.

Internally RleEncoder must buffer more values so that the longer runs
can be detected. The internal buffer “buffered_values_” is larger
and is now a circular buffer so that the first 8 bytes of the buffer can
be separately flushed to BitWriter.

Testing:

All end-to-end and unit tests pass

The unit test rle-test is enhanced to run all tests against RleEncoders
using all possible values of min_run_length. In Addition, rle-test is
refactored so that the Rle tests are in a class that inherits from
::testing::Test so that a SetUp() method can be used.
The Overflow test is enhanced to be more exhaustive (while still
completing in a second or two).

Change-Id: I191a581d3f699b6669e48ac9dc39c76ed77c4a76
---
M be/src/util/rle-encoding.h
M be/src/util/rle-test.cc
2 files changed, 484 insertions(+), 256 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/82/11582/6
--
To view, visit http://gerrit.cloudera.org:8080/11582
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I191a581d3f699b6669e48ac9dc39c76ed77c4a76
Gerrit-Change-Number: 11582
Gerrit-PatchSet: 6
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 


[Impala-ASF-CR] IMPALA-6658: improve Parquet RLE for low bit widths

2018-10-30 Thread Andrew Sherman (Code Review)
Andrew Sherman has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11582 )

Change subject: IMPALA-6658: improve Parquet RLE for low bit widths
..


Patch Set 2:

(4 comments)

Thanks Csaba

http://gerrit.cloudera.org:8080/#/c/11582/2//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/11582/2//COMMIT_MSG@20
PS2, Line 20: By default RleEncoder will now use run length encoding for runs of
: length 24 for single bit values, and of length 16 for 2 bit wide 
values.
: All other bit widths will use the existing length 8 runs.
> Ok, the biggest win (50% size for alternating runs of 8 in the 1 bit case)
So you want me to write something like this in the jira?

If you know the structure of the data then better encodings are
possible. For example with bit_width=1, using min_run_length=24 is
better in the case where we avoid interrupting a literal run.

Using the notation of 'RXX' for a repeated run of length XX (so R16 is a
run of lngth 16), and 'LYY' for a literal run of length YY.

 L24 R16 L24 R16 L24
min_run_length 8 4   2   4   2   4
min_run_length 16 (new default)  4   2   4   2   4
min_run_length 244   2   3   2   3 (one long literal run)

So it is possible to optimize by detecting this situation and avoiding
breaking a long literal run for a run of length 16.


http://gerrit.cloudera.org:8080/#/c/11582/4/be/src/util/rle-encoding.h
File be/src/util/rle-encoding.h:

http://gerrit.cloudera.org:8080/#/c/11582/4/be/src/util/rle-encoding.h@250
PS4, Line 250: iter
> Can you simplify this expression? MAX_VALUES_PER_LITERAL_RUN must be divisi
Good idea


http://gerrit.cloudera.org:8080/#/c/11582/4/be/src/util/rle-encoding.h@264
PS4, Line 264: insertin
> Aren't we double counting the indicator byte here? My assumption is that th
I think this is correct as is but I am changing it to the clearer
literal_max_size = num_runs * (1 + bytes_per_run)
-- 1 is the indicator
-- bytes_per_run is the encoded bytes


http://gerrit.cloudera.org:8080/#/c/11582/4/be/src/util/rle-encoding.h@265
PS4, Line 265: /  1 byte for the encoded run length
> This seems to assume that a single big literal run is the worsts case - can
I think before this change a long literal was not always the worst case, but 
now it is.



--
To view, visit http://gerrit.cloudera.org:8080/11582
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I191a581d3f699b6669e48ac9dc39c76ed77c4a76
Gerrit-Change-Number: 11582
Gerrit-PatchSet: 2
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Comment-Date: Tue, 30 Oct 2018 22:22:19 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6658: improve Parquet RLE for low bit widths

2018-10-30 Thread Andrew Sherman (Code Review)
Andrew Sherman has uploaded a new patch set (#5). ( 
http://gerrit.cloudera.org:8080/11582 )

Change subject: IMPALA-6658: improve Parquet RLE for low bit widths
..

IMPALA-6658: improve Parquet RLE for low bit widths

RleEncoder buffers values in its own cache to detect run lengths that
can be efficiently encoded. When a run is detected it is written with an
indicator byte which encodes the length of the run. So an encoded
run always has an overhead of at least one byte. This means that for
single bit values, encoding 8 values as a run is inefficient.

Change RleEncoder to have the ability to use run lengths other than 8.
A new parameter to the constructor (min_run_length) allows test callers
(only) to set the minimum run length.

By default RleEncoder will now use run length encoding for runs of
length 16 for single bit values. All other bit widths will use the
existing length 8 runs.

Internally RleEncoder must buffer more values so that the longer runs
can be detected. The internal buffer “buffered_values_” is larger
and is now a circular buffer so that the first 8 bytes of the buffer can
be separately flushed to BitWriter.

Testing:

All end-to-end and unit tests pass

The unit test rle-test is enhanced to run all tests against RleEncoders
using all possible values of min_run_length. In Addition, rle-test is
refactored so that the Rle tests are in a class that inherits from
::testing::Test so that a SetUp() method can be used.
The Overflow test is enhanced to be more exhaustive (while still
completing in a second or two).

Change-Id: I191a581d3f699b6669e48ac9dc39c76ed77c4a76
---
M be/src/util/rle-encoding.h
M be/src/util/rle-test.cc
2 files changed, 510 insertions(+), 255 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/82/11582/5
--
To view, visit http://gerrit.cloudera.org:8080/11582
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I191a581d3f699b6669e48ac9dc39c76ed77c4a76
Gerrit-Change-Number: 11582
Gerrit-PatchSet: 5
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 


[Impala-ASF-CR] IMPALA-2566: Remove ability to access impala builtin cast functions directly

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11830 )

Change subject: IMPALA-2566: Remove ability to access impala builtin cast 
functions directly
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/11830/1/fe/src/test/java/org/apache/impala/analysis/AnalyzeStmtsTest.java
File fe/src/test/java/org/apache/impala/analysis/AnalyzeStmtsTest.java:

http://gerrit.cloudera.org:8080/#/c/11830/1/fe/src/test/java/org/apache/impala/analysis/AnalyzeStmtsTest.java@3962
PS1, Line 3962: String expectedError = "Impala builtin cast functions as 
reserved for internal use only";
line too long (93 > 90)



--
To view, visit http://gerrit.cloudera.org:8080/11830
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6f562607aaaf728fc417066565811bbd269baaa8
Gerrit-Change-Number: 11830
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 22:20:30 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-2566: Remove ability to access impala builtin cast functions directly

2018-10-30 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11830 )

Change subject: IMPALA-2566: Remove ability to access impala builtin cast 
functions directly
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/11830/1/fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java
File fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java:

http://gerrit.cloudera.org:8080/#/c/11830/1/fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java@500
PS1, Line 500: Impala builtin cast functions as reserved for internal use only
Open to suggestions for a error message



--
To view, visit http://gerrit.cloudera.org:8080/11830
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6f562607aaaf728fc417066565811bbd269baaa8
Gerrit-Change-Number: 11830
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Comment-Date: Tue, 30 Oct 2018 22:20:15 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-2566: Remove ability to access impala builtin cast functions directly

2018-10-30 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/11830


Change subject: IMPALA-2566: Remove ability to access impala builtin cast 
functions directly
..

IMPALA-2566: Remove ability to access impala builtin cast functions
directly

Currently, all impala builtin cast functions are exposed to the user
and can be used directly in sql statements. These methods are used
internally for casting and sometimes require context not available
when used directly. This patch removes direct access to those methods
by throwing an analysis exception.

Testing:
Added relevant frontend tests.

Change-Id: I6f562607aaaf728fc417066565811bbd269baaa8
---
M fe/src/main/java/org/apache/impala/analysis/FunctionCallExpr.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeStmtsTest.java
2 files changed, 26 insertions(+), 0 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/30/11830/1
--
To view, visit http://gerrit.cloudera.org:8080/11830
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I6f562607aaaf728fc417066565811bbd269baaa8
Gerrit-Change-Number: 11830
Gerrit-PatchSet: 1
Gerrit-Owner: Bikramjeet Vig 


[Impala-ASF-CR] IMPALA-6374: fix handling of commas in .test files

2018-10-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/11800 )

Change subject: IMPALA-6374: fix handling of commas in .test files
..

IMPALA-6374: fix handling of commas in .test files

The .test file parser implemented an unconventional method for parsing
single-quoted strings in comma-separated value format. This didn't handle
trailing commas in the string correctly.

This commit switches to using a conventional method for parsing
comma-separated value format:
* Commas enclosed by single quotes are not treated as field separators
* Single quotes can be escaped within a string by doubling them.

I looked into using Python's .csv module for this, but it wouldn't
work without modifying the test file format more because it
automatically discards the quotes during parsing, which are actually
semantically important in .test files. E.g. without the quotes we can't
distinguish between the literal string 'regex:...' and the regex
regex:

Testing:
Ran exhaustive tests and fixed .test files that required modifications.
Will rerun before merging.

Added a couple of tests to exercise edge cases in the test file parser.

Change-Id: I18ddcb0440490ddf8184be66d3681038a1615dd9
Reviewed-on: http://gerrit.cloudera.org:8080/11800
Reviewed-by: Impala Public Jenkins 
Tested-by: Tim Armstrong 
---
M testdata/workloads/functional-query/queries/QueryTest/functions-ddl.test
M testdata/workloads/functional-query/queries/QueryTest/grant_revoke.test
M testdata/workloads/functional-query/queries/QueryTest/hbase-inserts.test
M testdata/workloads/functional-query/queries/QueryTest/misc.test
A testdata/workloads/functional-query/queries/QueryTest/special-strings.test
M testdata/workloads/functional-query/queries/QueryTest/stats-extrapolation.test
M testdata/workloads/tpcds/queries/tpcds-decimal_v2-q98.test
M testdata/workloads/tpcds/queries/tpcds-q98.test
M tests/common/test_result_verifier.py
M tests/query_test/test_exprs.py
10 files changed, 157 insertions(+), 112 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved
  Tim Armstrong: Verified

--
To view, visit http://gerrit.cloudera.org:8080/11800
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I18ddcb0440490ddf8184be66d3681038a1615dd9
Gerrit-Change-Number: 11800
Gerrit-PatchSet: 6
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Brown 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-6374: fix handling of commas in .test files

2018-10-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has removed a vote on this change.

Change subject: IMPALA-6374: fix handling of commas in .test files
..


Removed Verified-1 by Impala Public Jenkins 
--
To view, visit http://gerrit.cloudera.org:8080/11800
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: deleteVote
Gerrit-Change-Id: I18ddcb0440490ddf8184be66d3681038a1615dd9
Gerrit-Change-Number: 11800
Gerrit-PatchSet: 5
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Brown 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-6374: fix handling of commas in .test files

2018-10-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11800 )

Change subject: IMPALA-6374: fix handling of commas in .test files
..


Patch Set 5: Verified+1

Reran the flaky test: 
https://jenkins.impala.io/job/python26-incompatibility-check/538/


--
To view, visit http://gerrit.cloudera.org:8080/11800
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I18ddcb0440490ddf8184be66d3681038a1615dd9
Gerrit-Change-Number: 11800
Gerrit-PatchSet: 5
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Brown 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 22:17:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7363: Fix buffer usage in ScannerContext::Stream::ReadVLong()

2018-10-30 Thread Pooja Nilangekar (Code Review)
Pooja Nilangekar has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/11828


Change subject: IMPALA-7363: Fix buffer usage in 
ScannerContext::Stream::ReadVLong()
..

IMPALA-7363: Fix buffer usage in ScannerContext::Stream::ReadVLong()

Previously, the ReadVLong() function would read a byte and use it
after subsequent calls to ReadBytes() function. As a result, the
value was invalidated depending on the contents of the
boundary_buffer_. This change ensures that all the functions on
the buffer are invoked before subsequent calls to ReadBytes().

Testing:
Enabled test_tpch_scan_ranges() for sequence files and ran it
multiple times to ensure it always produces the desired result.
Manually verified the buffer usage of other calls to ReadBytes()
and GetBytes().

Change-Id: Ic2b2ffe4b1d67c63cfcea2baedbeff48e65ca417
---
M be/src/exec/scanner-context.h
M be/src/exec/scanner-context.inline.h
M tests/query_test/test_scanners.py
3 files changed, 10 insertions(+), 9 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/28/11828/1
--
To view, visit http://gerrit.cloudera.org:8080/11828
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ic2b2ffe4b1d67c63cfcea2baedbeff48e65ca417
Gerrit-Change-Number: 11828
Gerrit-PatchSet: 1
Gerrit-Owner: Pooja Nilangekar 


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary.

2018-10-30 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11591 )

Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary.
..


Patch Set 2:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/11591/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/11591/1//COMMIT_MSG@9
PS1, Line 9: Currently execution summary is not included in the profiles of 
running
   : queries, and it's only reported when the query is finished. This 
jira makes
   : the execution summary to the profile reported wh
> Use shorter lines. My recommendation is to wrap at 70 characters.
Thanks, addressed in new rev.


http://gerrit.cloudera.org:8080/#/c/11591/1/be/src/service/impala-server.h
File be/src/service/impala-server.h:

http://gerrit.cloudera.org:8080/#/c/11591/1/be/src/service/impala-server.h@637
PS1, Line 637: void UpdateE
> You don't want [[noreturn]]
Thanks Joe, my misunderstanding of this annotation. Fixed in new rev.


http://gerrit.cloudera.org:8080/#/c/11591/1/be/src/service/impala-server.cc
File be/src/service/impala-server.cc:

http://gerrit.cloudera.org:8080/#/c/11591/1/be/src/service/impala-server.cc@1128
PS1, Line 1128:<< PrettyPrinter::Print(cpu_limit_
> I'm thinking this might not hold for some calls to GetRuntimeProfileStr() t
Good catch Joe. Indeed. Fixed in new rev by adding a check.

I tried to run test_observability both locally and at jenkins, the former had 
some failures, however, the latter is clean. Looking into why it failed locally.



--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Tue, 30 Oct 2018 21:51:06 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1217/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 3
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Tue, 30 Oct 2018 21:32:57 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7586: fix predicate pushdown of escaped strings

2018-10-30 Thread Bharath Vissapragada (Code Review)
Bharath Vissapragada has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11814 )

Change subject: IMPALA-7586: fix predicate pushdown of escaped strings
..


Patch Set 6: Code-Review+1

(3 comments)

http://gerrit.cloudera.org:8080/#/c/11814/5/fe/src/main/java/org/apache/impala/planner/KuduScanNode.java
File fe/src/main/java/org/apache/impala/planner/KuduScanNode.java:

http://gerrit.cloudera.org:8080/#/c/11814/5/fe/src/main/java/org/apache/impala/planner/KuduScanNode.java@319
PS5, Line 319: getExplainString
> The behaviour here seems ok to me - it calls toSql(), which consistently es
The behavior overall seems ok to me as well. But in this case, it diverged from 
the original conjunct that was used to push down.


http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/data/strings_with_quotes.csv
File testdata/data/strings_with_quotes.csv:

http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/data/strings_with_quotes.csv@11
PS5, Line 11: foo\\"bar,11
> Done. I assume you meant that you wanted the value in the table in include
Yep.


http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/workloads/functional-query/queries/QueryTest/string-escaping-rcfile-bug.test
File 
testdata/workloads/functional-query/queries/QueryTest/string-escaping-rcfile-bug.test:

http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/workloads/functional-query/queries/QueryTest/string-escaping-rcfile-bug.test@4
PS5, Line 4: 
> I think it's better to exercise and track the known-bad behaviour than leav
Agreed.



--
To view, visit http://gerrit.cloudera.org:8080/11814
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I53d6e20dd48ab6837ddd325db8a9d49ee04fed28
Gerrit-Change-Number: 11814
Gerrit-PatchSet: 6
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 21:24:44 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1216/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 2
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Tue, 30 Oct 2018 21:21:03 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5821: Add query with implicit casts to extended explain output.

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11719 )

Change subject: IMPALA-5821: Add query with implicit casts to extended explain 
output.
..


Patch Set 4:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1215/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11719
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I55c3bdacc295137f66b2316a912fc347da30d6b0
Gerrit-Change-Number: 11719
Gerrit-PatchSet: 4
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Comment-Date: Tue, 30 Oct 2018 21:09:16 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 4
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Tue, 30 Oct 2018 21:06:36 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..


Patch Set 4:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3380/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 4
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Tue, 30 Oct 2018 21:06:37 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Philip Zeyliger (Code Review)
Philip Zeyliger has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..


Patch Set 3: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 3
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Tue, 30 Oct 2018 21:04:28 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..


Patch Set 3:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/11826/3/testdata/bin/run-hive-server.sh
File testdata/bin/run-hive-server.sh:

http://gerrit.cloudera.org:8080/#/c/11826/3/testdata/bin/run-hive-server.sh@66
PS3, Line 66: export HIVE_METASTORE_HADOOP_OPTS="-Xdebug 
-Xrunjdwp:transport=dt_socket,server=y,suspend=n,address=30010"
line too long (106 > 90)


http://gerrit.cloudera.org:8080/#/c/11826/3/testdata/bin/run-sentry-service.sh
File testdata/bin/run-sentry-service.sh:

http://gerrit.cloudera.org:8080/#/c/11826/3/testdata/bin/run-sentry-service.sh@46
PS3, Line 46: export HADOOP_OPTS="-Xdebug 
-Xrunjdwp:transport=dt_socket,server=y,suspend=n,address=30020"
line too long (91 > 90)



--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 3
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Tue, 30 Oct 2018 21:03:35 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has uploaded a new patch set (#3). ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..

IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

This patch updates the HMS and Sentry run scripts to start HMS and
Sentry in debug mode in the minicluster to make it easier to debug
issues related to HMS and Sentry.

HMS debug port: 30010
Sentry debug port: 30020

Testing:
- Connected the debugger to both HMS and Sentry.

Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
---
M testdata/bin/run-hive-server.sh
M testdata/bin/run-sentry-service.sh
2 files changed, 4 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/26/11826/3
--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 3
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..


Patch Set 3:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/11826/2//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/11826/2//COMMIT_MSG@13
PS2, Line 13: HMS debug port: 30010
> Watch out for ports above 32k. These are often configured as "ephemeral" po
Changed the ports. Done.



--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 3
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Tue, 30 Oct 2018 21:03:00 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7761: Add multiple DISTINCT to targeted stress and perf

2018-10-30 Thread Thomas Marshall (Code Review)
Thomas Marshall has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11805 )

Change subject: IMPALA-7761: Add multiple DISTINCT to targeted stress and perf
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/11805/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/11805/1//COMMIT_MSG@16
PS1, Line 16: Testing:
: - Ran the test file locally.
> The queries are fine, but what does it mean to run these locally? I'm not t
By "run locally" I mean that I put together a python test that run the files 
with run_test_case(), like we would do for an equivalent file in the 
functional-query workload. The idea was to verify that I had the 
RESULTS/TYPES/LABELS sections correct.

The targeted-perf workload is used by single_node_perf_run.py. I believe its 
something that Mostafa also used to run to check for regressions between 
releases. Even if no one is running it currently, it seems reasonable to me to 
add these queries for the future when there is someone paying closer attention 
to perf issues.

The targeted-stress workload seems to be pointless. Its only references in 
tests/stress/test_mini_stress.py and all of the tests there have been xfailed 
for years. Probably we should remove targeted-stress, or even better move the 
queries in workloads/tpch/ that aren't actually regular tpch queries to 
targeted-stress and modify the stress test to use them. It would also be great 
to re-enable the tests in test_mini_stress.py, even if just to ensure that the 
RESULTS/TYPES/LABELS sections stay up to date. That's all (hopefully) outside 
the scope of this patch.

Yes, the intention is for these to be run as part of downstream larger-scale 
testing. Is there any way to accomplish that currently? It seems that 
concurrent_select won't find them even if I add them to eg. 
testdata/workloads/tpch/queries/tpch-aggregations.test, as it only checks for 
test files of the form 'tpch-q*.test'



--
To view, visit http://gerrit.cloudera.org:8080/11805
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I400aaf6b6620b4001895eafff785956bffb312c9
Gerrit-Change-Number: 11805
Gerrit-PatchSet: 1
Gerrit-Owner: Thomas Marshall 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Brown 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 20:55:23 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Philip Zeyliger (Code Review)
Philip Zeyliger has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..


Patch Set 2:

(1 comment)

Seems fine except for your port choice.

http://gerrit.cloudera.org:8080/#/c/11826/2//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/11826/2//COMMIT_MSG@13
PS2, Line 13: HMS debug port: 4
Watch out for ports above 32k. These are often configured as "ephemeral" ports 
in linux which will produce somewhat random flakies.


$sysctl net.ipv4.ip_local_port_range
   net.ipv4.ip_local_port_range 
= 3276860999



--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 2
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Tue, 30 Oct 2018 20:52:42 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11826 )

Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..


Patch Set 2:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/11826/2/testdata/bin/run-hive-server.sh
File testdata/bin/run-hive-server.sh:

http://gerrit.cloudera.org:8080/#/c/11826/2/testdata/bin/run-hive-server.sh@66
PS2, Line 66: export HIVE_METASTORE_HADOOP_OPTS="-Xdebug 
-Xrunjdwp:transport=dt_socket,server=y,suspend=n,address=4"
line too long (106 > 90)


http://gerrit.cloudera.org:8080/#/c/11826/2/testdata/bin/run-sentry-service.sh
File testdata/bin/run-sentry-service.sh:

http://gerrit.cloudera.org:8080/#/c/11826/2/testdata/bin/run-sentry-service.sh@46
PS2, Line 46: export HADOOP_OPTS="-Xdebug 
-Xrunjdwp:transport=dt_socket,server=y,suspend=n,address=5"
line too long (91 > 90)



--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 2
Gerrit-Owner: Fredy Wijaya 
Gerrit-Reviewer: Fredy Wijaya 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Philip Zeyliger 
Gerrit-Comment-Date: Tue, 30 Oct 2018 20:50:19 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

2018-10-30 Thread Fredy Wijaya (Code Review)
Fredy Wijaya has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/11826


Change subject: IMPALA-7786: Start HMS and Sentry in debug mode in the 
minicluster
..

IMPALA-7786: Start HMS and Sentry in debug mode in the minicluster

This patch updates the HMS and Sentry run scripts to start HMS and
Sentry in debug mode in the minicluster to make it easier to debug
issues related to HMS and Sentry.

HMS debug port: 4
Sentry debug port: 5

Testing:
- Connected the debugger to both HMS and Sentry.

Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
---
M testdata/bin/run-hive-server.sh
M testdata/bin/run-sentry-service.sh
2 files changed, 4 insertions(+), 0 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/26/11826/2
--
To view, visit http://gerrit.cloudera.org:8080/11826
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I29b025cbde36ef398ea36fbe69eff26e27d93e48
Gerrit-Change-Number: 11826
Gerrit-PatchSet: 2
Gerrit-Owner: Fredy Wijaya 


[Impala-ASF-CR] IMPALA-5821: Add query with implicit casts to extended explain output.

2018-10-30 Thread Andrew Sherman (Code Review)
Andrew Sherman has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11719 )

Change subject: IMPALA-5821: Add query with implicit casts to extended explain 
output.
..


Patch Set 4:

I added a new patch because rebase was required


--
To view, visit http://gerrit.cloudera.org:8080/11719
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I55c3bdacc295137f66b2316a912fc347da30d6b0
Gerrit-Change-Number: 11719
Gerrit-PatchSet: 4
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Comment-Date: Tue, 30 Oct 2018 20:34:54 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5821: Add query with implicit casts to extended explain output.

2018-10-30 Thread Andrew Sherman (Code Review)
Andrew Sherman has uploaded a new patch set (#4). ( 
http://gerrit.cloudera.org:8080/11719 )

Change subject: IMPALA-5821: Add query with implicit casts to extended explain 
output.
..

IMPALA-5821: Add query with implicit casts to extended explain output.

If explain_level is at 'extended' level or higher, then enhance the
output from the explain command. (1) Show the analyzed sql in the
explain header, this is the rewritten sql, which includes implicit
casts, and literals are printed with a cast so that their type is
visible. (2) When predicates are shown in the plan these are shown in
the same format.

The toSql() method can be called on a ParseNode tree to return
the sql corresponding ot the tree. In the past toSQl() has been
enhanced to print rewritten sql by partially overloading toSql() [with
toSql(boolean)]. This current change requires changing toSQl() in
many places as NumericLiteral can appear at different points in ia
parse tree. To avoid many new fragile overloads of toSql() I added
toSql(ToSqlOptions), where ToSqlOptions is an enum which controls the
form of the Sql that is returned. This changes many files but is safer
and means that any future options to toSql() can be added painlessly.

If SHOW_IMPLICIT_CASTS is passed to toSql() then
- in CastExpr print the implicit cast
- in NumericLiteral print the literal with a cast to show the type

Add a PlannerTestOption directive that will force the query text showing
implicit casts to be included in the PLAN section of a .test file.

The analyzed query text is wrapped at 80 characters. Note that the
analyzed query cannot always be executed as queries rewritten to use
LEFT SEMI JOIN are not legal sql. In addition some space characters may
be removed from the query for prettier display.

Documentation of this change will be done as IMPALA-7718

EXAMPLE OUTPUT:

[localhost:21000] default> set explain_level=2;
EXPLAIN_LEVEL set to 2
[localhost:21000] default> explain select * from functional_kudu.alltypestiny 
where bigint_col < 1000 / 100;
Query: explain select * from functional_kudu.alltypestiny where bigint_col < 
1000 / 100
Max Per-Host Resource Reservation: Memory=0B Threads=2
Per-Host Resource Estimates: Memory=10MB
Codegen disabled by planner
Analyzed query: SELECT * FROM functional_kudu.alltypestiny WHERE CAST(bigint_col
AS DOUBLE) < CAST(10 AS DOUBLE)
""
F00:PLAN FRAGMENT [UNPARTITIONED] hosts=1 instances=1
|  Per-Host Resources: mem-estimate=4.88MB mem-reservation=0B 
thread-reservation=2
PLAN-ROOT SINK
|  mem-estimate=0B mem-reservation=0B thread-reservation=0
|
00:SCAN KUDU [functional_kudu.alltypestiny]
   predicates: CAST(bigint_col AS DOUBLE) < CAST(10 AS DOUBLE)
   mem-estimate=4.88MB mem-reservation=0B thread-reservation=1
   tuple-ids=0 row-size=97B cardinality=1
   in pipelines: 00(GETNEXT)
Fetched 16 row(s) in 0.03s

TESTING:

All end-to-end tests pass.
Added a new test in ExprRewriterTest which prints sql with implict casts
for some interesting queries.
Add a unit test for the code which wraps text at 80 characters.
The output of some Planner Tests in .test files has been updated to
include the Analyzed sql that is printed when explain_level is
at at least 'extended' level.

Change-Id: I55c3bdacc295137f66b2316a912fc347da30d6b0
---
M fe/src/main/java/org/apache/impala/analysis/AdminFnStmt.java
M 
fe/src/main/java/org/apache/impala/analysis/AlterTableAddDropRangePartitionStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableAddPartitionStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableDropPartitionStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterViewStmt.java
M fe/src/main/java/org/apache/impala/analysis/AnalysisContext.java
M fe/src/main/java/org/apache/impala/analysis/AnalyticExpr.java
M fe/src/main/java/org/apache/impala/analysis/AnalyticWindow.java
M fe/src/main/java/org/apache/impala/analysis/ArithmeticExpr.java
M fe/src/main/java/org/apache/impala/analysis/BaseTableRef.java
M fe/src/main/java/org/apache/impala/analysis/BetweenPredicate.java
M fe/src/main/java/org/apache/impala/analysis/BinaryPredicate.java
M fe/src/main/java/org/apache/impala/analysis/BoolLiteral.java
M fe/src/main/java/org/apache/impala/analysis/CaseExpr.java
M fe/src/main/java/org/apache/impala/analysis/CastExpr.java
M fe/src/main/java/org/apache/impala/analysis/CompoundPredicate.java
M fe/src/main/java/org/apache/impala/analysis/ComputeStatsStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateDataSrcStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateDbStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateDropRoleStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateOrAlterViewStmtBase.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableAsSelectStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableLikeFileStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableLikeStmt.java
M 

[Impala-ASF-CR] IMPALA-7586: fix predicate pushdown of escaped strings

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11814 )

Change subject: IMPALA-7586: fix predicate pushdown of escaped strings
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1214/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11814
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I53d6e20dd48ab6837ddd325db8a9d49ee04fed28
Gerrit-Change-Number: 11814
Gerrit-PatchSet: 6
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 20:02:20 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7727: Fix TStatusCode to TErrorCode mapping

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/11778 )

Change subject: IMPALA-7727: Fix TStatusCode to TErrorCode mapping
..

IMPALA-7727: Fix TStatusCode to TErrorCode mapping

- Uses a "GENERAL" TErrorCode type for all non-OK statuses.
- Detailed regression root cause description in the jira IMPALA-7727.
- Added a regression test.

Change-Id: Ie62527734aa73c1524c731773638590bdac9e789
Reviewed-on: http://gerrit.cloudera.org:8080/11778
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M be/src/common/status.cc
M be/src/common/status.h
M be/src/service/child-query.cc
M be/src/util/error-util.cc
M be/src/util/error-util.h
M testdata/workloads/functional-query/queries/QueryTest/compute-stats.test
6 files changed, 37 insertions(+), 27 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/11778
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Ie62527734aa73c1524c731773638590bdac9e789
Gerrit-Change-Number: 11778
Gerrit-PatchSet: 6
Gerrit-Owner: Bharath Vissapragada 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-7727: Fix TStatusCode to TErrorCode mapping

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11778 )

Change subject: IMPALA-7727: Fix TStatusCode to TErrorCode mapping
..


Patch Set 5: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/11778
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie62527734aa73c1524c731773638590bdac9e789
Gerrit-Change-Number: 11778
Gerrit-PatchSet: 5
Gerrit-Owner: Bharath Vissapragada 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 19:55:11 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7783: Skip test default timezone when testing a real cluster.

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11820 )

Change subject: IMPALA-7783: Skip test_default_timezone when testing a real 
cluster.
..


Patch Set 4: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/11820
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia4d4c503d2c77136cedd8f3fd830b6ce70d4457f
Gerrit-Change-Number: 11820
Gerrit-PatchSet: 4
Gerrit-Owner: David Knupp 
Gerrit-Reviewer: Attila Jeges 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Brown 
Gerrit-Comment-Date: Tue, 30 Oct 2018 19:47:07 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7783: Skip test default timezone when testing a real cluster.

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/11820 )

Change subject: IMPALA-7783: Skip test_default_timezone when testing a real 
cluster.
..

IMPALA-7783: Skip test_default_timezone when testing a real cluster.

test_shell_commandline.py::test_default_timezone assumes that the
cluster is running on the same platform as the test process, but
that's only guaranteed when the testing a local minicluster. When
run against a real cluster, the test executor can be a completely
different OS.

Change-Id: Ia4d4c503d2c77136cedd8f3fd830b6ce70d4457f
Reviewed-on: http://gerrit.cloudera.org:8080/11820
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M tests/shell/test_shell_commandline.py
1 file changed, 6 insertions(+), 0 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/11820
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Ia4d4c503d2c77136cedd8f3fd830b6ce70d4457f
Gerrit-Change-Number: 11820
Gerrit-PatchSet: 5
Gerrit-Owner: David Knupp 
Gerrit-Reviewer: Attila Jeges 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Brown 


[Impala-ASF-CR] IMPALA-6374: fix handling of commas in .test files

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11800 )

Change subject: IMPALA-6374: fix handling of commas in .test files
..


Patch Set 5: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/3377/


--
To view, visit http://gerrit.cloudera.org:8080/11800
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I18ddcb0440490ddf8184be66d3681038a1615dd9
Gerrit-Change-Number: 11800
Gerrit-PatchSet: 5
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Brown 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 19:43:37 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7586: fix predicate pushdown of escaped strings

2018-10-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11814 )

Change subject: IMPALA-7586: fix predicate pushdown of escaped strings
..


Patch Set 5:

(6 comments)

http://gerrit.cloudera.org:8080/#/c/11814/5//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/11814/5//COMMIT_MSG@9
PS5, Line 9: This fixes a class of bugs where the planner incorrectly uses the 
raw
   : string from the parser instead of the unescaped string.
> fyi, I have found a similar issue in partition pruning, see IMPALA-7784.
ack


http://gerrit.cloudera.org:8080/#/c/11814/5//COMMIT_MSG@26
PS5, Line 26: Added regression test that tests handling of backslash escapes on 
all file
: formats. I did not add a regression test for the data source bug 
since it
> nit: long lines, please wrap at 72
Done


http://gerrit.cloudera.org:8080/#/c/11814/5/fe/src/main/java/org/apache/impala/planner/KuduScanNode.java
File fe/src/main/java/org/apache/impala/planner/KuduScanNode.java:

http://gerrit.cloudera.org:8080/#/c/11814/5/fe/src/main/java/org/apache/impala/planner/KuduScanNode.java@319
PS5, Line 319: getExplainString
> It looks like this always prints the normalized/unescaped string even thoug
The behaviour here seems ok to me - it calls toSql(), which consistently 
escapes things. If we didn't escape things, then the expressions in the explain 
output might not be valid expressions in our SQL expressions, which seems 
inconvenient.


http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/data/strings_with_quotes.csv
File testdata/data/strings_with_quotes.csv:

http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/data/strings_with_quotes.csv@11
PS5, Line 11:
> add foo\"bar, 11 too? That shouldn't be returned with ..where s = "foo\"bar
Done. I assume you meant that you wanted the value in the table in include the 
backslash, which requires a double backslash here.


http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/workloads/functional-query/queries/QueryTest/string-escaping-rcfile-bug.test
File 
testdata/workloads/functional-query/queries/QueryTest/string-escaping-rcfile-bug.test:

http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/workloads/functional-query/queries/QueryTest/string-escaping-rcfile-bug.test@1
PS5, Line 1: 
> Can you add an extra comment to make it even clearer that this file contain
Done


http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/workloads/functional-query/queries/QueryTest/string-escaping-rcfile-bug.test@4
PS5, Line 4: # IMPALA-7778: escapes are ignored so output is incorrect
> Can't we just xfail if format == 'rc' instead of doing this? or is it becau
I think it's better to exercise and track the known-bad behaviour than leave it 
unexercised.

Also, honestly, I don't know if there is enough usage of RC at this point that 
anyone will be motivated to fix it.



--
To view, visit http://gerrit.cloudera.org:8080/11814
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I53d6e20dd48ab6837ddd325db8a9d49ee04fed28
Gerrit-Change-Number: 11814
Gerrit-PatchSet: 5
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 19:28:43 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7586: fix predicate pushdown of escaped strings

2018-10-30 Thread Tim Armstrong (Code Review)
Hello Bharath Vissapragada, Csaba Ringhofer, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/11814

to look at the new patch set (#6).

Change subject: IMPALA-7586: fix predicate pushdown of escaped strings
..

IMPALA-7586: fix predicate pushdown of escaped strings

This fixes a class of bugs where the planner incorrectly uses the raw
string from the parser instead of the unescaped string. This occurs in
several places that push predicates down to the storage layer:
* Kudu scans
* HBase scans
* Data source scans

There are some more complex issues with escapes and the LIKE predicate
that are tracked separately by IMPALA-2422.

This also uncovered a different issue with RCFiles that is tracked by
IMPALA-7778 and is worked around by the tests added.

In order to make bugs like this more obvious in future, I renamed
getValue() to getValueWithOriginalEscapes().

Testing:
Added regression test that tests handling of backslash escapes on all
file formats. I did not add a regression test for the data source bug
since it seems to require some major modification of the data source
test infrastructure.

Change-Id: I53d6e20dd48ab6837ddd325db8a9d49ee04fed28
---
M fe/src/main/java/org/apache/impala/analysis/AdminFnStmt.java
M fe/src/main/java/org/apache/impala/analysis/Expr.java
M fe/src/main/java/org/apache/impala/analysis/ExtractFromExpr.java
M fe/src/main/java/org/apache/impala/analysis/LikePredicate.java
M fe/src/main/java/org/apache/impala/analysis/StringLiteral.java
M fe/src/main/java/org/apache/impala/planner/DataSourceScanNode.java
M fe/src/main/java/org/apache/impala/planner/HBaseScanNode.java
M fe/src/main/java/org/apache/impala/planner/KuduScanNode.java
M testdata/data/README
A testdata/data/strings_with_quotes.csv
M testdata/datasets/functional/functional_schema_template.sql
M testdata/datasets/functional/schema_constraints.csv
A 
testdata/workloads/functional-query/queries/QueryTest/string-escaping-rcfile-bug.test
A testdata/workloads/functional-query/queries/QueryTest/string-escaping.test
M tests/query_test/test_scanners.py
15 files changed, 199 insertions(+), 14 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/14/11814/6
--
To view, visit http://gerrit.cloudera.org:8080/11814
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I53d6e20dd48ab6837ddd325db8a9d49ee04fed28
Gerrit-Change-Number: 11814
Gerrit-PatchSet: 6
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary.

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11591 )

Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary.
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1213/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Tue, 30 Oct 2018 18:06:53 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7586: fix predicate pushdown of escaped strings

2018-10-30 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11814 )

Change subject: IMPALA-7586: fix predicate pushdown of escaped strings
..


Patch Set 5:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/11814/5//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/11814/5//COMMIT_MSG@9
PS5, Line 9: This fixes a class of bugs where the planner incorrectly uses the 
raw
   : string from the parser instead of the unescaped string.
fyi, I have found a similar issue in partition pruning, see IMPALA-7784.


http://gerrit.cloudera.org:8080/#/c/11814/5//COMMIT_MSG@26
PS5, Line 26: Added regression test that tests handling of backslash escapes on 
all file
: formats. I did not add a regression test for the data source bug 
since it
nit: long lines, please wrap at 72


http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/workloads/functional-query/queries/QueryTest/string-escaping-rcfile-bug.test
File 
testdata/workloads/functional-query/queries/QueryTest/string-escaping-rcfile-bug.test:

http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/workloads/functional-query/queries/QueryTest/string-escaping-rcfile-bug.test@1
PS5, Line 1: 
Can you add an extra comment to make it even clearer that this file contains 
buggy results that we plan to fix later?



--
To view, visit http://gerrit.cloudera.org:8080/11814
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I53d6e20dd48ab6837ddd325db8a9d49ee04fed28
Gerrit-Change-Number: 11814
Gerrit-PatchSet: 5
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 30 Oct 2018 17:55:01 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary.

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11591 )

Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary.
..


Patch Set 1:

Build Failed

https://jenkins.impala.io/job/gerrit-code-review-checks/1212/ : Initial code 
review checks failed. See linked job for details on the failure.


--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 1
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Tue, 30 Oct 2018 17:45:27 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7586: fix predicate pushdown of escaped strings

2018-10-30 Thread Bharath Vissapragada (Code Review)
Bharath Vissapragada has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11814 )

Change subject: IMPALA-7586: fix predicate pushdown of escaped strings
..


Patch Set 5: Code-Review+1

(3 comments)

A bunch of minor comments. Fix lgtm.

http://gerrit.cloudera.org:8080/#/c/11814/5/fe/src/main/java/org/apache/impala/planner/KuduScanNode.java
File fe/src/main/java/org/apache/impala/planner/KuduScanNode.java:

http://gerrit.cloudera.org:8080/#/c/11814/5/fe/src/main/java/org/apache/impala/planner/KuduScanNode.java@319
PS5, Line 319: getExplainString
It looks like this always prints the normalized/unescaped string even though we 
used the original string for predicates in the scan nodes.  We could've 
probably diagnosed the issue faster if this was right.


http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/data/strings_with_quotes.csv
File testdata/data/strings_with_quotes.csv:

http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/data/strings_with_quotes.csv@11
PS5, Line 11:
add foo\"bar, 11 too? That shouldn't be returned with ..where s = "foo\"bar"..


http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/workloads/functional-query/queries/QueryTest/string-escaping-rcfile-bug.test
File 
testdata/workloads/functional-query/queries/QueryTest/string-escaping-rcfile-bug.test:

http://gerrit.cloudera.org:8080/#/c/11814/5/testdata/workloads/functional-query/queries/QueryTest/string-escaping-rcfile-bug.test@4
PS5, Line 4: # IMPALA-7778: escapes are ignored so output is incorrect
Can't we just xfail if format == 'rc' instead of doing this? or is it because 
if someone fixes it in the future, this test starts failing and they know it 
right away?



--
To view, visit http://gerrit.cloudera.org:8080/11814
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I53d6e20dd48ab6837ddd325db8a9d49ee04fed28
Gerrit-Change-Number: 11814
Gerrit-PatchSet: 5
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 30 Oct 2018 17:38:19 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary.

2018-10-30 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11591 )

Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary.
..


Patch Set 2:

Hi Joe, thanks a lot for the very good review and sorry for late update. I just 
uploaded a new rev. Interestingly, some tests in test_observability failed 
locally but all is clean in jenkins. One question about your comment about line 
637, I saw other places included [[noreturn]], what's the guideline for having 
it or not? thanks.


--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Yongjun Zhang 
Gerrit-Comment-Date: Tue, 30 Oct 2018 17:36:59 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary.

2018-10-30 Thread Yongjun Zhang (Code Review)
Hello Tim Armstrong, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/11591

to look at the new patch set (#2).

Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary.
..

IMPALA-6742: Profiles of running queries should include execution summary.

Currently execution summary is not included in the profiles of running
queries, and it's only reported when the query is finished. This jira makes
the execution summary to the profile reported when queries are still running.

Testing:
Done with real cluster.

Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
---
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
2 files changed, 19 insertions(+), 7 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/91/11591/2
--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 2
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-6742: Profiles of running queries should include execution summary

2018-10-30 Thread Yongjun Zhang (Code Review)
Yongjun Zhang has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/11591


Change subject: IMPALA-6742: Profiles of running queries should include 
execution summary
..

IMPALA-6742: Profiles of running queries should include execution summary

Currently execution summary is not included in the profiles of running queries, 
and it's
only reported when the query is finished. This jira makes the execution summary 
to the
profile reported when queries are still running.

Testing:
This is a draft, tests are yet to be done.

Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
---
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
2 files changed, 15 insertions(+), 4 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/91/11591/1
--
To view, visit http://gerrit.cloudera.org:8080/11591
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Idc7f714c9427d4b26d4e78cf27ceca2b0b336699
Gerrit-Change-Number: 11591
Gerrit-PatchSet: 1
Gerrit-Owner: Yongjun Zhang 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-7586: fix predicate pushdown of escaped strings

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11814 )

Change subject: IMPALA-7586: fix predicate pushdown of escaped strings
..


Patch Set 5:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/1211/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/11814
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I53d6e20dd48ab6837ddd325db8a9d49ee04fed28
Gerrit-Change-Number: 11814
Gerrit-PatchSet: 5
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 30 Oct 2018 16:52:14 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-5050: Add support to read TIMESTAMP MILLIS and TIMESTAMP MICROS from Parquet

2018-10-30 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11057 )

Change subject: IMPALA-5050: Add support to read TIMESTAMP_MILLIS and 
TIMESTAMP_MICROS from Parquet
..


Patch Set 18:

> Hi Csaba, should I look at this now or is Zoltan still completing
 > his review?

Hi Tim! I think that Zoltan will not continue the review this week, so it would 
be great if you could look at it.


--
To view, visit http://gerrit.cloudera.org:8080/11057
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4c7c01fffa31b3d2ca3480adf6ff851137dadac3
Gerrit-Change-Number: 11057
Gerrit-PatchSet: 18
Gerrit-Owner: Csaba Ringhofer 
Gerrit-Reviewer: Attila Jeges 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Tue, 30 Oct 2018 16:33:22 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7727: Fix TStatusCode to TErrorCode mapping

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11778 )

Change subject: IMPALA-7727: Fix TStatusCode to TErrorCode mapping
..


Patch Set 5: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/11778
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie62527734aa73c1524c731773638590bdac9e789
Gerrit-Change-Number: 11778
Gerrit-PatchSet: 5
Gerrit-Owner: Bharath Vissapragada 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 16:02:35 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7586: fix predicate pushdown of escaped strings

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11814 )

Change subject: IMPALA-7586: fix predicate pushdown of escaped strings
..


Patch Set 5:

Build Failed

https://jenkins.impala.io/job/gerrit-code-review-checks/1210/ : Initial code 
review checks failed. See linked job for details on the failure.


--
To view, visit http://gerrit.cloudera.org:8080/11814
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I53d6e20dd48ab6837ddd325db8a9d49ee04fed28
Gerrit-Change-Number: 11814
Gerrit-PatchSet: 5
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 30 Oct 2018 16:01:31 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7727: Fix TStatusCode to TErrorCode mapping

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11778 )

Change subject: IMPALA-7727: Fix TStatusCode to TErrorCode mapping
..


Patch Set 5:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3379/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/11778
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie62527734aa73c1524c731773638590bdac9e789
Gerrit-Change-Number: 11778
Gerrit-PatchSet: 5
Gerrit-Owner: Bharath Vissapragada 
Gerrit-Reviewer: Bharath Vissapragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Ho 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 16:02:36 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7783: Skip test default timezone when testing a real cluster.

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11820 )

Change subject: IMPALA-7783: Skip test_default_timezone when testing a real 
cluster.
..


Patch Set 4:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/3378/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/11820
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia4d4c503d2c77136cedd8f3fd830b6ce70d4457f
Gerrit-Change-Number: 11820
Gerrit-PatchSet: 4
Gerrit-Owner: David Knupp 
Gerrit-Reviewer: Attila Jeges 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Brown 
Gerrit-Comment-Date: Tue, 30 Oct 2018 15:57:16 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7783: Skip test default timezone when testing a real cluster.

2018-10-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11820 )

Change subject: IMPALA-7783: Skip test_default_timezone when testing a real 
cluster.
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/11820
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia4d4c503d2c77136cedd8f3fd830b6ce70d4457f
Gerrit-Change-Number: 11820
Gerrit-PatchSet: 4
Gerrit-Owner: David Knupp 
Gerrit-Reviewer: Attila Jeges 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Brown 
Gerrit-Comment-Date: Tue, 30 Oct 2018 15:57:15 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6658: improve Parquet RLE for low bit widths

2018-10-30 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11582 )

Change subject: IMPALA-6658: improve Parquet RLE for low bit widths
..


Patch Set 4:

(4 comments)

I went through code again and my impression is that MaxBufferSize does not do 
its job properly.

http://gerrit.cloudera.org:8080/#/c/11582/2//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/11582/2//COMMIT_MSG@20
PS2, Line 20: length 16 for single bit values. All other bit widths will use the
: existing length 8 runs.
:
> I agree that this might be a better way, but I am happy with the simple imp
Ok, the biggest win (50% size for alternating runs of 8 in the 1 bit case) is 
already gained with this change.

Please reflect in the Jira that there is still some potential in improving the 
RLE encoding.


http://gerrit.cloudera.org:8080/#/c/11582/4/be/src/util/rle-encoding.h
File be/src/util/rle-encoding.h:

http://gerrit.cloudera.org:8080/#/c/11582/4/be/src/util/rle-encoding.h@250
PS4, Line 250: 1 +
Can you simplify this expression? MAX_VALUES_PER_LITERAL_RUN must be divisible 
by 8 - this could be checked by an assert, and the Ceil + static cast could be 
removed.


http://gerrit.cloudera.org:8080/#/c/11582/4/be/src/util/rle-encoding.h@264
PS4, Line 264: num_runs
Aren't we double counting the indicator byte here? My assumption is that this 
num_runs is meant to add the +1 byte/ literal run.


http://gerrit.cloudera.org:8080/#/c/11582/4/be/src/util/rle-encoding.h@265
PS4, Line 265: return std::max(MinBufferSize(bit_width), literal_max_size);
This seems to assume that a single big literal run is the worsts case - can't 
this underestimate the number of bytes needed due to
IMPALA-6658?



--
To view, visit http://gerrit.cloudera.org:8080/11582
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I191a581d3f699b6669e48ac9dc39c76ed77c4a76
Gerrit-Change-Number: 11582
Gerrit-PatchSet: 4
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Marshall 
Gerrit-Comment-Date: Tue, 30 Oct 2018 15:55:19 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7783: Skip test default timezone when testing a real cluster.

2018-10-30 Thread Michael Brown (Code Review)
Michael Brown has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11820 )

Change subject: IMPALA-7783: Skip test_default_timezone when testing a real 
cluster.
..


Patch Set 3: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/11820
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia4d4c503d2c77136cedd8f3fd830b6ce70d4457f
Gerrit-Change-Number: 11820
Gerrit-PatchSet: 3
Gerrit-Owner: David Knupp 
Gerrit-Reviewer: Attila Jeges 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Brown 
Gerrit-Comment-Date: Tue, 30 Oct 2018 15:54:26 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7783: Skip test default timezone when testing a real cluster.

2018-10-30 Thread David Knupp (Code Review)
David Knupp has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11820 )

Change subject: IMPALA-7783: Skip test_default_timezone when testing a real 
cluster.
..


Patch Set 3:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/11820/2//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/11820/2//COMMIT_MSG@7
PS2, Line 7: IMPALA-77
> We need an upstream Jira here, not one for Cloudera.
Doh -- thanks. I just spaced on this.



--
To view, visit http://gerrit.cloudera.org:8080/11820
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia4d4c503d2c77136cedd8f3fd830b6ce70d4457f
Gerrit-Change-Number: 11820
Gerrit-PatchSet: 3
Gerrit-Owner: David Knupp 
Gerrit-Reviewer: Attila Jeges 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Brown 
Gerrit-Comment-Date: Tue, 30 Oct 2018 15:50:43 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7783: Skip test default timezone when testing a real cluster.

2018-10-30 Thread David Knupp (Code Review)
Hello Michael Brown, Attila Jeges, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/11820

to look at the new patch set (#3).

Change subject: IMPALA-7783: Skip test_default_timezone when testing a real 
cluster.
..

IMPALA-7783: Skip test_default_timezone when testing a real cluster.

test_shell_commandline.py::test_default_timezone assumes that the
cluster is running on the same platform as the test process, but
that's only guaranteed when the testing a local minicluster. When
run against a real cluster, the test executor can be a completely
different OS.

Change-Id: Ia4d4c503d2c77136cedd8f3fd830b6ce70d4457f
---
M tests/shell/test_shell_commandline.py
1 file changed, 6 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/20/11820/3
--
To view, visit http://gerrit.cloudera.org:8080/11820
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ia4d4c503d2c77136cedd8f3fd830b6ce70d4457f
Gerrit-Change-Number: 11820
Gerrit-PatchSet: 3
Gerrit-Owner: David Knupp 
Gerrit-Reviewer: Attila Jeges 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Brown 


[Impala-ASF-CR] IMPALA-6374: fix handling of commas in .test files

2018-10-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/11800 )

Change subject: IMPALA-6374: fix handling of commas in .test files
..


Patch Set 5:

Exhaustive tests passed overnight


--
To view, visit http://gerrit.cloudera.org:8080/11800
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I18ddcb0440490ddf8184be66d3681038a1615dd9
Gerrit-Change-Number: 11800
Gerrit-PatchSet: 5
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Brown 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 30 Oct 2018 15:48:32 +
Gerrit-HasComments: No


  1   2   >