[jira] [Commented] (IMPALA-4865) Limit size of literal values from constant folding

ASF subversion and git services (JIRA) Tue, 04 Dec 2018 09:17:11 -0800


    [ 
https://issues.apache.org/jira/browse/IMPALA-4865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16709004#comment-16709004
 ]


ASF subversion and git services commented on IMPALA-4865:
---------------------------------------------------------

Commit 3f0989a4fcad1b95c79c765d829fce72dd122f21 in impala's branch 
refs/heads/master from [[email protected]]
[ https://git-wip-us.apache.org/repos/asf?p=impala.git;h=3f0989a ]

IMPALA-7811: optionally count JVM heap towards process mem limit

Adds a flag --mem_limit_includes_jvm that alters memory accounting to
include the amount of memory we think that the JVM is likely to use.
By default this flag is false, so behaviour is unchanged.

We're not ready to change the default but I want to check this in to
enable experimentation.

Two metrics are counted towards the process limit:
* The maximum JVM heap size. We count this because the JVM memory
  consumption can expand up to this threshold at any time.
* JVM non-heap committed memory. This can be a non-trivial amount of
  memory (e.g. I saw 150MB on one production cluster). There isn't a
  hard upper bound on this memory that I know of but should not
  grow rapidly.

This requires adjustments in a couple of other places:
* Admission control previous assumed that all of the process memory
  limit was available to queries (an assumption that is not strictly
  true because of untracked memory, etc, but close enough). However,
  the JVM heap makes a large part of the process limit unusable to
  queries, so we should only admit up to "process limit - max JVM heap
  size" per node.
* The buffer pool is now a percentage of the remaining process limit
  after the JVM heap, instead of the total process limit.

Currently, end-to-end tests fail if run with this flag for two reasons:
* The default JVM heap size is 1/4 of physical memory, which means that
  essentially all of the process memory limit is consumed by the JVM
  heaps when we running 3 impala daemons per host, unless -Xmx is
  explicitly set.
* If the heap size is limited to 1-2GB like below, then most tests pass
  but TestInsert.test_insert_large_string fails because IMPALA-4865
  lets it create giant strings that eat up all the JVM heap.

  start-impala-cluster.py \
      --impalad_args=--mem_limit_includes_jvm=true --jvm_args="-Xmx1g"

Testing:
Add a custom cluster test that uses the new option and validates the
the memory consumption values.

Change-Id: I39dd715882a32fc986755d573bd46f0fd9eefbfc
Reviewed-on: http://gerrit.cloudera.org:8080/10928
Reviewed-by: Impala Public Jenkins <[email protected]>
Tested-by: Impala Public Jenkins <[email protected]>


> Limit size of literal values from constant folding
> --------------------------------------------------
>
>                 Key: IMPALA-4865
>                 URL: https://issues.apache.org/jira/browse/IMPALA-4865
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Frontend
>    Affects Versions: Impala 2.8.0
>            Reporter: Henry Robinson
>            Priority: Major
>              Labels: newbie
>
> Consider (from {{test_insert.py}}) the following query:
> {{create table foo as select repeat('AZ', 128 * 1024 * 1024) as s}}
> {{repeat()}} gets constant-folded into a 256MB expression payload, which is 
> then transmitted to every fragment instance that evaluates the expression. 
> This can lead to timeouts. 
> The workaround is {{ENABLE_EXPR_REWRITES=0}}.
> It's not obvious what the answer is - should the FE be aware of the 
> serialized size of an expression when deciding whether to constant fold it?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (IMPALA-4865) Limit size of literal values from constant folding

Reply via email to