Michael Smith created IMPALA-13012:
--------------------------------------

             Summary: Completed queries write fails regularly under heavy load
                 Key: IMPALA-13012
                 URL: https://issues.apache.org/jira/browse/IMPALA-13012
             Project: IMPALA
          Issue Type: Bug
          Components: Backend
    Affects Versions: Impala 4.4.0
            Reporter: Michael Smith


Under heavy test load (running EE tests), Impala regularly fails to write 
completed queries with errors like
{code}
W0411 19:11:07.764967 32713 workload-management.cc:435] failed to write 
completed queries table="sys.impala_query_log" record_count="10001"
W0411 19:11:07.764983 32713 workload-management.cc:437] AnalysisException: 
Exceeded the statement expression limit (250000)
Statement has 370039 expressions.
{code}

After a few attempts, it floods logs with an error for each query that could 
not be written
{code}
E0411 19:11:24.646953 32713 workload-management.cc:376] could not write 
completed query table="sys.impala_query_log" 
query_id="3142ceb1380b58e6:715b83d900000000"
{code}

This seems like poor default behavior. Options for addressing it:
# Decrease the default for {{query_log_max_queued}}. Inserts are pretty 
constant at 37 expressions per entry. I'm not sure why that isn't 49, since 
that's the number of columns we have; maybe some fields are frequently omitted. 
I would cap {{query_log_max_queued}} to {{statement_expression_limit / 
number_of_columns ~ 5100}}.
# Allow workload management to {{set statement_expression_limit}} higher using 
a similar formula. This may be relatively safe as the expressions are simple.
# Ideally we would skip expression parsing and construct TExecRequest directly, 
but that's a much larger effort.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to