Fifteen created IMPALA-10342:
--------------------------------

             Summary: Alleviating congestion caused by row-level warnings 
                 Key: IMPALA-10342
                 URL: https://issues.apache.org/jira/browse/IMPALA-10342
             Project: IMPALA
          Issue Type: Bug
          Components: Backend
            Reporter: Fifteen
         Attachments: image-2020-11-19-17-30-22-918.png

By default, when encounting error, both `get_json_object()` and 
`DecimalOperators::IntToDecimalVal` will raise warning. During to their 
stateless nature, functions keep throwing messages. Hence the warning flood 
will easily overwhelm cluster's processing capacity.

To be specific, we have observed these bottlenecks:

*Exchange Receiver*:   the default value for `rpc_max_message_size` is 50MB. 
The flooding warning messages carried by ReportExecStatusPB will exceed that 
limit, causing report without profile.  Even though the report message size is 
less than those limit, the bandwidth consumption is non-trivial.

*Storage:* like https://issues.apache.org/jira/browse/IMPALA-5256 , warning 
messages produces huge log files since `stdout/stderr` won't be redirected when 
glog is rolling log files. 

*Coordinator*: runtime profiles will be seriialized to thrift and stored in 
Coordinator's memory. The warning flood will make `Untracked Memory` rising 
rapidly. I have made a mem sample and found most memory were used by 
RuntimeProfile and Strings. 

!image-2020-11-19-17-30-22-918.png!

 

Solution:
 # We have a straightforward solution by changing `AddWarning()` to `no-op`.

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to