[ 
https://issues.apache.org/jira/browse/IMPALA-13130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17853074#comment-17853074
 ] 

ASF subversion and git services commented on IMPALA-13130:
----------------------------------------------------------

Commit 3f827bfc2447d8c11a4f09bcb96e86c53b92d753 in impala's branch 
refs/heads/master from Michael Smith
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=3f827bfc2 ]

IMPALA-13130: Prioritize EndDataStream messages

Prioritize EndDataStream messages over other types handled by
DataStreamService, and avoid rejecting them when memory limit is
reached. They take very little memory (~75 bytes) and will usually help
reduce memory use by closing out in-progress operations.

Adds the 'data_stream_sender_eos_timeout_ms' flag to control EOS
timeouts. Defaults to 1 hour, and can be disabled by setting to -1.

Adds unit tests ensuring EOS are processed even if mem limit is reached
and ahead of TransmitData messages in the queue.

Change-Id: I2829e1ab5bcde36107e10bff5fe629c5ee60f3e8
Reviewed-on: http://gerrit.cloudera.org:8080/21476
Reviewed-by: Impala Public Jenkins <[email protected]>
Tested-by: Impala Public Jenkins <[email protected]>


> Under heavy load, Impala does not prioritize data stream operations
> -------------------------------------------------------------------
>
>                 Key: IMPALA-13130
>                 URL: https://issues.apache.org/jira/browse/IMPALA-13130
>             Project: IMPALA
>          Issue Type: Bug
>            Reporter: Michael Smith
>            Assignee: Michael Smith
>            Priority: Major
>
> Under heavy load - where Impala reaches max memory for the DataStreamService 
> and applies backpressure via 
> https://github.com/apache/impala/blob/4.4.0/be/src/rpc/impala-service-pool.cc#L191-L199
>  - DataStreamService does not differentiate between types of requests and may 
> reject requests that could help reduce load.
> The DataStreamService deals with TransmitData, PublishFilter, UpdateFilter, 
> UpdateFilterFromRemote, and EndDataStream. It seems like we should prioritize 
> completing EndDataStream, especially under heavy load, to complete work and 
> release resources more quickly.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to