[
https://issues.apache.org/jira/browse/IMPALA-13130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17853074#comment-17853074
]
ASF subversion and git services commented on IMPALA-13130:
----------------------------------------------------------
Commit 3f827bfc2447d8c11a4f09bcb96e86c53b92d753 in impala's branch
refs/heads/master from Michael Smith
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=3f827bfc2 ]
IMPALA-13130: Prioritize EndDataStream messages
Prioritize EndDataStream messages over other types handled by
DataStreamService, and avoid rejecting them when memory limit is
reached. They take very little memory (~75 bytes) and will usually help
reduce memory use by closing out in-progress operations.
Adds the 'data_stream_sender_eos_timeout_ms' flag to control EOS
timeouts. Defaults to 1 hour, and can be disabled by setting to -1.
Adds unit tests ensuring EOS are processed even if mem limit is reached
and ahead of TransmitData messages in the queue.
Change-Id: I2829e1ab5bcde36107e10bff5fe629c5ee60f3e8
Reviewed-on: http://gerrit.cloudera.org:8080/21476
Reviewed-by: Impala Public Jenkins <[email protected]>
Tested-by: Impala Public Jenkins <[email protected]>
> Under heavy load, Impala does not prioritize data stream operations
> -------------------------------------------------------------------
>
> Key: IMPALA-13130
> URL: https://issues.apache.org/jira/browse/IMPALA-13130
> Project: IMPALA
> Issue Type: Bug
> Reporter: Michael Smith
> Assignee: Michael Smith
> Priority: Major
>
> Under heavy load - where Impala reaches max memory for the DataStreamService
> and applies backpressure via
> https://github.com/apache/impala/blob/4.4.0/be/src/rpc/impala-service-pool.cc#L191-L199
> - DataStreamService does not differentiate between types of requests and may
> reject requests that could help reduce load.
> The DataStreamService deals with TransmitData, PublishFilter, UpdateFilter,
> UpdateFilterFromRemote, and EndDataStream. It seems like we should prioritize
> completing EndDataStream, especially under heavy load, to complete work and
> release resources more quickly.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]