[
https://issues.apache.org/jira/browse/FLINK-14314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Zhu Zhu updated FLINK-14314:
----------------------------
Description:
With FLINK-14058, it is assumed that a shared slot should be large enough to be
used by one instance of each JobVertex in the group simultaneously.
To support it, a shared slot resources should be the sum of all JobVertex
resources in the group.
Here's the concrete proposal:
1. Add a physicalSlotResourceProfile in SlotProfile. If the task is not in a
shared slot, it is the task resource profile. Otherwise it is the slot sharing
group resource profile. It should be used for physical slot allocation. Rename
previous ResourceProfile to be taskResourceProfile for logical slot allocation.
2. SharedSlotOversubscribedException and its handling can be removed, including
part of the children slots releasing and re-allocating. This is because partial
fulfillment should not happen anymore with #1. A simple sanity check can be
kept for oversubscribing.
was:
With FLINK-14058, it is assumed that a shared slot should be large enough to be
used by one instance of each JobVertex in the group simultaneously.
To support it, a shared slot resources should be the sum of all JobVertex
resources in the group.
Here's the concrete proposal:
1. Add a physicalSlotResourceProfile in SlotProfile. It should be used for
physical slot allocation. Rename previous ResourceProfile to be
taskResourceProfile for logical slot allocation.
2. SharedSlotOversubscribedException and its handling can be removed, including
part of the children slots releasing and re-allocating. This is because partial
fulfillment should not happen anymore with #1. A simple sanity check can be
kept for oversubscribing.
> Allocate shared slot resources respecting the resources of all vertices in
> the group
> ------------------------------------------------------------------------------------
>
> Key: FLINK-14314
> URL: https://issues.apache.org/jira/browse/FLINK-14314
> Project: Flink
> Issue Type: Sub-task
> Components: Runtime / Coordination
> Affects Versions: 1.10.0
> Reporter: Zhu Zhu
> Assignee: Zhu Zhu
> Priority: Major
> Fix For: 1.10.0
>
>
> With FLINK-14058, it is assumed that a shared slot should be large enough to
> be used by one instance of each JobVertex in the group simultaneously.
> To support it, a shared slot resources should be the sum of all JobVertex
> resources in the group.
> Here's the concrete proposal:
> 1. Add a physicalSlotResourceProfile in SlotProfile. If the task is not in a
> shared slot, it is the task resource profile. Otherwise it is the slot
> sharing group resource profile. It should be used for physical slot
> allocation. Rename previous ResourceProfile to be taskResourceProfile for
> logical slot allocation.
> 2. SharedSlotOversubscribedException and its handling can be removed,
> including part of the children slots releasing and re-allocating. This is
> because partial fulfillment should not happen anymore with #1. A simple
> sanity check can be kept for oversubscribing.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)