[
https://issues.apache.org/jira/browse/YUNIKORN-2573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arthur Wang updated YUNIKORN-2573:
----------------------------------
Description:
[GitHub pipeline|https://github.com/apache/yunikorn-core/actions/runs/8770718393/job/24067600801]
GitHub CI occasionally fails.
Root cause:
[https://github.com/apache/yunikorn-core/blob/a1a10f8e8621288c6919aad269540b44c6e20227/pkg/scheduler/context.go#L665]
{code:java}
partition.updatePartitionResource(node.SetCapacity(resources.NewResourceFromProto(sr)))
{code}
We calculate the delta resources by updating the node capacity, and then apply that delta to the resources map in the partition.
These are two separate steps, so the new node capacity is visible before the partition total changes.
The test fails when it reads the partition total between the two steps, in the following order:
node.SetCapacity() ->
[waitForAvailableNodeResource()|https://github.com/apache/yunikorn-core/blob/a1a10f8e8621288c6919aad269540b44c6e20227/pkg/scheduler/tests/operation_test.go#L520]
->
[partitionInfo.GetTotalPartitionResource()|https://github.com/apache/yunikorn-core/blob/a1a10f8e8621288c6919aad269540b44c6e20227/pkg/scheduler/tests/operation_test.go#L525]
-> partition.updatePartitionResource()
was:
[github
pipeline|https://github.com/apache/yunikorn-core/actions/runs/8770718393/job/24067600801]
Github CI occasionally fail.
Root cause:
[https://github.com/apache/yunikorn-core/blob/a1a10f8e8621288c6919aad269540b44c6e20227/pkg/scheduler/context.go#L665]
`partition.updatePartitionResource(node.SetCapacity(resources.NewResourceFromProto(sr)))`
We calculate the delta resources by updating node capacity.
Then we resources map in partition.
> Flaky test TestUpdateNodeCapacityWithMultipleNodes
> --------------------------------------------------
>
> Key: YUNIKORN-2573
> URL: https://issues.apache.org/jira/browse/YUNIKORN-2573
> Project: Apache YuniKorn
> Issue Type: Bug
> Reporter: Arthur Wang
> Assignee: Arthur Wang
> Priority: Major