[
https://issues.apache.org/jira/browse/YUNIKORN-2573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arthur Wang updated YUNIKORN-2573:
----------------------------------
Description:
[github
pipeline|https://github.com/apache/yunikorn-core/actions/runs/8770718393/job/24067600801]
Github CI occasionally fail.
Root cause:
[https://github.com/apache/yunikorn-core/blob/a1a10f8e8621288c6919aad269540b44c6e20227/pkg/scheduler/context.go#L665]
`partition.updatePartitionResource(node.SetCapacity(resources.NewResourceFromProto(sr)))`
We calculate the delta resources by updating node capacity.
Then we resources map in partition.
was:
[github
pipeline|https://github.com/apache/yunikorn-core/actions/runs/8770718393/job/24067600801]
Github CI occasionally fail.
Still working on finding root cause.
Since there always an error or warning from scheduler health check when running
multiple tests at the same time,
maybe it's some test setting issue.
> Flaky test TestUpdateNodeCapacityWithMultipleNodes
> --------------------------------------------------
>
> Key: YUNIKORN-2573
> URL: https://issues.apache.org/jira/browse/YUNIKORN-2573
> Project: Apache YuniKorn
> Issue Type: Bug
> Reporter: Arthur Wang
> Assignee: Arthur Wang
> Priority: Major
>
> [github
> pipeline|https://github.com/apache/yunikorn-core/actions/runs/8770718393/job/24067600801]
> Github CI occasionally fail.
>
> Root cause:
> [https://github.com/apache/yunikorn-core/blob/a1a10f8e8621288c6919aad269540b44c6e20227/pkg/scheduler/context.go#L665]
> `partition.updatePartitionResource(node.SetCapacity(resources.NewResourceFromProto(sr)))`
> We calculate the delta resources by updating node capacity.
> Then we resources map in partition.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]