Yu-Lin Chen created YUNIKORN-2043:
-------------------------------------
Summary: SimplePreemptor e2e test doesn’t really wait for pod’s
Killing event
Key: YUNIKORN-2043
URL: https://issues.apache.org/jira/browse/YUNIKORN-2043
Project: Apache YuniKorn
Issue Type: Bug
Components: test - e2e
Reporter: Yu-Lin Chen
Assignee: Yu-Lin Chen
In the e2e simple_preemptor_test, the timeout for ‘waiting Killing
event’ is currently specified in nanoseconds:
# 120 ns:
[test/e2e/simple_preemptor/simple_preemptor_test.go#L169|https://github.com/apache/yunikorn-k8shim/blob/master/test/e2e/simple_preemptor/simple_preemptor_test.go#L169]
# 1200 ns:
[test/e2e/simple_preemptor/simple_preemptor_test.go#L215|https://github.com/apache/yunikorn-k8shim/blob/master/test/e2e/simple_preemptor/simple_preemptor_test.go#L215]
The polling interval is
Millisecond*100 ([code|https://github.com/apache/yunikorn-k8shim/blob/master/test/e2e/framework/helpers/k8s/k8s_utils.go#L725]).
Both timeouts are far shorter than one polling interval, so there is no actual 'waiting' happening here.
This issue triggers an error in the CI even though the pod has been killed.
(Note: Not all SimplePreemptor failures are caused by this issue; there are still
other unknown issues to be investigated.)
CI error:
[https://github.com/apache/yunikorn-k8shim/actions/runs/6495711809/job/17641528620#step:5:2046]
{code}
• [FAILED] [15.593 seconds]
SimplePreemptor [It] Verify_basic_simple_preemption. Use case: Only one pod is
running and same pod has been selected as victim
/home/runner/work/yunikorn-k8shim/yunikorn-k8shim/test/e2e/simple_preemptor/simple_preemptor_test.go:148
[FAILED] Unexpected error:
<context.deadlineExceededError>:
context deadline exceeded
{}
occurred
In [It] at:
/home/runner/work/yunikorn-k8shim/yunikorn-k8shim/test/e2e/simple_preemptor/simple_preemptor_test.go:170
@ 10/12/23
{code}