Purushottam Sinha created FLINK-39962:
-----------------------------------------

             Summary: Runtime: 
DeclarativeSlotPoolBridgeTest.testAcceptingOfferedSlotsWithoutResourceManagerConnected
 flaky
                 Key: FLINK-39962
                 URL: https://issues.apache.org/jira/browse/FLINK-39962
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Coordination
            Reporter: Purushottam Sinha


Problem
The test fails intermittently in CI with NegativeArraySizeException: -1, 
failing the core test module. It is a test-only threading race, not a 
production defect.

  Evidence
  - Stack: NegativeArraySizeException in LinkedHashMap$LinkedValues.toArray via 
DeclarativeSlotPoolBridge.cancelPendingRequests 
(DeclarativeSlotPoolBridge.java:228-229, new 
ArrayList<>(pendingRequests.values())) ← onClose (:176).
  - The bridge uses forMainThread(); onStart schedules 
checkIdleSlotTimeout/checkBatchSlotTimeout on that thread, but the test drives 
requestNewAllocatedSlot/offerSlots/close() from the test thread. A scheduled 
task mutates pendingRequests while close() copies it.
  - Observed on master commit 6627acb. CI: 
https://github.com/apache/flink/actions/runs/27864565978/job/82466897527

  Proposed fix
  - Drive the test lifecycle through componentMainThreadExecutor, or use a 
ManuallyTriggeredScheduledExecutor so timeout tasks cannot fire concurrently 
with close().



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to