juliuszsompolski opened a new pull request, #42908:
URL: https://github.com/apache/spark/pull/42908

   ### What changes were proposed in this pull request?
   
   Deflake tests in ReattachableExecuteSuite and increase 
CONNECT_EXECUTE_REATTACHABLE_OBSERVER_RETRY_BUFFER_SIZE.
   
   ### Why are the changes needed?
   
   Two tests could be flaky, failing with `INVALID_CURSOR.POSITION_NOT_AVAILABLE`
errors. This happens when the server releases a response once it falls more than 
CONNECT_EXECUTE_REATTACHABLE_OBSERVER_RETRY_BUFFER_SIZE behind the latest 
response it sent. However, because of HTTP2 flow control, the responses could 
still be in transit. In the test suite, we were explicitly disconnecting the 
iterators and reconnecting them later. In some cases they could not reconnect, 
because the response they had last seen had fallen too far behind.
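
The failure mode described above can be illustrated with a toy model. This is a minimal sketch, not Spark's implementation: `ResponseBuffer`, `PositionNotAvailable`, and `replay_after_disconnect` are hypothetical names, response sizes are made-up byte counts, and the eviction rule (release cached responses while the bytes behind the latest response exceed the budget) is an assumption based only on the description in this PR.

```python
from collections import deque


class PositionNotAvailable(Exception):
    """Hypothetical stand-in for the INVALID_CURSOR.POSITION_NOT_AVAILABLE error."""


class ResponseBuffer:
    """Toy model of a server-side buffer of already-sent responses."""

    def __init__(self, retry_buffer_size):
        # Byte budget allowed behind the latest sent response (assumed semantics).
        self.retry_buffer_size = retry_buffer_size
        self.cached = deque()  # (index, size) pairs, oldest first
        self.cached_bytes = 0
        self.last_sent = 0

    def send(self, size):
        """Record a sent response, then release those that fell too far behind."""
        self.last_sent += 1
        self.cached.append((self.last_sent, size))
        self.cached_bytes += size
        # Bytes behind the latest response = everything cached except the latest.
        while len(self.cached) > 1 and self.cached_bytes - size > self.retry_buffer_size:
            _, released = self.cached.popleft()
            self.cached_bytes -= released

    def reattach(self, last_seen):
        """Return the indices to replay for a client that last saw `last_seen`."""
        oldest = self.cached[0][0] if self.cached else self.last_sent + 1
        if last_seen + 1 < oldest:
            raise PositionNotAvailable(f"response {last_seen + 1} was already released")
        return [index for index, _ in self.cached if index > last_seen]


def replay_after_disconnect(retry_buffer_size, last_seen, sizes=(40, 40, 40, 40, 40)):
    """Send all responses, then reattach a client that only saw `last_seen` of them."""
    buf = ResponseBuffer(retry_buffer_size)
    for size in sizes:
        buf.send(size)
    return buf.reattach(last_seen)
```

In this model, a client whose received responses lag behind what the server has sent (for example because they are still in transit) fails to reattach with a small budget, while a larger budget lets the same reattach succeed.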
   
   This PR not only changes the suite, but also adjusts the default config. This 
potentially makes reconnecting more robust. In normal situations, it should 
not lead to increased memory pressure, because the clients also release the 
responses using ReleaseExecute as soon as they are received. Normally, buffered 
responses should be freed by ReleaseExecute and this retry buffer is only a 
fallback mechanism. Therefore, it is safe to increase the default.
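
The memory argument above can be checked against a small simulation. This is a sketch under simplifying assumptions, not a measurement of Spark: `peak_cached_bytes` is a hypothetical helper, sizes are made-up byte counts, and the client is idealized as releasing each response immediately after it is sent.

```python
from collections import deque


def peak_cached_bytes(retry_buffer_size, sizes, release_on_receive):
    """Return the peak number of bytes the toy server keeps cached."""
    cached = deque()  # sizes of cached responses, oldest first
    cached_bytes = 0
    peak = 0
    for size in sizes:
        cached.append(size)
        cached_bytes += size
        # Fallback: release responses that fell more than the budget behind the latest.
        while len(cached) > 1 and cached_bytes - size > retry_buffer_size:
            cached_bytes -= cached.popleft()
        peak = max(peak, cached_bytes)
        if release_on_receive:
            # Idealized ReleaseExecute: the client acknowledges at once and the
            # server frees everything up to and including this response.
            cached.clear()
            cached_bytes = 0
    return peak
```

With prompt releases, the peak in this model is the size of a single response whatever the budget is. The budget only determines the peak when releases never arrive, which corresponds to the network-error case.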
   
   In practice, this would only have an effect in cases where there are actual 
network errors, and the increased buffer size should make the reconnects more 
robust in these cases.
   
   ### Does this PR introduce _any_ user-facing change?
   
   No.
   
   ### How was this patch tested?
   
   ReattachableExecuteSuite.
   Also ran manual experiments to see how far the response received by the client 
can be behind the response sent by the server (because of the HTTP2 flow control 
window).
   
   ### Was this patch authored or co-authored using generative AI tooling?
   
   No.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

