[
https://issues.apache.org/jira/browse/SPARK-38965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wan Kun updated SPARK-38965:
----------------------------
Description:
We should retry transfer blocks if *errorHandler.shouldRetryError(e)* return
true,
Even though that exception may not a IOException, for example:
{code:java}
org.apache.spark.network.server.BlockPushNonFatalFailure: Block
shufflePush_0_0_3316_5647 experienced merge collision on the server side
{code}
was:
For those exceptions which errorHandler.shouldRetryError(e) return true, we
should retry transfer blocks.
Even though that exception may not a IOException, for example:
{code:java}
org.apache.spark.network.server.BlockPushNonFatalFailure: Block
shufflePush_0_0_3316_5647 experienced merge collision on the server side
{code}
> Retry transfer blocks for exceptions listed in the error handler
> -----------------------------------------------------------------
>
> Key: SPARK-38965
> URL: https://issues.apache.org/jira/browse/SPARK-38965
> Project: Spark
> Issue Type: Bug
> Components: Shuffle
> Affects Versions: 3.3.0
> Reporter: Wan Kun
> Priority: Minor
>
> We should retry transfer blocks if *errorHandler.shouldRetryError(e)* return
> true,
> Even though that exception may not a IOException, for example:
> {code:java}
> org.apache.spark.network.server.BlockPushNonFatalFailure: Block
> shufflePush_0_0_3316_5647 experienced merge collision on the server side
> {code}
--
This message was sent by Atlassian Jira
(v8.20.7#820007)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]