[ https://issues.apache.org/jira/browse/SPARK-27989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jose Luis Pedrosa updated SPARK-27989:
--------------------------------------
Description:
Any failure in the executor when trying to connect to the driver makes a
connection from that process impossible, which triggers the scheduling of
another executor.
was:
Due to Java's caching of negative DNS resolutions (failed requests are never
retried), any DNS failure when trying to connect to the driver makes a
connection from that process impossible.
This happens especially in Kubernetes, where network setup of pods can take some
time,
> Add retries on the connection to the driver
> -------------------------------------------
>
> Key: SPARK-27989
> URL: https://issues.apache.org/jira/browse/SPARK-27989
> Project: Spark
> Issue Type: Improvement
> Components: Spark Core
> Affects Versions: 2.4.3
> Reporter: Jose Luis Pedrosa
> Priority: Minor
>
>
> Any failure in the executor when trying to connect to the driver makes a
> connection from that process impossible, which triggers the scheduling of
> another executor.
>
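For illustration only, the kind of retry the issue title asks for could look
roughly like the sketch below. This is not Spark's code or the eventual patch:
the class name DriverConnectRetry, the method connectWithRetry, and the
parameters maxAttempts and initialDelayMs are all hypothetical, and the
"connect" step is just a caller-supplied action. It retries a failing action a
bounded number of times, doubling the wait between attempts, instead of giving
up on the first failure.

```java
import java.util.concurrent.Callable;

// Hypothetical helper, not part of Spark: retries an action with exponential backoff.
final class DriverConnectRetry {

    private DriverConnectRetry() {}

    // Runs `connect` up to `maxAttempts` times, sleeping between failed attempts.
    // The delay starts at `initialDelayMs` and doubles after each failure.
    static <T> T connectWithRetry(Callable<T> connect, int maxAttempts, long initialDelayMs) {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        long delayMs = initialDelayMs;
        Exception lastFailure = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return connect.call();
            } catch (Exception e) {
                lastFailure = e;
                if (attempt == maxAttempts) {
                    break; // no attempts left, report the failure below
                }
                try {
                    Thread.sleep(delayMs);
                } catch (InterruptedException ie) {
                    // Preserve the interrupt flag and stop retrying.
                    Thread.currentThread().interrupt();
                    throw new IllegalStateException("Interrupted while waiting to retry", ie);
                }
                delayMs *= 2;
            }
        }
        throw new IllegalStateException(
            "Could not connect after " + maxAttempts + " attempts", lastFailure);
    }
}
```

A caller would wrap whatever performs the actual connection, for example
DriverConnectRetry.connectWithRetry(() -> openConnection(), 5, 500), where
openConnection is again a placeholder. Whether bounded retries are enough
depends on how long the underlying failure (for example, pod network setup)
lasts.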
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)