Jose Luis Pedrosa created SPARK-28149:
-----------------------------------------
Summary: Disable negeative DNS caching.
Key: SPARK-28149
URL: https://issues.apache.org/jira/browse/SPARK-28149
Project: Spark
Issue Type: Improvement
Components: Kubernetes
Affects Versions: 2.4.3
Reporter: Jose Luis Pedrosa
By default JVM caches the failures for the DNS resolutions, by default is
cached by 10 seconds.
Alpine JDK used in the images for kubernetes has a default timout of 5 seconds.
This means that in clusters with slow init time (network sidecar pods, slow
network start up) executor will never run, because the first attempt to connect
to the driver will fail, and that failure will be cached, causing the retries
to happen in a tight loop without actually trying again.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]