[
https://issues.apache.org/jira/browse/SPARK-23182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-23182.
-------------------------------
Resolution: Fixed
Fix Version/s: 3.0.0
Issue resolved by pull request 20512
[https://github.com/apache/spark/pull/20512]
> Allow enabling of TCP keep alive for RPC connections
> ----------------------------------------------------
>
> Key: SPARK-23182
> URL: https://issues.apache.org/jira/browse/SPARK-23182
> Project: Spark
> Issue Type: Improvement
> Components: Spark Core
> Affects Versions: 2.2.2, 2.4.0
> Reporter: Petar Petrov
> Assignee: Petar Petrov
> Priority: Minor
> Fix For: 3.0.0
>
>
> We rely heavily on preemptible worker machines in GCP/GCE. These machines
> disappear without closing the TCP connections to the master which increases
> the number of established connections and new workers can not connect because
> of "Too many open files" on the master.
> To solve the problem we need to enable TCP keep alive for the RPC connections
> to the master but it's not possible to do so via configuration.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]