[ 
https://issues.apache.org/jira/browse/IMPALA-4069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong resolved IMPALA-4069.
-----------------------------------
    Resolution: Won't Fix

KRPC basically solved this issue. We could potentially create connections as 
soon as we're aware of another executor via the statestore but I don't think 
it's super-important.

> Introduce startup option to create and cache backend connections on startup
> ---------------------------------------------------------------------------
>
>                 Key: IMPALA-4069
>                 URL: https://issues.apache.org/jira/browse/IMPALA-4069
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Distributed Exec
>    Affects Versions: Impala 2.5.0
>            Reporter: Mostafa Mokhtar
>            Priority: Major
>              Labels: scalability
>
> Add impalad startup flag specifying the number of connections per backend to 
> create and cache. 
> After startup impala-server.backends.client-cache.total-clients should 
> reflect number of backends x cached connections per backend. 
> [[email protected]] description of the problem
> {code}
> Internal Impala network connections between nodes for query execution are not 
> multiplexed. This means as the number of queries increase the number of 
> network connections increases between Impala executors. With higher #nodes, 
> the combination of query bursts and number of executors can lead to lots of 
> new connections attempts. For example, a query with 10+joins on a 100-node 
> cluster could require 1000+ connections simultaneously on coordinator.  When 
> the spike is too high or if there is not sufficient CPU available to handle 
> the bursts, this causes connection failures. 
> The total number of connections does not seem to be the issue, but there is 
> currently a practical limit on the number of simultaneous new concurrent 
> connection TCP request spikes at once. 
> Impala caches backend connections and reuse them later. With cache, the 
> simultaneous spikes of new connection request is only those above previous 
> established maximum.
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to