[ 
https://issues.apache.org/jira/browse/IMPALA-6432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sailesh Mukil resolved IMPALA-6432.
-----------------------------------
       Resolution: Fixed
    Fix Version/s: Impala 2.12.0

Yes, closing it.

> Default rpc_negotiation_timeout_ms may cause queries to fail on large clusters
> ------------------------------------------------------------------------------
>
>                 Key: IMPALA-6432
>                 URL: https://issues.apache.org/jira/browse/IMPALA-6432
>             Project: IMPALA
>          Issue Type: Sub-task
>          Components: Distributed Exec
>    Affects Versions: Impala 2.11.0
>            Reporter: Mostafa Mokhtar
>            Assignee: Sailesh Mukil
>            Priority: Critical
>             Fix For: Impala 2.12.0
>
>
> While running the stress test on a 140 node secure cluster with "default" 
> configs queries were failing with negotiation timeouts.
> With rpc_negotiation_timeout_ms=60000 queries appear to run fine, ideally we 
> should understand why this takes very long and bump the default to a more 
> reasonable value. 
> {code}
> W0121 16:51:04.014796 156630 negotiation.cc:302] Failed RPC negotiation. 
> Trace:
> 0121 16:51:00.254978 (+ 0us) reactor.cc:481] Submitting negotiation task for 
> server connection from 10.17.221.35:38297
> 0121 16:51:03.194696 (+2939718us) server_negotiation.cc:169] Beginning 
> negotiation
> 0121 16:51:03.194697 (+ 1us) server_negotiation.cc:358] Waiting for 
> connection header
> 0121 16:51:03.194708 (+ 11us) server_negotiation.cc:366] Connection header 
> received
> 0121 16:51:03.194720 (+ 12us) server_negotiation.cc:322] Received NEGOTIATE 
> NegotiatePB request
> 0121 16:51:03.194721 (+ 1us) server_negotiation.cc:407] Received NEGOTIATE 
> request from client
> 0121 16:51:03.194725 (+ 4us) server_negotiation.cc:334] Sending NEGOTIATE 
> NegotiatePB response
> 0121 16:51:03.194761 (+ 36us) server_negotiation.cc:190] Negotiated authn=SASL
> 0121 16:51:03.206182 (+ 11421us) server_negotiation.cc:322] Received 
> SASL_INITIATE NegotiatePB request
> 0121 16:51:03.206183 (+ 1us) server_negotiation.cc:754] Received 
> SASL_INITIATE request from client
> 0121 16:51:03.206184 (+ 1us) server_negotiation.cc:765] Client requested to 
> use mechanism: GSSAPI
> 0121 16:51:03.206190 (+ 6us) server_negotiation.cc:794] Calling 
> sasl_server_start()
> 0121 16:51:03.990155 (+783965us) server_negotiation.cc:334] Sending 
> SASL_CHALLENGE NegotiatePB response
> 0121 16:51:04.014736 (+ 24581us) negotiation.cc:293] Negotiation complete: 
> Timed out: Server connection negotiation failed: server connection from 
> 10.17.221.35:38297: BlockingWrite timed out
> Metrics: \{"server-negotiator.queue_time_us":2939697}
> W0121 16:51:04.014961 156629 negotiation.cc:302] Failed RPC negotiation. 
> Trace:
> 0121 16:51:00.512560 (+ 0us) reactor.cc:481] Submitting negotiation task for 
> server connection from 10.17.232.27:38602
> 0121 16:51:03.502222 (+2989662us) server_negotiation.cc:169] Beginning 
> negotiation
> 0121 16:51:03.502223 (+ 1us) server_negotiation.cc:358] Waiting for 
> connection header
> 0121 16:51:03.502231 (+ 8us) server_negotiation.cc:366] Connection header 
> received
> 0121 16:51:03.502242 (+ 11us) server_negotiation.cc:322] Received NEGOTIATE 
> NegotiatePB request
> 0121 16:51:03.502243 (+ 1us) server_negotiation.cc:407] Received NEGOTIATE 
> request from client
> 0121 16:51:03.502247 (+ 4us) server_negotiation.cc:334] Sending NEGOTIATE 
> NegotiatePB response
> 0121 16:51:03.502272 (+ 25us) server_negotiation.cc:190] Negotiated authn=SASL
> 0121 16:51:03.506579 (+ 4307us) server_negotiation.cc:322] Received 
> SASL_INITIATE NegotiatePB request
> 0121 16:51:03.506581 (+ 2us) server_negotiation.cc:754] Received 
> SASL_INITIATE request from client
> 0121 16:51:03.506581 (+ 0us) server_negotiation.cc:765] Client requested to 
> use mechanism: GSSAPI
> 0121 16:51:03.506588 (+ 7us) server_negotiation.cc:794] Calling 
> sasl_server_start()
> 0121 16:51:04.003539 (+496951us) server_negotiation.cc:334] Sending 
> SASL_CHALLENGE NegotiatePB response
> 0121 16:51:04.014903 (+ 11364us) negotiation.cc:293] Negotiation complete: 
> Timed out: Server connection negotiation failed: server connection from 
> 10.17.232.27:38602: BlockingWrite timed out
> Metrics: \{"server-negotiator.queue_time_us":2989644}
> W0121 16:51:04.026866 156629 negotiation.cc:302] Failed RPC negotiation. 
> Trace:
> 0121 16:51:00.796311 (+ 0us) reactor.cc:481] Submitting negotiation task for 
> server connection from 10.17.232.48:52481
> 0121 16:51:04.026814 (+3230503us) server_negotiation.cc:169] Beginning 
> negotiation
> 0121 16:51:04.026815 (+ 1us) server_negotiation.cc:358] Waiting for 
> connection header
> 0121 16:51:04.026839 (+ 24us) negotiation.cc:293] Negotiation complete: Timed 
> out: Server connection negotiation failed: server connection from 
> 10.17.232.48:52481
> Metrics: \{"server-negotiator.queue_time_us":3230488}
> W0121 16:51:04.027504 156632 negotiation.cc:302] Failed RPC negotiation. 
> Trace:
> 0121 16:51:00.807588 (+ 0us) reactor.cc:481] Submitting negotiation task for 
> server connection from 10.17.232.43:55531
> 0121 16:51:04.027449 (+3219861us) server_negotiation.cc:169] Beginning 
> negotiation
> 0121 16:51:04.027450 (+ 1us) server_negotiation.cc:358] Waiting for 
> connection header
> 0121 16:51:04.027481 (+ 31us) negotiation.cc:293] Negotiation complete: Timed 
> out: Server connection negotiation failed: server connection from 
> 10.17.232.43:55531
> Metrics: \{"server-negotiator.queue_time_us":3219845}
> W0121 16:51:04.028131 156631 negotiation.cc:302] Failed RPC negotiation. 
> Trace:
> 0121 16:51:00.809899 (+ 0us) reactor.cc:481] Submitting negotiation task for 
> server connection from 10.17.234.52:41207
> 0121 16:51:04.028062 (+3218163us) server_negotiation.cc:169] Beginning 
> negotiation
> 0121 16:51:04.028064 (+ 2us) server_negotiation.cc:358] Waiting for 
> connection header
> 0121 16:51:04.028094 (+ 30us) negotiation.cc:293] Negotiation complete: Timed 
> out: Server connection negotiation failed: server connection from 
> 10.17.234.52:41207
> Metrics: \{"server-negotiator.queue_time_us":3218143}
> W0121 16:51:04.028815 156630 negotiation.cc:302] Failed RPC negotiation. 
> Trace:
> 0121 16:51:00.832191 (+ 0us) reactor.cc:481] Submitting negotiation task for 
> server connection from 10.17.234.39:40621
> 0121 16:51:04.028765 (+3196574us) server_negotiation.cc:169] Beginning 
> negotiation
> 0121 16:51:04.028766 (+ 1us) server_negotiation.cc:358] Waiting for 
> connection header
> 0121 16:51:04.028793 (+ 27us) negotiation.cc:293] Negotiation complete: Timed 
> out: Server connection negotiation failed: server connection from 
> 10.17.234.39:40621
> Metrics: \{"server-negotiator.queue_time_us":3196559}
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to