[ 
https://issues.apache.org/jira/browse/IMPALA-3189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16990091#comment-16990091
 ] 

Todd Lipcon commented on IMPALA-3189:
-------------------------------------

This should be much less of a problem with KRPC, since we maintain 
long-running connections between nodes. Do people still see this issue on the 
first query after startup?

> Address scalability issue with N^2 KDC requests on cluster startup
> ------------------------------------------------------------------
>
>                 Key: IMPALA-3189
>                 URL: https://issues.apache.org/jira/browse/IMPALA-3189
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Distributed Exec, Security
>    Affects Versions: Impala 2.5.0
>            Reporter: Henry Robinson
>            Priority: Critical
>              Labels: kerberos, scalability
>
> When Impala runs a query that shuffles data amongst all nodes in a 
> Kerberos-secured cluster, every node needs to acquire a service ticket from 
> the TGS for every other node, so the number of requests grows as N^2 with 
> cluster size. In a cluster of 100 nodes or more, this can overwhelm the KDC, 
> and queries can exit with an error ("Could not contact KDC for realm").
> A simple workaround is to run a warm-up query until it succeeds (which can 
> take a few minutes after cluster startup). The KDC can also be scaled (e.g. 
> with secondary KDC nodes).
> Impala could also either force a TGS request on start-up in a staggered 
> fashion, or move to recommending SSL + client certificates for 
> server<->server communication.
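
To make the N^2 arithmetic and the staggered start-up idea concrete, here is a 
minimal Python sketch. It is not Impala code: the function names, the 
300-second window, and the use of a uniform random delay are all assumptions 
chosen for illustration. It only computes a request count and a schedule, and 
does not talk to a KDC.

```python
import random


def ticket_requests(num_nodes: int) -> int:
    """Service-ticket requests if every node contacts every other node once."""
    return num_nodes * (num_nodes - 1)


def average_request_rate(num_nodes: int, window_seconds: float) -> float:
    """Average KDC load (requests/second) if requests are spread over a window."""
    return ticket_requests(num_nodes) / window_seconds


def staggered_schedule(peers, window_seconds, seed=None):
    """Assign each peer a random delay in [0, window_seconds], earliest first.

    Hypothetical helper: a node would wait for each delay and then request a
    ticket for that peer, instead of requesting all tickets at once.
    """
    rng = random.Random(seed)
    schedule = [(rng.uniform(0, window_seconds), peer) for peer in peers]
    schedule.sort()
    return schedule


if __name__ == "__main__":
    print(ticket_requests(100))            # 100 * 99 requests in total
    print(average_request_rate(100, 300))  # average rate over a 5-minute window
    for delay, peer in staggered_schedule(["node-1", "node-2", "node-3"], 300, seed=1):
        print(f"{delay:7.1f}s  {peer}")
```

With 100 nodes the unstaggered case is 9,900 requests arriving at roughly the 
same time; spreading them over an assumed 300-second window averages 33 
requests per second, which is the kind of trade-off a staggered warm-up would 
be tuning.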



