[ https://issues.apache.org/jira/browse/HIVE-23469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Attila Magyar reassigned HIVE-23469: ------------------------------------ > Use hostname + pod UID for shuffle manager caching > -------------------------------------------------- > > Key: HIVE-23469 > URL: https://issues.apache.org/jira/browse/HIVE-23469 > Project: Hive > Issue Type: Bug > Components: Tez > Reporter: Attila Magyar > Assignee: Attila Magyar > Priority: Major > > When a pod restarts, it uses the same hostname and shuffle port. Now when > fetcher threads connects to download the shuffle data it will use the cached > connection info and since the pod has died it's shuffle data will also get > cleaned up. When the pod restarts, it receives connection from clients to > download specific shuffle data but the daemon will not have it because of the > restart. > In ShuffleManager.java's knownSrcHosts the key should be updated to HostInfo > which is a combination of host+port and the host's unique ID. The host host > Id changes when a node is killed or restarted. > > > -- This message was sent by Atlassian Jira (v8.3.4#803005)