Rajesh Balamohan created TEZ-4156:
-------------------------------------
Summary: Fix Tez to reuse IPC connections
Key: TEZ-4156
URL: https://issues.apache.org/jira/browse/TEZ-4156
Project: Apache Tez
Issue Type: Improvement
Reporter: Rajesh Balamohan
When tracking DAG progress, TezClientUtils ends up creating new remote user.
Because of this new UGI creation, IPC connections are not reused internally.
https://github.com/apache/tez/blob/master/tez-api/src/main/java/org/apache/tez/client/TezClientUtils.java#L965
More info from Hadoop side:
In hadoop's IPC layer, connectionIds are checked based on UserGroupInformation.
https://github.com/apache/hadoop/blob/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/ipc/Client.java#L1600
However, UserGroupInformation comparison is based on ==
https://github.com/apache/hadoop/blob/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/security/UserGroupInformation.java#L1789
--
This message was sent by Atlassian Jira
(v8.3.4#803005)