[ https://issues.apache.org/jira/browse/TEZ-1157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gopal V updated TEZ-1157:
-------------------------
    Attachment: TEZ-1157.7.patch

> Optimize broadcast :- Tasks pertaining to same job in same machine should not
> download multiple copies of broadcast data
> ------------------------------------------------------------------------------------------------------------------------
>
>                 Key: TEZ-1157
>                 URL: https://issues.apache.org/jira/browse/TEZ-1157
>             Project: Apache Tez
>          Issue Type: Sub-task
>            Reporter: Rajesh Balamohan
>            Assignee: Gopal V
>              Labels: performance
>         Attachments: TEZ-1152.WIP.patch, TEZ-1157.3.WIP.patch,
> TEZ-1157.4.WIP.patch, TEZ-1157.5.WIP.patch, TEZ-1157.6.patch,
> TEZ-1157.7.patch, TEZ-broadcast-shuffle+vertex-parallelism.patch
>
>
> Currently, tasks belonging to the same job and running on the same machine
> each download their own copy of the broadcast data. An optimization would be
> to download one copy per machine and have the remaining tasks refer to that
> downloaded copy.
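
The idea in the issue description (download once per machine, let the other tasks reuse the local copy) can be illustrated with a minimal sketch. This is not the code from the attached patches and does not use any Tez API: the class `LocalBroadcastCache`, the method `getOrFetch`, the `.data`/`.lock` file naming, and the `fetcher` callback are all hypothetical names chosen for illustration. It assumes the tasks share a local directory and that `key` is a file-name-safe identifier for one broadcast input.

```java
import java.io.IOException;
import java.nio.channels.FileChannel;
import java.nio.channels.FileLock;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.nio.file.StandardOpenOption;
import java.util.concurrent.Callable;

// Hypothetical helper, not part of Tez: keeps one local copy of each
// broadcast input in a directory shared by all tasks on the machine.
final class LocalBroadcastCache {
    private final Path cacheDir;

    LocalBroadcastCache(Path cacheDir) throws IOException {
        this.cacheDir = Files.createDirectories(cacheDir);
    }

    // Returns the path of the local copy for `key`, invoking `fetcher`
    // (the remote download, supplied by the caller) only if no copy exists yet.
    Path getOrFetch(String key, Callable<byte[]> fetcher) throws Exception {
        Path target = cacheDir.resolve(key + ".data");
        if (Files.exists(target)) {
            return target; // fast path: another task already downloaded it
        }
        Path lockFile = cacheDir.resolve(key + ".lock");
        // The class-level monitor serializes threads inside this JVM (file locks
        // are held per JVM, not per thread); the file lock serializes separate
        // task processes on the same machine.
        synchronized (LocalBroadcastCache.class) {
            try (FileChannel channel = FileChannel.open(lockFile,
                     StandardOpenOption.CREATE, StandardOpenOption.WRITE);
                 FileLock lock = channel.lock()) {
                if (Files.exists(target)) {
                    return target; // someone else finished while we waited
                }
                // Write to a temporary file first, then rename atomically so
                // readers never observe a partially written copy.
                Path tmp = Files.createTempFile(cacheDir, key, ".tmp");
                Files.write(tmp, fetcher.call());
                Files.move(tmp, target, StandardCopyOption.ATOMIC_MOVE);
                return target;
            }
        }
    }
}
```

In this sketch the first task to ask for a given key performs the download and every later task gets the path of the existing file. A real implementation would also have to handle cleanup when the job finishes, failed or partial downloads, and access control between jobs, none of which is shown here.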