[
https://issues.apache.org/jira/browse/TEZ-1157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011277#comment-14011277
]
Bikas Saha commented on TEZ-1157:
---------------------------------
For significant changes like this its usually beneficial to discuss the
problems, solutions and consider some alternatives. This helps others
understand the situation better and allows the implementer to get different
perspectives that they had not considered. A patch can sometimes be hard to
discuss because then the code has to be carefully traced to understand whats
happening and folks start deep diving on that without spending time
understanding the problems and alternatives.
> Optimize broadcast :- Tasks pertaining to same job in same machine should not
> download multiple copies of broadcast data
> ------------------------------------------------------------------------------------------------------------------------
>
> Key: TEZ-1157
> URL: https://issues.apache.org/jira/browse/TEZ-1157
> Project: Apache Tez
> Issue Type: Sub-task
> Reporter: Rajesh Balamohan
> Assignee: Rajesh Balamohan
> Labels: performance
> Attachments: TEZ-1152.WIP.patch
>
>
> Currently tasks (belonging to same job) running in the same machine download
> its own copy of broadcast data. Optimization could be to download one copy
> in the machine, and the rest of the tasks can refer to this downloaded copy.
--
This message was sent by Atlassian JIRA
(v6.2#6252)