rbalamohan opened a new pull request #78: URL: https://github.com/apache/tez/pull/78
https://issues.apache.org/jira/browse/TEZ-4245 Split information without any locality information (localhost/null/empty) should be treated equally, so that split grouping can do meaningful grouping based on cluster size. This is to avoid creating small split groups, which can significantly increase runtime due to sequential processing (i.e same map task getting lots of inputs and system ends up spending time in open/seek/close on objectstores). ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
