[ 
https://issues.apache.org/jira/browse/TEZ-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14955759#comment-14955759
 ] 

Bikas Saha commented on TEZ-1692:
---------------------------------

bq. but it may not be the most efficient removal
??

bq. Do you see something specific that could cause a large increase in resource 
utilization
Memory wise I think we should be at parity with the latest patch. Likely also 
with cpu. But like I said, this can only be measured.
Not asking for a test case. Just a manual check with some logs (with and 
without) should be enough for a sanity check. Earlier we would be under 10ms 
tops with various tpcds queries. Just wanted to make sure we aren't regressing 
due to some non obvious reason.

> Reduce code duplication between TezMapredSplitsGrouper and 
> TezMapreduceSplitsGrouper
> ------------------------------------------------------------------------------------
>
>                 Key: TEZ-1692
>                 URL: https://issues.apache.org/jira/browse/TEZ-1692
>             Project: Apache Tez
>          Issue Type: Improvement
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>         Attachments: TEZ-1692.1.txt, TEZ-1692.2.txt, TEZ-1692.3.txt
>
>
> The two are almost identical - with lots of repeated logic. The main 
> difference being the mapred / mapreduce InputSplit being grouped.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to