[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli updated MAPREDUCE-3511:
-----------------------------------------------

    Attachment: MAPREDUCE-3511-20120109.txt

Thanks for looking into it, Robert!

bq. If the old counter API is going to be the long term fix then perhaps we 
should not mark is as deprecated any more.
The counters that I am now using are from {{mapreduce.Counters}} which aren't 
deprecated. Irrespective of this, it makes sense to undeprecate other old v1 
stuff (mapped.*), I'll propose and merge MAPREDUCE-1735 into trunk/23 also.

bq. [...] a minor performance improvement [...] I would prefer to see the datum 
start out as null, and only have its fields set if it is not null, inside 
getDatum.
Not sure if {{getDatum()}} will be called multiple times as each event will be 
logged only once. Makes sense to implement your proposal anyways just to be 
sure. Done.

bq. Why was TokenCache.java modified at all? It does not seem to be related to 
this JIRA.
It isn't. But without that, due to HADOOP-7963/MAPREDUCE-3639 the patch 
couldn't be tested on cluster. I'll revert those changes from the patch.

bq. You added a TODO in CompletedTask.java and CompletedJob.java
Done, added that to remind myself to avoid any clones :) Removing the clones 
now.

bq. good catch on TestHsWebServicesTasks.java, TestAMWebServicesAttempts.java, 
TestHsWebServicesTasks.java 
Am surprised Jenkins didn't catch these.

bq.  I also don't think we need any more tests because, all we are doing is 
reducing memory usage, which is very hard to write a unit test for.
Yes, +1 :)

bq. Inside JobHistoryEventHandler.java you added in // TODO: Only job-counters 
is enough? How about the myriad clones in this code-path. is this TODO still 
needed?
Avoided that extra clone for a finished Job.
                
> Counters occupy a good part of AM heap
> --------------------------------------
>
>                 Key: MAPREDUCE-3511
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
>             Project: Hadoop Map/Reduce
>          Issue Type: Sub-task
>          Components: mr-am, mrv2
>    Affects Versions: 0.23.0
>            Reporter: Siddharth Seth
>            Assignee: Vinod Kumar Vavilapalli
>            Priority: Blocker
>             Fix For: 0.23.1
>
>         Attachments: MAPREDUCE-3511-20120107.1.txt, 
> MAPREDUCE-3511-20120109.txt
>
>
> Per task counters seem to be occupying a good part of an AMs heap. Looks like 
> more than 50% of what's used by a TaskAttemptImpl object.
> This could be optimized by interning strings or possibly using mrv1 counters 
> which are optimized. Currently counters are converted from mrv1 to mrv2 
> format for in memory storage. The conversion could be delayed till it's 
> actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to