[ 
https://issues.apache.org/jira/browse/TEZ-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14355938#comment-14355938
 ] 

Siddharth Seth commented on TEZ-776:
------------------------------------

One option would be to make these changes now and agree to move event storage 
into the EdgePlugin in a month or so (I can take this up): possibly before a 
0.7.1 release, and definitely before 0.8. The drawback is that users would 
have to change their edge plugins twice (Hive has one; I'm not sure about 
Pig, Cascading and others).
[~bikassaha] - I'm not sure whether the patch is an incompatible change, but 
is it possible to make these changes in a way that external EdgeManagers don't 
need to change? One option would be to provide a default implementation of the 
new methods that calls the existing APIs and picks out the relevant target 
task. That would be fairly inefficient, though.
The reason I ask is that the API changes in this patch and the API changes for 
moving event storage into the EdgePlugin will not be the same. If we can make 
this change in a compatible manner, and maybe add notes on the APIs saying 
they'll be removed soon (in a month or so, once the EdgePlugin storage changes 
are made), downstream projects would need to change their implementation only 
once.
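
To make the compatibility idea concrete, here is a rough sketch of a new 
per-destination method whose default implementation falls back on an existing 
route-to-everything method. All class and method names below are made up for 
illustration; they are not the actual Tez EdgeManager API or the API in the 
patch.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for an edge manager base class; NOT the real Tez API.
abstract class SketchEdgeManager {

    // Existing-style API (assumed shape): for one source event, fill in every
    // destination task index and the input indices the event maps to there.
    abstract void routeToAllDestinations(int sourceTaskIndex, int sourceOutputIndex,
                                         Map<Integer, List<Integer>> destTaskToInputIndices);

    // New-style, on-demand API (assumed shape): input indices for ONE
    // destination task, or an empty list if the event is not routed to it.
    // The default delegates to the old API and picks out the relevant target
    // task, so plugins that only implement the old method keep working.
    List<Integer> routeToDestination(int sourceTaskIndex, int sourceOutputIndex,
                                     int destTaskIndex) {
        Map<Integer, List<Integer>> all = new HashMap<>();
        routeToAllDestinations(sourceTaskIndex, sourceOutputIndex, all);
        List<Integer> indices = all.get(destTaskIndex);
        return indices == null ? new ArrayList<>() : indices;
    }
}

// A legacy-style plugin that only knows the old API: every source task feeds
// every destination task, using the source task index as the input index.
class LegacyBroadcastEdgeManager extends SketchEdgeManager {
    private final int numDestTasks;

    LegacyBroadcastEdgeManager(int numDestTasks) {
        this.numDestTasks = numDestTasks;
    }

    @Override
    void routeToAllDestinations(int sourceTaskIndex, int sourceOutputIndex,
                                Map<Integer, List<Integer>> destTaskToInputIndices) {
        for (int dest = 0; dest < numDestTasks; dest++) {
            destTaskToInputIndices
                .computeIfAbsent(dest, k -> new ArrayList<>())
                .add(sourceTaskIndex);
        }
    }
}
```

In this sketch every call to the default routeToDestination() rebuilds the 
full destination map just to read one entry, which is the inefficiency 
mentioned above. A plugin that cares about the cost could override the new 
method directly.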

> Reduce AM mem usage caused by storing TezEvents
> -----------------------------------------------
>
>                 Key: TEZ-776
>                 URL: https://issues.apache.org/jira/browse/TEZ-776
>             Project: Apache Tez
>          Issue Type: Sub-task
>            Reporter: Siddharth Seth
>            Assignee: Bikas Saha
>         Attachments: TEZ-776.ondemand.1.patch, TEZ-776.ondemand.2.patch, 
> TEZ-776.ondemand.3.patch, TEZ-776.ondemand.4.patch, TEZ-776.ondemand.5.patch, 
> TEZ-776.ondemand.patch, With_Patch_AM_hotspots.png, 
> With_Patch_AM_profile.png, Without_patch_AM_CPU_Usage.png, 
> events-problem-solutions.txt, with_patch_jmc_output_of_AM.png, 
> without_patch_jmc_output_of_AM.png
>
>
> This is open ended at the moment.
> A fair chunk of the AM heap is taken up by TezEvents (specifically 
> DataMovementEvents - 64 bytes per event).
> Depending on the connection pattern - this puts limits on the number of tasks 
> that can be processed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
