[ 
https://issues.apache.org/jira/browse/TEZ-776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14325647#comment-14325647
 ] 

Bikas Saha commented on TEZ-776:
--------------------------------

Yes. ConfigureEvent could be sent always. Perhaps resuse of 
InputDataInformationEvent.
Yes. They dont need to go to the same targetIndex. Which is why I am not 
exploring this option in priority.

Yes. on demand routing would trade memory for cpu overhead.

>From what I understand 3 is figuring out a compact way to encode the routing 
>table. So I am not sure how it is going to reduce the overhead of storing the 
>exploded events (reference overhead) or the events themselves (after exploding 
>them). Seemed like it depends on composite event routing and ondemand routing. 
>So its like a mix of subsets of 1 and 2 with the encoding used to reduce the 
>routing cpu overhead?

Moving routing into vertex plugins instead of edge plugins is shifting 
responsibilities architecturally. Not sure how it solves the event overhead or 
reference overhead. Making a hack in the shuffle plugin is probably not a good 
idea. Users can be extending it or using their own for shuffles when they want 
their own version of auto-reduce.

TezEvents are wrappers around data movement events. They are replicated only 
for composite dm events since they are exploded. tezevent store identical 
src/dest metainfo for the exploded events. Not sure about the math of reducing 
the ref count from 16 to 2-3. Not replicating them and sharing them across dm 
events may cause race conditions during rpc serialization.

> Reduce AM mem usage caused by storing TezEvents
> -----------------------------------------------
>
>                 Key: TEZ-776
>                 URL: https://issues.apache.org/jira/browse/TEZ-776
>             Project: Apache Tez
>          Issue Type: Sub-task
>            Reporter: Siddharth Seth
>            Assignee: Bikas Saha
>         Attachments: events-problem-solutions.txt
>
>
> This is open ended at the moment.
> A fair chunk of the AM heap is taken up by TezEvents (specifically 
> DataMovementEvents - 64 bytes per event).
> Depending on the connection pattern - this puts limits on the number of tasks 
> that can be processed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to