Rajesh Balamohan created TEZ-2172:
-------------------------------------

             Summary: FetcherOrderedGrouped using List to store 
InputAttemptIdentifier can lead to some inefficiency during remove() operation
                 Key: TEZ-2172
                 URL: https://issues.apache.org/jira/browse/TEZ-2172
             Project: Apache Tez
          Issue Type: Improvement
            Reporter: Rajesh Balamohan


As part of fixing TEZ-2001, FetcherOrderedGrouped stores InputAttemptIdentifier 
in List.  This can lead to some inefficiency - since the size of this list can 
be ~30, and remove() calls can be expensive. 

Option 1:  by using the spillId in the hashCode - or a wrapping structure for 
just this. However, SpillId can not be added to the hashCode as it would break 
ShuffleScheduler shuffleInfoEventsMap. 

Option 2: consider using Map with an identifier. 

Need to consider other options as well. Creating this jira as a placeholder to 
fix this issue.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to