[ 
https://issues.apache.org/jira/browse/HADOOP-3116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12594790#action_12594790
 ] 

amar_kamat edited comment on HADOOP-3116 at 5/6/08 11:21 PM:
-------------------------------------------------------------

One simple thing we can do is to use a {{MAP}} from _id-within-job_ to _TIP_ 
instead of an array.

      was (Author: amar_kamat):
    One simple thing we can do is to use a {{MAP}} from _attempt-id_ to _TIP_ 
instead of an array.
  
> Investigate whether the array in the JobInProgress that holds TIP references 
> can be removed
> -------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-3116
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3116
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.17.0
>            Reporter: Devaraj Das
>             Fix For: 0.18.0
>
>
> The array, in JobInProgress, that holds the references to TIPs is required 
> now only to serve the clients (web UI via JSPs and JobClient APIs, that 
> traverse the array and create the output). The array can now be removed since 
> HADOOP-2119 introduces some datastructures for running/non-running TIPs and 
> we could probably add some more if required (e.g., for completed TIPs). That 
> way, we will gain when we have large jobs (many tasks) in the sense that we 
> don't have to go through the big array every time a client request is made. 
> We could instead iterate over the datastructures. Also, we should do these 
> traversals without locking the JobTracker to avoid cases where long traversal 
> leads to loss of heartbeats, etc. Some staleness at the client side (in favor 
> of improving the JobTracker's performance) is probably okay.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to