[ 
https://issues.apache.org/jira/browse/HADOOP-3116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Devaraj Das updated HADOOP-3116:
--------------------------------

    Description: The array, in JobInProgress, that holds the references to TIPs 
is required now only to serve the clients (web UI via JSPs and JobClient APIs, 
that traverse the array and create the output). The array can now be removed 
since HADOOP-2119 introduces some data structures for running/non-running TIPs 
and we could probably add some more if required (e.g., for completed TIPs). 
That way, we will gain when we have large jobs (many tasks) in the sense that 
we don't have to go through the big array every time a client request is made. 
We could instead iterate over the data structures. Also, we should do these 
traversals without locking the JobTracker to avoid cases where long traversal 
leads to loss of heartbeats, etc. Some staleness at the client side (in favor 
of improving the JobTracker's performance) is probably okay.  (was: The array 
that holds the references to TIPs is required now only to serve the clients 
(web UI via JSPs and JobClient APIs, that traverse the array and create the 
output). The array can now be removed since HADOOP-2119 introduces some 
data structures for running/non-running TIPs and we could probably add some more 
if required (e.g., for completed TIPs). That way, we will gain when we have 
large jobs (many tasks) in the sense that we don't have to go through the big 
array every time a client request is made. We could instead iterate over the 
data structures. Also, we should do these traversals without locking the 
JobTracker to avoid cases where long traversal leads to loss of heartbeats, 
etc. Some staleness at the client side (in favor of improving the JobTracker's 
performance) is probably okay.)
        Summary: Investigate whether the array in the JobInProgress that holds 
TIP references can be removed  (was: Investigate whether the array that holds 
TIP references can be removed)

> Investigate whether the array in the JobInProgress that holds TIP references 
> can be removed
> -------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-3116
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3116
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.17.0
>            Reporter: Devaraj Das
>             Fix For: 0.18.0
>
>
> The array, in JobInProgress, that holds the references to TIPs is required 
> now only to serve the clients (web UI via JSPs and JobClient APIs, that 
> traverse the array and create the output). The array can now be removed since 
> HADOOP-2119 introduces some data structures for running/non-running TIPs and 
> we could probably add some more if required (e.g., for completed TIPs). That 
> way, we will gain when we have large jobs (many tasks) in the sense that we 
> don't have to go through the big array every time a client request is made. 
> We could instead iterate over the data structures. Also, we should do these 
> traversals without locking the JobTracker to avoid cases where long traversal 
> leads to loss of heartbeats, etc. Some staleness at the client side (in favor 
> of improving the JobTracker's performance) is probably okay.
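
To make the proposal above concrete, here is a rough sketch of per-state collections that clients can read without taking a global lock. It is only an illustration under stated assumptions: TipIndexSketch, State, add, move and snapshot are hypothetical names, not the JobInProgress or HADOOP-2119 code, and TIPs are reduced to plain string ids.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch: none of these names come from Hadoop itself.
class TipIndexSketch {
    enum State { NON_RUNNING, RUNNING, COMPLETED }

    // One concurrent set per state, instead of one big array of all TIPs.
    private final Set<String> nonRunning = ConcurrentHashMap.newKeySet();
    private final Set<String> running = ConcurrentHashMap.newKeySet();
    private final Set<String> completed = ConcurrentHashMap.newKeySet();

    private Set<String> setFor(State s) {
        switch (s) {
            case RUNNING: return running;
            case COMPLETED: return completed;
            default: return nonRunning;
        }
    }

    // New tasks start out as non-running.
    void add(String tipId) {
        nonRunning.add(tipId);
    }

    // Scheduler-side state change. The two steps are not atomic, so a
    // concurrent reader may observe a slightly stale or inconsistent view.
    void move(String tipId, State from, State to) {
        setFor(to).add(tipId);
        setFor(from).remove(tipId);
    }

    // Client-side read: copies one set without holding any global lock.
    // The copy is weakly consistent, which is the staleness the issue accepts.
    List<String> snapshot(State s) {
        return new ArrayList<>(setFor(s));
    }

    public static void main(String[] args) {
        TipIndexSketch index = new TipIndexSketch();
        index.add("tip_0");
        index.add("tip_1");
        index.move("tip_0", State.NON_RUNNING, State.RUNNING);
        System.out.println("running: " + index.snapshot(State.RUNNING));
        System.out.println("non-running: " + index.snapshot(State.NON_RUNNING));
    }
}
```

A client request for running tasks then touches only the running set, so its cost scales with that set rather than with the total number of tasks in the job, and heartbeat processing is not blocked while the copy is made.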

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
