[ 
https://issues.apache.org/jira/browse/HADOOP-3961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12664066#action_12664066
 ] 

Xing Shi commented on HADOOP-3961:
----------------------------------

double blowupOnThisTask = ts.getOutputSize() /
((double) tip.getMapInputSize() + 1);

For example, tip.getMapInputSize() = 0 and ts.getOutputSize() = 15000 (I 
don't know why it is 15000).
I am using streaming with -mapper "cat" -reducer "cat".
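
To make the arithmetic concrete, here is a minimal sketch of the expression quoted above, evaluated with the values from this report. The class and method names are hypothetical and only illustrate the formula; this is not the actual ResourceEstimator code.

```java
public class BlowupSketch {
    // Same expression as quoted above: the +1 keeps the divisor non-zero
    // when a map task reports an input size of 0.
    static double blowup(long outputSize, long mapInputSize) {
        return outputSize / ((double) mapInputSize + 1);
    }

    public static void main(String[] args) {
        // Values from this report: input size 0, output size 15000.
        System.out.println(blowup(15000L, 0L));     // 15000 / 1
        // For comparison, a task whose output is about as large as its input.
        System.out.println(blowup(15000L, 14999L)); // 15000 / 15000
    }
}
```

With an input size of 0 the divisor is just the +1 guard, so the measured blowup equals the raw output size in bytes.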

Here is the log:
2009-01-15 12:21:02,182 INFO  mapred.ResourceEstimator (ResourceEstimator.java:updateWithCompletedTask(78)) - measured blowup on task_200901141908_0059_m_002291 was 15000/1 = 15000.0
2009-01-15 12:21:02,182 INFO  mapred.ResourceEstimator (ResourceEstimator.java:updateWithCompletedTask(90)) - new estimate is blowup = 23.271476813887574
2009-01-15 12:21:02,183 INFO  mapred.JobInProgress (JobInProgress.java:findNewMapTask(1746)) - Choosing a non-local task task_200901141908_0059_m_002324
2009-01-15 12:21:02,183 INFO  mapred.JobTracker (JobTracker.java:createTaskEntry(1408)) - Adding task 'attempt_200901141908_0059_m_002324_0' to tip task_200901141908_0059_m_002324, for tracker 'tracker_xxxx:localhost.localdomain/127.0.0.1:50961'
2009-01-15 12:21:02,183 INFO  mapred.JobInProgress (JobInProgress.java:addRunningTaskToTIP(1292)) - Choosing rack-local task task_200901141908_0059_m_002324
2009-01-15 12:21:02,282 INFO  mapred.JobInProgress (JobInProgress.java:completedTask(1910)) - Task 'attempt_200901141908_0059_m_002292_0' has completed task_200901141908_0059_m_002292 successfully.
2009-01-15 12:21:02,282 INFO  mapred.ResourceEstimator (ResourceEstimator.java:updateWithCompletedTask(78)) - measured blowup on task_200901141908_0059_m_002292 was 15000/1 = 15000.0

So the measured blowup is very large.

The newly computed estimate then also becomes large, and the scheduler finds no room for the map task:
2009-01-15 12:21:20,944 INFO  mapred.ResourceEstimator (ResourceEstimator.java:updateWithCompletedTask(90)) - new estimate is blowup = 819.2585444888223
2009-01-15 12:21:20,945 WARN  mapred.JobInProgress (JobInProgress.java:findNewMapTask(1652)) - No room for map task. Node /default-rack/xxxx has 132707577856 bytes free; but we expect map to take 345805904529
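
The rejection in the WARN line comes down to a comparison between the node's free space and the expected map output size. The sketch below replays that comparison with the two byte counts from the log. The class name, the method name, and the exact form of the comparison (>= rather than >) are assumptions for illustration. How JobInProgress derives the expected size from the blowup estimate is not shown in the log and is not modelled here.

```java
public class RoomCheckSketch {
    // Hypothetical helper: a node has room if its free space covers
    // the expected map output size.
    static boolean hasRoom(long freeBytes, long expectedMapOutputBytes) {
        return freeBytes >= expectedMapOutputBytes;
    }

    public static void main(String[] args) {
        long freeBytes = 132707577856L;     // "bytes free" in the WARN line
        long expectedBytes = 345805904529L; // "we expect map to take" in the WARN line
        System.out.println(hasRoom(freeBytes, expectedBytes)); // false
    }
}
```

Under this assumption a node with roughly 132 GB free is turned down, although the job is only running "cat" through streaming.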

> resource estimation works badly in some cases
> ---------------------------------------------
>
>                 Key: HADOOP-3961
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3961
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.19.0
>            Reporter: Ari Rabkin
>            Assignee: Ari Rabkin
>            Priority: Blocker
>             Fix For: 0.19.0
>
>         Attachments: resourceEstFix.patch
>
>
> The disk space estimation introduced in HADOOP-657 performs badly under some 
> circumstances.  In particular, if maps have very small output, the initial 
> estimate of output size equals input size throws everything off.
> Fix is to use a smaller initial estimate, and to update it appropriately.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
