[jira] [Commented] (TEZ-2317) Successful task attempts getting killed

Bikas Saha (JIRA) Wed, 15 Apr 2015 15:44:45 -0700

    [ 
https://issues.apache.org/jira/browse/TEZ-2317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14497163#comment-14497163
 ]


Bikas Saha commented on TEZ-2317:
---------------------------------

bq. Optimize by not sending a commit go/no-go request if there is no hdfs 
output (DataSink) involved. In the above case, it is always intermediate output
Fix in Pig
bq. Handle the commit go/no-go request after processing events in the event 
queue. May be something like ask the task to come back after some time
In this jira
bq. We saw that for 3058 KilledTaskAttempts TA_KILL_REQUEST events was 383519. 
This is way high.
That is because each canCommit request from the task was resulting in a kill 
event being enqueued. Not killing (in this jira) will fix that.
bq.In the attached AM-taskkill.log which has grepped statements for a single 
task that was killed, it has 327 repeats of below message. Need to see why so 
much and fix that.
The log happens for each canCommit call from the task that gets denied because 
the AM task state is not running. Can change to debug in this patch. The pig 
processor is calling canCommit every 100ms.

> Successful task attempts getting killed
> ---------------------------------------
>
>                 Key: TEZ-2317
>                 URL: https://issues.apache.org/jira/browse/TEZ-2317
>             Project: Apache Tez
>          Issue Type: Bug
>            Reporter: Rohini Palaniswamy
>            Assignee: Bikas Saha
>             Fix For: 0.7.0
>
>         Attachments: AM-taskkill.log
>
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (TEZ-2317) Successful task attempts getting killed

Reply via email to