[ 
https://issues.apache.org/jira/browse/MAPREDUCE-7276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wang Yan updated MAPREDUCE-7276:
--------------------------------
    Description: 
When running a hive job with HFDS disk quota specified, and enable the fast 
fail feature to fast fail a job when hdfs disk quota limitation is exceeded 
(mapreduce.job.dfs.storage.capacity.kill-limit-exceed=true), the job is 
expected to fail fast when one task attempt fails due to the quota limitation.

But I confirmed that the fast fail feature is not working. This seems to be a 
bug of TaskAttemptImpl(application master) not handling fast fail properly 
during theĀ FAILED_FINISHING_TRANSITION event.

  was:
When running a hive job with HFDS disk quota specified, and enable the fast 
fail feature to fast fail a job when hdfs disk quota limitation is exceeded 
(mapreduce.job.dfs.storage.capacity.kill-limit-exceed=true), the job is 
expected to fail fast when one task attempt fails due to the quota limitation.

But I confirmed that the fast fail feature is not working. This seems to be a 
bug of application master not handling events properly during finishing failure 
a task.


> fast fail is not working when a task finishes with failure
> ----------------------------------------------------------
>
>                 Key: MAPREDUCE-7276
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7276
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: applicationmaster
>    Affects Versions: 3.2.0, 2.9.2
>         Environment: hadoop 2.9.2, hive 2.3.2 and hive 0.13
>            Reporter: Wang Yan
>            Priority: Minor
>
> When running a hive job with HFDS disk quota specified, and enable the fast 
> fail feature to fast fail a job when hdfs disk quota limitation is exceeded 
> (mapreduce.job.dfs.storage.capacity.kill-limit-exceed=true), the job is 
> expected to fail fast when one task attempt fails due to the quota limitation.
> But I confirmed that the fast fail feature is not working. This seems to be a 
> bug of TaskAttemptImpl(application master) not handling fast fail properly 
> during theĀ FAILED_FINISHING_TRANSITION event.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org

Reply via email to